Gene PHATRDRAFT_39573 details


Gene Information

Locus tag: PHATRDRAFT_39573
Symbol:
ID: 7195376
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 164636
End bp: 166262
Gene Length: 1627 bp
Protein Length: 505 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002183563
Protein GI: 219126646
COG category:
COG ID:
TIGRFAM ID:
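
The length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive (which matches the listed gene length) and that the coding region ends with a single stop codon. The helper names `span_length` and `coding_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 164636
END_BP = 166262
PROTEIN_LENGTH_AA = 505


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def coding_length(protein_length: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return 3 * (protein_length + (1 if include_stop else 0))


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)  # assumes one stop codon
non_coding = gene_length - cds_length

print(gene_length)  # 1627
print(cds_length)   # 1518
print(non_coding)   # 109
```

Under these assumptions the gene span is 109 bp longer than the coding sequence needs to be, which is consistent with the gene being flagged as spliced. The record does not give intron positions, so the sketch says nothing about where that extra sequence lies.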


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.273058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAC AACGACGATG GTCAATCTTG ATTTTTGTTC TATGCGGGCT ACAGCGTATT 
TCTCAAGCGC AGCCTTCTAA AGGTACCAGA AAGCTTCCAC CACTGAGGCA CAGTATATTC
GAGACACGAT CTTGGGGCTC CTCTGAAACG GGGAAGGGGG AAGGGAAGTC CAAGAGCAGT
AAACGAAGCT CTCAGACATC CCATCCGAAA ACACAAGGTA CGCGCCCTAA AAATTGCGAT
TTGCCATGTT CCTTTTTAAA TGGATTTCCA ATTCACTTTC CCCCCTTGAA TTGCAGCATC
GTCAAAAACG GGAAAGGGAA AAGATGGCAG AAGCTCGAAA AAGTCGAAGA GTTCGAAGTC
GAAGAAATCA AGCAAAACAT CATTGAAAAC GTCGACGCCA ACTGCGGTTG TGCAACCTAC
AAATGCTCCG AATACAGATA CTGCAGATCC AACTTCTCGA CCATCCCCGC CAGCTATTTT
TCCCACTGCG ATCCCCTCTG TAAGCCGGCA AACAGTTTCT CCAACACAGG TCGCCGAGAC
TGAGACGCCC ACTCTACTCC CCACGTTTGT ACCCATGTCT TTAGAGGCAA CTGACCAGCC
AACTGTTGCT GAAACAACAG CACCGACCGC CGCTGGACCA ATCGTTGTAA CGGTTATCCC
TACTATATCC CAAACTCGGC CGCCAGCTAC AAATGAGGTC GAGCCAACAC TAAGTCCGCA
CCTTGATACA AGCTCCCCTA CTGCTCTCCT GGCACCAAAC CGAGAGACAT CAACACCTAC
AATACGTTCT ACGACAACAC AAACACCCAC CGGCATAGGA GAGGAAACAG CGACACCGAC
CATAAATTTT AGTGCGACAC AAGCACCAAG CACCACAACA GTCGACACAC CATCACCGAC
GGTAGATTCT ATTGCAACAC AAGTACCAAC CACCATGACA GTGGACTCCA CTTTGCCAAC
CTCAGGACCA TCTACCAGAG CGCCATCAGC AGCCACTGAT ACCGAAGCTT CGCCCTTCGT
TATTACGTAC CAAACAAGCG ATAGTAGGGA ACCAACACCG GAGGAGTTCG ATCAAGCTCA
AGCGGTCACC CTTCAATATC TTGAAGACTT TTTAGTGGGG GAATACGAGT TCAACTTGAT
CACAGCGCTC AACGACGTTT TGGGATTGGC TCTTTCAGAG TCAACAGATC CTTTGGCTGT
AGCGTATGCA ACCACACTAC TGTTTTCGTC AGAATCAGGA TTTATTCCAT CTCAGGAAGA
TATAGATGTG CAGGTTTTTA CCGCATTTCA AGAACAGGCT GTTTTTGATC TCGTAGCAGC
ACTTCAAGGC CTGCCTCCTG AAAATCCCTT CTCATCGACT ACAAGCGTCC AGTTCACTTC
TTTTGTGATC GAAGTAGTGC AAGCATCTGA GACATCCTCA GGAAGTACGG TGGCTTTCGG
AGCTCTTATT GGGATGATCT TGTTCATTTT TGGTTTGCTC GCGTCTCGTG TTGTGTATCG
GAAGCCATCG TACATCTCTG TTGACAACGA CATCCCCCAT AGGGTAATAA TTCCTGGACA
AAGCAACATC ATGGCATTCA TGGACTGTGA AAGTGAAGCA TCTTCTCGCA GAACTTCAGC
GAATTGA
 
Protein sequence
MTTQRRWSIL IFVLCGLQRI SQAQPSKGTR KLPPLRHSIF ETRSWGSSET GKGEGKSKSS 
KRSSQTSHPK TQGTRPKNCD LPCSFLNGFP IHFPPLNCSI VKNGKGKRWQ KLEKVEEFEV
EEIKQNIIEN VDANCVSPTQ VAETETPTLL PTFVPMSLEA TDQPTVAETT APTAAGPIVV
TVIPTISQTR PPATNEVEPT LSPHLDTSSP TALLAPNRET STPTIRSTTT QTPTGIGEET
ATPTINFSAT QAPSTTTVDT PSPTVDSIAT QVPTTMTVDS TLPTSGPSTR APSAATDTEA
SPFVITYQTS DSREPTPEEF DQAQAVTLQY LEDFLVGEYE FNLITALNDV LGLALSESTD
PLAVAYATTL LFSSESGFIP SQEDIDVQVF TAFQEQAVFD LVAALQGLPP ENPFSSTTSV
QFTSFVIEVV QASETSSGST VAFGALIGMI LFIFGLLASR VVYRKPSYIS VDNDIPHRVI
IPGQSNIMAF MDCESEASSR RTSAN
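
The gene and protein sequences above are printed in blocks of ten residues separated by spaces and line breaks, so they need to be joined before lengths or base composition can be checked. The following is a minimal sketch using only the Python standard library. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, used as a short demo.
first_line = "ATGACAACAC AACGACGATG GTCAATCTTG ATTTTTGTTC TATGCGGGCT ACAGCGTATT"
dna = clean_sequence(first_line)

print(len(dna))  # 60
print(f"{gc_fraction(dna):.0%}")
```

Passing the whole gene sequence through `clean_sequence` should give a string whose length can be compared with the listed gene length, and `gc_fraction` gives a value to compare with the listed GC content. The same cleaning step works for the protein sequence, where only the length check applies.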