Gene PHATRDRAFT_37404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37404 
Symbol 
ID7202341 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp50225 
End bp53609 
Gene Length3385 bp 
Protein Length1121 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181651 
Protein GI219122643 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.249909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT 
GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT
TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACGAACTAG ATGCCCTTGG ATATGTGCCT
TCAGCGAGTC CTGACACCCA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA
CAGATCCTTC GTCATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT
TTGGAAAATT CGGAGCTTGT TGCATTGACT AGTGGAGATT TCATTTTATA TCGACGCTCA
GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACCATCAGTC CTTCTCTGAA CAACCAGTTA
AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTCA AGCGAGGAGT CAAGCGTGAC
AAGACCCACT ATCCTATCCT TAAGGATGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC
GTGGTCACCG CGGTATCCCA TAACGTGGAA AAGGTACTTG ACCCATCCTA TGCACCGACG
GACCCCTCAG AGAAGTCTCT TTTTGAGGAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA
CATACACTTC AGACAGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT
GCCCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT
GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA
ACGGCGGAAG GGTTCATCCT TCACTGGAAG AACCATCTTC GCATCTACAA TGATATGGTT
CCTATGGCAG AGCAGTTGCC TAAACAGCTT TGCCTCAGTA TGCTTGAAAA CGCTGTACAC
GACATCCCTG AACTCCGCCA GGTCAAGATC ACCGCTACTT TAGACTTAGC TAAAGGAGGC
ACTCCCCTCA ACTACGAAGG TTACCTGAGT CTATTGCTTG CATCTGCTTC TCTATACGAT
AAAGGGAACA ACCTTTCCAA TTCTCGTAGT GTCAAGAGCA AGCGTAGCGC CTTTCTGACC
GACCTCTCGT ATGATCAACC GGACTTCACC GAAGACAATG GGATTGACTA TGATATTGAT
CTCTCTCCTG CAGTGATCTA TGAGGCCAAT GCTCACAACC GCAAAGTCAG TCCATCTGGC
CACCGTAATC GCGATCCGGC AACCAATCGA GAGCGTCCGT ATATCCCTCG CGAAATGTGG
AATCAGCTTT CAGATGATGC CAAAGCCATT CTCCAAGGCC TGTCCGCACC CGACAAAGGC
CCTACTCGAT CCGGCGATGT CTCGCAACGT GCGTTGGAAG CGAATACCCA CGCCAAGATA
TCGAACGATA ATGGCGAGTT CAATCGTAGC GAACCAGACA ACCAGCAAGC TGAAGCATTC
CATGACTGTG ATCAAACGAC GGAGCTCCTT GCACACTTGA CTGACCGTGT GAGTCACATG
GGAGATGGCG ATATCCGAAA AGTTCTTGCT GCATCCCGCC GTACACCAAT CAATTGTACC
CAGTCATCGG ACAATCGACA ACAGTCTGTT CAACTCAACG TTCTGGAATA TCAAGTCTCT
CGTCATTCCG TTGAGAACAA GACTGCTGCT CTAGTCGATC GAGGTGCCAA CGGTGGACTT
GCTGGCTGTG ATGTCAAAGT TGTGAACAAG ACAGGACGGT CTGCTAGTAT AACGGGTATC
AACGAGCATA CCCTGTCAGA TTTGGATATT GTCACTGCCG CTGGGTTTGT CGAGTCTCAC
AAAGGCCCTA TCATTGTGAT TATGCACCAA TACGCCTATC TTGGCAAGGG AAAGACCATC
CACTCCAGTG CCCAACTTGA GCATTACCGA AACACAGTCG AAGACCGGTC TCGCAATGTT
GGAGGACAAC AGCGGATTGT TACCTTGGAT GATTATATCA TTCCTCTTCA TGTTCGACAA
GGCCTCCCGT ATATGGATAT GCGGCAGCCT ACCGATAGCG AGTTCGAATC TCTTCCGCAT
GTTGTGTTGA CTTCCGATAT TGACTGGGAC CCTTCTATTC TAGACAATGA AGTTGACATG
GTGAACGACT GGTACGATGC AATGCAAGAT CTTCCGGGCA ATGCCTATGT TGAACCACGA
TTTGACAACA CAGGCCAATA CCTCCACCGC CATATAGCGT ACTACGATCT CGATCGCGAG
GACGCTATTG ATTGCATTAT CCAGTGTCGT AAGCACAATG TCAAACGCAA TGAACGGGAT
TATGAAGCAT TACGTCCCTG CTTGGGATGG GTATCCGGTG ACACTGTCCG AAAAACCATC
ATGGCTACGA CACAGTACGC TCGCGAAGTC TACAATGCAC CGCTACGAAA GCACTTCAAA
TCGCGATTCC CGGCTCTAAA TGTGCATCGG CGCAACGAGG CTGTTGCAAC GGATACTATC
TGGTCAGACA CACCTGCTGT TGACAACGGA GCCAAGTTTG CACAACTGTT TGTGGGGAGA
CGTTCCTTAG TCACCGATAT TTATCCCATG AAAACAGACA AGGAGTTTGT CAATGCCCTT
GAAGACAATA TTCGCCATCG TGGAGCTATG GATAAACTTC TGAGTGATCG AGCCCAAGTC
GAAATCAGTA AGAAGGTTGC TGATATTACA CGAGCCTACA ACATTGACCA ATGGCAAAGT
GAACCTCATC ATCAACATCA AAATTTTGCC GAACGCCGTA TTGCTACTAT TGAAGCTAAT
ACCAATAACG TTCTTAACAA AACCGGTGCT CCTGATTCCA CTTGGCTCTT GTGCATTGCC
TACATCTGCT ATGTCTTCAA CCATTTGTCC CATGAATCTT TGCATGATCG TACACCGCTC
GAGACTCTTC TTGGTAGCAC CCCTGATATC AGCGTACTTC TCCAGTTTCA TTTTTGGGAA
CCGGTGTACT ACCGTATCGA AGATCCATCT TTCCCTTCCG ATGGTACCGA AAAGAGCGGT
CACTTTGTTG GCATTGCTGA ATCTGTTGGG GATGCTCTCA CTTACAAAGT CCTCACAGAC
GACACCAACA AGATCTTATA CCGCTCTAGT GTGCGTTCCG CTTTGAAATC CGGAGAAACC
AACCTACGCC TTACGCCACA GGATGGGGAG AGTAATTCTA AGCCTATCAA CTTTGTCAAG
TCGCGTAGAA CTGAAAACAA AAATTCCTAT GCCTTAAAGG ATCTACCCGG TTTCACCCCT
GACGACCTTA TTGGACGCAC GTTCCTAACC GATACTCAGG ATGATGGGGA GCGTTTTCGT
GCACGTATCA CAAGGAAAAT CTTAG
 
Protein sequence
MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NELDALGYVP 
SASPDTHEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS
ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF
VVTAVSHNVE KVLDPSYAPT DPSEKSLFEE QKKFVYSALE HTLQTDMGKN LVREHSFDFN
AQEVFRKVVK HYTESASAKI GSSNTLAYLT TAKYGTSWTG TAEGFILHWK NHLRIYNDMV
PMAEQLPKQL CLSMLENAVH DIPELRQVKI TATLDLAKGG TPLNYEGYLS LLLASASLYD
KGNNLSNSRS VKSKRSAFLT DLSYDQPDFT EDNGIDYDID LSPAVIYEAN AHNRKVSPSG
HRNRDPATNR ERPYIPREMW NQLSDDAKAI LQGLSAPDKG PTRSGDVSQR ALEANTHAKI
SNDNGEFNRS EPDNQQAEAF HDCDQTTELL AHLTDRVSHM GDGDIRKVLA ASRRTPINCT
QSSDNRQQSV QLNVLEYQVS RHSVENKTAA LVDRGANGGL AGCDVKVVNK TGRSASITGI
NEHTLSDLDI VTAAGFVESH KGPIIVIMHQ YAYLGKGKTI HSSAQLEHYR NTVEDRSRNV
GGQQRIVTLD DYIIPLHVRQ GLPYMDMRQP TDSEFESLPH VVLTSDIDWD PSILDNEVDM
VNDWYDAMQD LPGNAYVEPR FDNTGQYLHR HIAYYDLDRE DAIDCIIQCR KHNVKRNERD
YEALRPCLGW VSGDTVRKTI MATTQYAREV YNAPLRKHFK SRFPALNVHR RNEAVATDTI
WSDTPAVDNG AKFAQLFVGR RSLVTDIYPM KTDKEFVNAL EDNIRHRGAM DKLLSDRAQV
EISKKVADIT RAYNIDQWQS EPHHQHQNFA ERRIATIEAN TNNVLNKTGA PDSTWLLCIA
YICYVFNHLS HESLHDRTPL ETLLGSTPDI SVLLQFHFWE PVYYRIEDPS FPSDGTEKSG
HFVGIAESVG DALTYKVLTD DTNKILYRSS VRSALKSGET NLRLTPQDGE SNSKPINFVK
SRRTENKNSY ALKDLPGFTP DDLIGRTMMG SVFVHVSQGK S