Gene PHATRDRAFT_40611 details

Gene Information

Locus tag: PHATRDRAFT_40611
Symbol:
ID: 7198386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 448979
End bp: 452395
Gene Length: 3417 bp
Protein Length: 1138 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002184616
Protein GI: 219128850
COG category:
COG ID:
TIGRFAM ID:
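
The length fields above can be cross-checked against the coordinates. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in a single stop codon; the variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 448979
end_bp = 452395

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced (the record says so) and the last
# codon is a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 3417
print(protein_length)   # 1138
```

Both results agree with the Gene Length (3417 bp) and Protein Length (1138 aa) fields.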


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.101521
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT 
GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT
TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACAAACTAG ATGCCCTTGG ATATGTGCCT
CCAGCGAGTC CTGACACCCA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA
CAGATCCTTC GTCATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT
TTGGAAAATT CGGAGCTTGT TGCATTGACT AGTGGAGATT TCATTTTATA CCGACGCTCA
GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACCATCAGTC CTTCTCTGAA CAACCAGTTA
AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTCA AGCGAGGAGT CAAGCGTGAC
AAGACCCACT ATCCTATCCT TAAGGACGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC
GTGGTCACCG CGGTATCCCA TAACGTCGAA AAGGTACTTG ACCCATCCTA TGCACCGACG
GACCCCTCAG AGAAGTCTCT CTTTGAGCAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA
CATACACTTC AGACAGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT
GCTCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT
GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA
ACGGCGGAAG GGTTCATCCT TCACTGGAAG AACCATCTTC GCATCTACAA TGATATGGTT
CCTATGGCAG AGCAGTTGCC TAAACAGCTT TGCCTCAGTT TGCTTGAAAA CGCTGTACAC
GACATCCCTG AACTCCGCCA GGTCAAGATC ACCGCTACTT TAGACTTAGC TAAAGGAGGC
ACTCCCCTCA ACTACGAAGG TTACCTGAGT CTATTGCTTG CATCTGCTTC TCTATACGAT
AAAGGGAACA ACCTTTCCAA TTCTCGTAGT GTCAAGAGCA AGCGTAGCGC CTTTCTGACC
GACCTCTCGT ATGATCAACC GGACTTCACC GAAGACAATG GAATTGACTA TGATATTGAT
CTCTCTCCTG CAGTGATCTA TGAGGCCAAT ACTCACAACC GCAAAGTCAG TCCATCTGGC
CACCGTAATC GCGATCCGGC AACCAATCGA GAGCGTCCGT ATATCCCTCG CGAGATGTGG
AATCAGCTTT CAGATGATGC CAAAGCCATT CTCCAAGGCC TGTCCGCACC CGACAAAGGC
CCTACTCGAT CCGGCGATGT CTCGCAACGT GCGTTGGAAG CGAATACCCA CGCCAAGATA
TCGAACGATA ATGGCGAGTT CAACCGTAGC GAACCAGACA ACCAGCAAGC TGAAGCATTC
CATGACTGTG ATCAAACGAC GGAGCTCCTT GCACACTTGA CTGACCGTGT GAGTCACATG
GGAGACGGCG ATATCCAAAA AGTCCTTGCT GCATCCCGCC GTACACCAAT CAATTGTACC
CAGTCATCGG ACAATCGACA ACAGTCTGTT CAACTCAACG TTCTGGAATA TCAAGTCTCT
CGTCATTCCG TTGAGAACAA AACTGCTGCT CTAGTCGATC GAGGTGCCAA CGGTGGACTT
GCTGGCTGTG ATGTCAAAGT TGTGAACAAG ACAGGACGGT CTGCTAGTAT AACGGGTATC
AACGAGCATA CCCTGTCAGA TTTGGATATT GTCACTGCCG CTGGGTTTGT CGAGTCTCAC
AAAGGCCCTA TCATTGTGAT TATGCACCAA TACGCCTATC TTGGCAAGGG AAAGACCATC
CACTCCAGTG CCCAACTTGA GCATTACCGA AACACAGTCG AAGACCGGTC TCGCAATGTT
GGAGGACAAC AGCGGATTGT TACCTTGGAT GATTATATCA TTCCTCTTCA TGTTCGACAA
GGCCTCCCGT ATATGGATAT GCGACAGCCT ACCGATAGCG AGTTCGAATC TCTTCCGCAT
GTTGTGTTGA CTTCCGATAT TGACTGGGAC CCTTCTATTC TAGACAATGA AGTTGACATG
GTGAACGACT GGTACGATGC AATGCAAGAT CTTCCGGGCA ATGCCTATGT TGAACCACGA
TTTGACAACA CAGGCCAATA CCTCCACCGC CATATAGCGT ACTATGATCT CGATCGTGAG
GACGCTATTG ATTGCATTAT CCAGTGTCGT AAGCACAATG TCAAACGCAA TGAACGGGAT
TATGAAGCAT TACGTCCCTG CTTGGGATGG GTATCCGGTG ACACTGTCCG AAAAACCATC
ATGGCTACGA CACAGTACGC TCGCGAAGTC TACAATGCAC CGCTACGAAA GCACTTCAAA
TCGCGATTCC CGGCTCTAAA TGTGCATCGG CGCAACGAGG CTGTTGCAAC GGATACTATC
TGGTCAGACA CACCTGCTGT TGACAACGGA GCCAAGTTTG CACAACTGTT TGTGGGGAGA
CGTTCCTTAG TCACCGATAT TTATCCCATG AAAACAGACA AGGAGTTCGT CAATGCCCTT
GAAGACAATA TTCGCCATCG TGGAGCTATG GATAAACTTC TGAGTGATCG AGCCCAAGTC
GAAATCAGTA AGAAGGTTGC TGATATTACA CGAGCCTACA ACATTGACCA ATGGCAAAGT
GAACCTCATC ATCAACATCA AAATTTTGCC GAACGCCGTA TTGCTACTAT TGAAGCTAAT
ACCAATAGCG TTCTTAACAA AACCGGTGCT CCTGATTCCA CTTGGCTCTT GTGCATTGCC
TACATCTGCT ATGTCTTCAA CCATTTGTCC CATGAATCTT TGCACGATCG TACACCGCTC
GAAATCCTTC TTGGTAGCAC CCCTGATATC AGCGTACTTC TCCAGTTTCA TTTTTGGGAA
CCGGTGTACT ACCGTATCGA AGATCCATCT TTCCCTTCCG ATGGTACCGA AAAGAGCGGT
CGCTTTGTTG GCATTGCTGA ATCTGTTGGG GATGCTCTCA CTTACAAAAT CCTCACAGAC
GACACCAACA AGATCTTATA CCGCTCTAGT GTGCGTTCCG CATTGAAATC CGGAGAAATC
AACCTACGCC TTACGACACA GGATGGGGAG AGTAATTCTA AGCCTATCAA CTTTGTCAAG
TCGCGTAGAA CTGAAAACAA AAATTCCTAT GCCTTAAAGG ATCTACCCGG TTTCACCCCT
GAGGACCTTA TTGGACGCAC GTTCCTAACC GATACTCAGG ATGATGGGGA GCGTTTTCGT
GCACGTATCA CAAGGAAAAT CTTAGATCCC GACAAGCCCT CTGATGTGCG TTTTTAG
 
Protein sequence
MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NKLDALGYVP 
PASPDTHEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS
ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF
VVTAVSHNVE KVLDPSYAPT DPSEKSLFEQ QKKFVYSALE HTLQTDMGKN LVREHSFDFN
AQEVFRKVVK HYTESASAKI GSSNTLAYLT TAKYGTSWTG TAEGFILHWK NHLRIYNDMV
PMAEQLPKQL CLSLLENAVH DIPELRQVKI TATLDLAKGG TPLNYEGYLS LLLASASLYD
KGNNLSNSRS VKSKRSAFLT DLSYDQPDFT EDNGIDYDID LSPAVIYEAN THNRKVSPSG
HRNRDPATNR ERPYIPREMW NQLSDDAKAI LQGLSAPDKG PTRSGDVSQR ALEANTHAKI
SNDNGEFNRS EPDNQQAEAF HDCDQTTELL AHLTDRVSHM GDGDIQKVLA ASRRTPINCT
QSSDNRQQSV QLNVLEYQVS RHSVENKTAA LVDRGANGGL AGCDVKVVNK TGRSASITGI
NEHTLSDLDI VTAAGFVESH KGPIIVIMHQ YAYLGKGKTI HSSAQLEHYR NTVEDRSRNV
GGQQRIVTLD DYIIPLHVRQ GLPYMDMRQP TDSEFESLPH VVLTSDIDWD PSILDNEVDM
VNDWYDAMQD LPGNAYVEPR FDNTGQYLHR HIAYYDLDRE DAIDCIIQCR KHNVKRNERD
YEALRPCLGW VSGDTVRKTI MATTQYAREV YNAPLRKHFK SRFPALNVHR RNEAVATDTI
WSDTPAVDNG AKFAQLFVGR RSLVTDIYPM KTDKEFVNAL EDNIRHRGAM DKLLSDRAQV
EISKKVADIT RAYNIDQWQS EPHHQHQNFA ERRIATIEAN TNSVLNKTGA PDSTWLLCIA
YICYVFNHLS HESLHDRTPL EILLGSTPDI SVLLQFHFWE PVYYRIEDPS FPSDGTEKSG
RFVGIAESVG DALTYKILTD DTNKILYRSS VRSALKSGEI NLRLTTQDGE SNSKPINFVK
SRRTENKNSY ALKDLPGFTP EDLIGRTFLT DTQDDGERFR ARITRKILDP DKPSDVRF
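
The gene and protein sequences are printed in blocks of ten characters, so the spacing has to be removed before they can be compared. The sketch below shows one way to do that and to translate the DNA. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty; the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record does not state a table).
# Codons are enumerated in TCAG order, matching the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the block spacing and line breaks used on the page."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate in frame 0 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT"
last_line = "GCACGTATCA CAAGGAAAAT CTTAGATCCC GACAAGCCCT CTGATGTGCG TTTTTAG"

print(translate(clean(first_line)))  # MVPATRQMTSGAAYSHFLDN
print(translate(clean(last_line)))   # ARITRKILDPDKPSDVRF
```

The two outputs match the start and end of the protein sequence listed above. The last line can be translated on its own because the 3360 bases before it are a whole number of codons.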