Gene PHATRDRAFT_39506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39506 
Symbol 
ID7195182 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp679653 
End bp683037 
Gene Length3385 bp 
Protein Length1121 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183530 
Protein GI219126575 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT 
GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT
TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACGAACTAG ATGCCCTTGG ATATGTGCCT
CCAGCGAGTC CTGACACCAA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA
CAGATCCTTC GACATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT
TTGGAAAATT CGGAGCTTGT TGCATTGACT AGTGGAGATT TCATTTTATA TCGACGCTCG
GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACTATCAGTC CTTCTCTGAA CAACCAGTTA
AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTCA AGCGAGGAGT CAAGCGTGAC
AAGACACACT ATCCTATCCT TAAGGATGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC
GTGGTCACCG CGGTATCCCA TAACGTCGAA AAGGTACTTG ACCCATCCTA TGCACCAACG
GACCCCTCAG AGAAGTCTCT CTTTGAGGAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA
CATACACTTC AGACAGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT
GCCCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT
GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA
ACGGCGGAAG GGTTCATCCT TCACTGGAAG AACCATCTTC GTATCTACAA TGATATGGTC
CCTATGGCAG AGCAGTTGCC TAAACAGCTT TGCCTCAGTT TGCTTGAAAA CGCTGTACAC
GACATCCCTG AACTCCGTCA GGTCAAGATC ACCGCTACTT TAGACTTAGC TAAAGGAGGC
ACTCCCCTCA ACTACGAAGG TTACCTGAGT CTATTGCTTG CATCTGCTTC TCTATACGAT
AAAGGGAACA ACCTTTCCAA TTCTCGTAAT GTCAAGAGCA AGCGTAGCGC CTTTCTGACC
GACCTCTCGT ATGATCAACC TGACTTCACC GAAGACAATG GAATTGACTA TGATATCGAT
CTCTCTCCTG CAGTGATCTA TGAGGCCAAT GCTCACAACC GCAAAGTCAG TCCATCTGGC
CACCGTAATC GCGATCCGGC AACCAATCGA GAGCGTCCGT ATATCCCTCG CGAGATGTGG
AATCAGCTTT CAGATGATGC CAAAGCCATT CTCCAAGGCC TGTCCGCACC CGACAAAGGC
CCTACTCGAT CCGGCGATGT CTCGCAACGT GCGTTGGAAG CGAATACCCA CGCCAAGATA
TCGAACGATA ATGGCGAGTT CAACCGTAGC GAACCAGACA ACCAGCAAGC TGAAGCATTC
CATGACTGTG ATCAAACGAC GGAGCTCCTT GCACACTTGA CTGACCGTGT GAGTCACATG
GGAGACGGCG ATATCCGAAA AGTTCTTGCT GCATCCCGCC GTACACCAAT CAATTGTACC
CAGTCATCGG ACAATCGACA ACAGTCTGTT CAACTCAACG TTCTGGAATA TCAAGTCTCT
CGTCATTCCG TTGAGAACAA AACTGCTGCT CTAGTCGATC GAGGTGCCAA CGGTGGACTT
GCTGGCTGTG ATGTCAAAGT TGTGAACAAG ACAGGACGGT CTGCTAGTAT AACGGGTATC
AACGAGCATA CCCTGTCAGA TTTGGATATT GTCACTGCCG CTGGGTTTGT TGAGTCTCAC
AAAGGCCCTA TCATTGTGAT TATGCACCAA TACGCCTATC TTGGCAAGGG AAAGACCATC
CACTCCAGTG CCCAACTTGA GCATTACCGA AACACAGTCG AAGACCGGTC CCGCAATGTT
GGAGGACAAC AGCGGATTGT TACCTTGGAT GATTATATCA TTCCTCTTCA TATTCGACAA
GGCCTCCCAT ATATGGATAT GCGGCAGCCT ACCGATAGCG AGTTCGAATC TCTTCCGCAT
GTTGTGTTGA CTTCCGATAT TGACTGGGAC CCTTCTATTC TAGACAATGA AGTTGACATG
GTGAACGACT GGTACGATGC AATGCAAGAT CTTCCGGGCA ATGCCTATGT TGAACCACGA
TTTGACAACA CAGGTCAATA CCTCCACCGC CATATTGCGT ACTACGATCT CGATCGCGAG
GACGCTATTG ATTGCATCAT CCAGTGCCAT AAGCACAATG TCAAACGCAA TGAACGGGAT
TATGAAGCAT TACGTCCCTG CTTGGGATGG GTATCCGGTG ACACTGTCCG AAAAACCATC
ATGGCTACGA CACAGTACGC TCGCGAAGTC TACAATGCGC CGCTACGAAA GCACTTCAAA
TCGCGATTCC CGGCTCTAAA TGTGCATCGG CGCAACGAGG CTGTTGCAAC GGATACTATC
TGGTCAGACA CACCTGCTGT TGACAACGGA GCCAAGTTTG CACAACTGTT TGTGGGGAGA
CGTTCCTTAG TCACCGATAT TTATCCCATG AAAACAGACA AGGAGTTTGT CAATGCCCTT
GAGGACAATA TTCGCCATCG TGGAGCTATG GATAAACTTC TGAGTGATCG AGCCCAAGTC
GAAATCAGTA AGAAGGTTGC TGATATTACA CGAGCCTACA ACATTGACCA ATGGCAAAGT
GAACCTCATC ATCAACATCA AAATTTTGCC GAACGCCGTA TTGCTACTAT TGAAGCTAAT
ACCAATAACG TTCTTAACAA AACCGGTGCT CCTGATTCCA CTTGGCTCTT GTGCATTGCC
TACATCTGCT ATGTCTTCAA CCATTTGTCC CATGAATCTT TGCATGATCG TACACCGCTC
GAGACTCTTC TTGGTAGCAC CCCTGATATC AGCGTACTTC TCCAGTTTCA TTTTTGGGAA
CCGGTGTACT ACCGGATCGA AGATCCATCT TTCCCTTCCG ATGGTACCGA AAAGAGCGGT
CGCTTTGTTG GCATTGCTGA ATCTGTTGGG GATGCTCTCA CTTACAAAAT CCTCACAGAC
GACACCAACA AGATCTTATA CCGCTCTAGT GTGCGTTCCG CATTGAAATC CGGAGAAACC
AACCTACGCC TTACGCCACA GGATGGGGAG AGTAATTCTA AGCCTATCAA CTTTGTCAAG
TCGCGTAGAA CTGAAAACAA AAATTCCTAT GCCTTAAAGG ATCTACCCGG TTTCACCCCT
GAGGACCTTA TTGGACGCAC GTTCCTAACC GATACTCAGG ATGATGGGGA GCGTTTTCGT
GCACGTATCA CAAGGAAAAT CTTAG
 
Protein sequence
MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NELDALGYVP 
PASPDTNEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS
ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF
VVTAVSHNVE KVLDPSYAPT DPSEKSLFEE QKKFVYSALE HTLQTDMGKN LVREHSFDFN
AQEVFRKVVK HYTESASAKI GSSNTLAYLT TAKYGTSWTG TAEGFILHWK NHLRIYNDMV
PMAEQLPKQL CLSLLENAVH DIPELRQVKI TATLDLAKGG TPLNYEGYLS LLLASASLYD
KGNNLSNSRN VKSKRSAFLT DLSYDQPDFT EDNGIDYDID LSPAVIYEAN AHNRKVSPSG
HRNRDPATNR ERPYIPREMW NQLSDDAKAI LQGLSAPDKG PTRSGDVSQR ALEANTHAKI
SNDNGEFNRS EPDNQQAEAF HDCDQTTELL AHLTDRVSHM GDGDIRKVLA ASRRTPINCT
QSSDNRQQSV QLNVLEYQVS RHSVENKTAA LVDRGANGGL AGCDVKVVNK TGRSASITGI
NEHTLSDLDI VTAAGFVESH KGPIIVIMHQ YAYLGKGKTI HSSAQLEHYR NTVEDRSRNV
GGQQRIVTLD DYIIPLHIRQ GLPYMDMRQP TDSEFESLPH VVLTSDIDWD PSILDNEVDM
VNDWYDAMQD LPGNAYVEPR FDNTGQYLHR HIAYYDLDRE DAIDCIIQCH KHNVKRNERD
YEALRPCLGW VSGDTVRKTI MATTQYAREV YNAPLRKHFK SRFPALNVHR RNEAVATDTI
WSDTPAVDNG AKFAQLFVGR RSLVTDIYPM KTDKEFVNAL EDNIRHRGAM DKLLSDRAQV
EISKKVADIT RAYNIDQWQS EPHHQHQNFA ERRIATIEAN TNNVLNKTGA PDSTWLLCIA
YICYVFNHLS HESLHDRTPL ETLLGSTPDI SVLLQFHFWE PVYYRIEDPS FPSDGTEKSG
RFVGIAESVG DALTYKILTD DTNKILYRSS VRSALKSGET NLRLTPQDGE SNSKPINFVK
SRRTENKNSY ALKDLPGFTP EDLIGRTMMG SVFVHVSQGK S