Gene PHATRDRAFT_19937 details


Gene Information

Locus tag: PHATRDRAFT_19937
Symbol:
ID: 7200572
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 298040
End bp: 301515
Gene Length: 3476 bp
Protein Length: 1012 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002179824
Protein GI: 219118084
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTCAA CCAACAAAAC CGAAACTATT CAGTGGTGCT CGGATGCCTT GCACGATTTG 
TTGGGCTTTG CCGACACGGC GTTGGCTTCG TACCTGGTCA GCGTTGCGAA GAAGGCAACA
CAATCGTCGG AAATCGTCCA GATCCTCGTG GATGGAGATG TACGAGACGT GACACCGGAA
CGCATGGAAA GATTTGCTGA GCAATTGCTC TCGCACGCTC GACCGACACC GAAGCAAAGC
CACGGCGGAC CTGCTTCTCG ACAAGCAAAG GCCATTCACA GTCAAACAAA AACGAACGCG
GACTGGGTCA AGGCGGCTTC CAGCTATCAA CTGATTGATG TAGAGATCAG CGAAGAACCG
TCTAATCTGA ATAAACCAAG CGACAGACGG AAGGGAAAGA AAGACAGACA GGATAAAAGG
GATTCTTCCC TTTCCGAACG TGTTGGCGAA AAGTCGCGTC GAAGAAAACG GCAGTACCGC
GATGGTGATT CTGGTAGCAG TGACAGCAGC AATGCAGAAG ACGGAGCTGG GGCTAGAGTA
GCTGAACGGT ACCGTCGGAA AGCCGAAGAG CGTCGAGAGC GGAGAAGGCA TCGCCAAGTG
GAGTCAGCTC TTACTCCTGC AGAGCGCGTG GAACTAGAAA GGGAAAAGGA TCTGAAGGAA
CGAGACGAGC TTGTACAGCG CATGATGGAA CGCGATCAGA CAAAAACAAA ACAAAAGGCA
AAGTCAGAAG AAAAATCTGA TTCTGTTCAA AACCTAGCAG AAATCGAGGA GAGGCTAGCA
AAGGGAGAAC CAATGTATGA TGATGCTACA GGGAATGAGT TAACGTTGGA GCGACTCCGC
GAGGAGAGCC GTCGTGCGTA TCTGAAGAAG CGAGAAGAGC GTGAGCTAGC CCTATTGAAA
CAGTCGCTTC AGGATGAAGA GGATCTTTTT AGAGGCGCAA AATTAACGGA AGCAGAAAAA
AAGCGAATTC AGATGGGGAA GCAGATCCTT AGCATGGTTG AGGAGAGAGA CGGTGAGGAA
GACAAGGATG ATGAATTTTA TCGGTTACCT GGGGACTTTC ATGAGAAACA CTCAAGGGCT
AAACAGCAAG AAGCATTGCT GACGTCCCGC TATAACGAGC CTAAACTCGA AAAATCGGAA
CAGGATCTGT GGGAAGAGTC GCAAACGCAA AAGGCTGGTG CTATTGGCGG GCGCCAGAAG
AAGGCGATTG AATCAGACGG TTACGAGTTG TTGTTCGACG ATCAGATCGA TTTTGTTATG
CAGGAAACTA GAGAAGGCTA TGACAAACGT GGCAAGAGAC ACAAGTTGCG AGACCATACT
CGTCAAATTA GAGACGAGAG TCTGTCAGAG ATGCGTCCAG CAACTGAGCA TGAAAAGATT
CTGGAAGGGC GTACCAAACT TCCTGTTTAC GCTTATCGCG AAGAATTCCT AGCTGCAGTC
AAAGAGCATC AGATTCTCAT CTTAGTGGGA GAAACGGGCT CGGGTAGGTC TTAGTCTGTA
ACTTTTGGCT TTTGGGAATC TAGTTGCGAA TGTCTCTGAT TCTATATCCG TTCTTTCTTA
CTTTTAGGCA AAACGACACA AATTCCTCAA TTTCTCAACG AAGTTGGATA TGGTGAGCTG
GGGAAAATTG GTTGCACGCA GCCTCGGCGC GTCGCTGCAA TGAGTGTGGC AGCTCGTGTC
GCGCAAGAAA TGAACGTCAG GCTCGGGCAC GAAGTTGGCT ACTCCATTCG ATTCGAGAAT
TGCACAAGCC CCAAGACGAT TCTCCAGTAC ATGACGGACG GTATGCTTCT GAGGGAAATT
TTGACCCAAC CAGATTTGGC GAGCTACTCA TGCATGGTAA TCGACGAAGC ACATGAGCGC
ACGCTACATA CGGATATACT TTTTGGTCTC GTCAAGGACA TTGTGCGTTT CAGAAGTGAT
CTTAAACTCA TCGTCAGTAG TGCAACGCTT GATGCCGAAA AATTCTCGAA GTATTTTGAC
GATGCCAGCA TTTTCATGAT TCCCGGTCGT ATGTTTCCAG TCGATACATA TTACACAAAA
GCCCCGGAAG CTGACTATGT TGACGCGGCG GTTGTCACCG TGCTACAGAT ACATGTATCC
CAGCCGCTCA ACGGAGATGT GCTAGTATTT TTGACCGGTC AAGAGGAAAT CGAGACTGCG
GCCGAAACCT TGTCCGAGCG TTCGAAAAAC CTTGGCTCTC GCATACCTGA GCTAATCATT
TGTCCGATTT ACGCCAACCT TCCCTCAGAG CAGCAAGCGA AAATCTTTGA AAAGACTCCA
AGCGGTGCTC GCAAAGTAGT TCTTGCTACA AATATCGCAG AGACAAGCCT TACAATTGAC
GGGATCTGTT ACGTGATAGA TACTGGGTTC AATAAACAAA AAACATATAA TGCCAGATCT
GGCATGGAAT CTCTGGTCGT AACTCCCATT TCACAAGCAG CCGCTAACCA ACGAGCTGGT
CGAGCAGGGC GGACGCAACC AGGCAAGTGT TTTCGGCTCT TTACAGCATG GTCTTTCCAA
CATGAACTTG AACCAAACAC CGTGCCGGAG ATATTACGGA CGAACATGGG AAACGTTGTT
TTAATGTTGA AGAGTCTCGG AATCAACGAT CTTTTGAATT TTGACTTCAT GGACCGGCCT
CCTGCCGATG CTTTGATAAG AGCTCTTGAA CAGCTGTACG CCCTCGGTGC GCTCAATGAT
CGGGGAGAAT TGACAAAACT CGGTCGTCGA ATGGCAGAAT TTCCTTTGGA TCCTATGCTA
AGTAAATCTG TAATTGTGTC CGAAAAGTAT GAATGCACAT CCGAGGTGCT GTCGACCGTC
GCGATGCTTT CTCTAGGTGC ATCGGTTTTC TATCGGCCAA AAGAAAAGGC AGTACATGCC
GACACGGCGC GACTTAATTT TGCCCGCGGT GGTGGAGGTG ACCATATCGC TCTGCTTCGA
TGTTACTCTG AATGGGCAGC ATCTGACTTC AGTCCTTCTT GGTGCTTCGA AAATTTTGTT
CAAGTCAAGA ACATTAAAAA AGCCCGTGAC ATTCGGGAGC AGCTAGCAGG ACTTTGTGAT
CGTGTAGAGA TTGATCATAC AGTTTCGAAT TCTGACGATT TCGACGCTAC TCTGAAAACA
ATTACTGCTG GTTTCTTTTA CAACATTGCG AAACTTGGTC GTACTGGAGA GTATCAGACA
GCGAAGCAGC ACAAGACTGT GTATATTCAT CCTAGCAGCG TAATGGCAAA AGAGGAAGAG
CCGCCACCGT GGCTAGTATT TTTTGAGCTT ACCTTTACAA CAAAGGAATT CATGAGACAG
GTAGCCCCTA TCAAGCCATC GTGGTTGGTT GAAATTGCAC CGCACTATTA TCAAGAAACT
GATATCGAAG ATTCGAAGAC CAAAAAAATG CCGAGAACGA GACGCAATTG ATGATTTGCT
GTTTCACCTG CAAATTAGAA TACTGTTTGG AATTCTTGTT TTTTTTTAGG ATGGTG
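
The gene sequence above is laid out in 10-base groups, 60 bases per line, so it needs to be joined before its length or GC content can be checked against the Gene Information fields. The following is a minimal Python sketch of that check, not the method this page used. The function names (clean_sequence, gc_percent, span_length) are hypothetical, and the coordinate formula assumes that Start bp and End bp are both inclusive, which is consistent with the listed Gene Length of 3476 bp.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence (spaces, line breaks) into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # First line of the gene sequence as shown on this page.
    first_line = (
        "ATGCCGTCAA CCAACAAAAC CGAAACTATT "
        "CAGTGGTGCT CGGATGCCTT GCACGATTTG"
    )
    seq = clean_sequence(first_line)
    print(len(seq))                     # 60
    print(span_length(298040, 301515))  # 3476
```

Passing the full sequence block to clean_sequence and then to gc_percent gives a value that can be compared with the GC content field, which appears to be rounded to a whole percent.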
 
Protein sequence
MPSTNKTETI QWCSDALHDL LGFADTALAS YLVSVAKKAT QSSEIVQILV DGDVRDVTPE 
RMERFAEQLL SHARPTPKQS HGGPASRQAK AIHSQTKTNA DWVKAASSYQ LIDVEISEEP
SNLNKPSDRR KGKKDRQDKR DSSLSEPLTP AERVELEREK DLKERDELVQ RMMERDQTKT
KQKAKSEEKS DSLTLERLRE ESRRAYLKKR EERELALLKQ SLQDEEDLFR GAKLTEAEKK
RIQMGKQILS MVEERDGEED KDDEFYRLPG DFHEKHSRAK QQEALLTSRY NEPKLEKSEQ
DLWEESQTQK AGAIGGRQKK AIESDGYELL FDDQIDFVMQ ETREGYDKHE SLSEMRPATE
HEKILEGRTK LPVYAYREEF LAAVKEHQIL ILVGETGSGK TTQIPQFLNE VGYGELGKIG
CTQPRRVAAM SVAARVAQEM NVRLGHEVGY SIRFENCTSP KTILQYMTDG MLLREILTQP
DLASYSCMVI DEAHERTLHT DILFGLVKDI VRFRSDLKLI VSSATLDAEK FSKYFDDASI
FMIPGRMFPV DTYYTKAPEA DYVDAAVVTV LQIHVSQPLN GDVLVFLTGQ EEIETAAETL
SERSKNLGSR IPELIICPIY ANLPSEQQAK IFEKTPSGAR KVVLATNIAE TSLTIDGICY
VIDTGFNKQK TYNARSGMES LVVTPISQAA ANQRAGRAGR TQPGKCFRLF TAWSFQHELE
PNTVPEILRT NMGNVVLMLK SLGINDLLNF DFMDRPPADA LIRALEQLYA LGALNDRGEL
TKLGRRMAEF PLDPMLSKSV IVSEKYECTS EVLSTVAMLS LGASVFYRPK EKAVHADTAR
LNFARGGGGD HIALLRCYSE WAASDFSPSW CFENFVQVKN IKKARDIREQ LAGLCDRVEI
DHTVSNSDDF DATLKTITAG FFYNIAKLGR TGEYQTAKQH KTVYIHPSSV MAKEEEPPPW
LVFFELTFTT KEFMRQVAPI KPSWLVEIAP HYYQETDIED SKTKKMPRTR RN