Gene Tery_1718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1718 
Symbol 
ID4243317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2611093 
End bp2613492 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content38% 
IMG OID638106846 
Producttetratricopeptide TPR_4 
Protein accessionYP_721455 
Protein GI113475394 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTGTAATG ATTTAGGGTT TTTACCTTTG GGGCTGGAGT TGGTGGCGCG TTATTTGGAG 
CGGAAGACTA GTTTATCTTT GGCAAAAACA GGTGAGAGGT TAGGATTAGA ACATAGGTCT
CTCAAGGTGT TTTCTCAGGA TATGACAGGG AGGAGAGGGG TGGCGGCTGC TTTTGAGTTT
AGTTGGCAAG AGTTGGATGA GGATGATCAG GAGTTGGGGT GTTTTTTGAG TTTGTTTGCT
TGTGCTCGAA TTCCTTGGTA TTTGGTGGAG GGATGTTTGC CTGAGGTTGA TGAAGTAGAG
TTGGAGGATA GGCGAGATGA CAAGTTGGTG AATTTAAGTT TGTTGCAATC ACTTGGTGAA
AATTGTTATG AGTTTCATCC ACTTATACGA GAATTTTTCA GAAATAAAAA TAAGTTGCTT
GAGTTTGACT TGGTTGAGGA AATGAAGGGA AATATTTGTG GGGTAATAGC AGATGCGGTA
AGGAAAATTC CTTATGATGA GTATATTACT GTGGAAAAAG TTAAAGAAGT TAAGGTTGAT
ATACCTCACA TTACGGAAGT AGCAGAAAAT TTGGCAGAGT ATTTGAGTGA TGATGATTTG
ATTTTACCTT TTATCAGCTT AGGTGGGTTC TATCATTGTC AAGCATTGTA CTCACTGGCA
CAACCTTGGT TAGATAGAGG TAGAGAAATA GCAGAAAGAC GTCTAGATAA AAACAATGCT
GATATTGCAA CTGTTTATAA CTGTTTGGCA GCATTATATT TTGCACAAGG AAAATACGAA
GCAGCAGAAC CATTGTACCT ACAAGCATTT AAAATCGTTA AAACCGTCCT CCCTGAAAAT
CATCCAAATA TTGCTGTTAG TCTCAACAAC CTGAAAAATT TATATTATTC ACAAGGAAAA
TATGAAGCAG CAGAACCGTT GTTGCTACAA GCAGTTGAAA TCTACAAAAT TGCCCTCCCT
GAAAATCATC CAAATATTGC TGTTAGTCTG AGCAACCTGG CAGAATTATA TCGCGCACAA
GGAAAATATG AAGCAGCAGA ACCGTTGTTG CTACAAGCTA TTGAAATCTA CAAAATTGCC
CTCCCTGAAA ATCCTCCAAA TATTGCTGTT AGTCTGAGCA ACCTGGCAGA ATTATATCGC
GCACAAGGAA AATACGAAGC AGCAGAACCG TTGTTGCTAC AAGCTATTGA AATCTACAAA
ATTGCCCTTC TTGAAAATCA TCCAAATATT GCTGTTAGTC TGAGCAACCT GGCAGAATTA
TATCGCGCAC AAGGAAAATA TGAAGCAGCA GAACCGTTGT TGCTACAAGC AATTGAAATC
TACAAAATTG CCCTCCCTGA AAATCATCCA AATATTGCTG TTAGTCTGAG CAACCTGGCA
GAATTATATC GCGCACAAGG AAAATATGAA GCAGCAGAAC CGTTGTTGCT ACAAGCAATT
GAAATCTACA AAATTGCCCT CCCTGAAAAT CATCCAAATA TTGCTGTTAG TCTGAGCAAC
CTGGCAGAAT TATATCGCGC ACAAGGAAAA TATGAAGCAG CAGAACCGTT GTTGCTACAA
GCAATTGAAA TCTACAAAAT TGCCCTCCCT GAAAATCATC CAAATATTGC TGTTAGTCTG
AGCAACCTGG CAGAATTATA TCGCGCACAA GGAAAATATG AAGCAGCAGA ACCGTTGTTG
CTACAAGCAA TTGAAATCTA CAAAATTGCC CTCCCTGAAA ATCATCCAAA TATTGCTGTT
AGTCTGAGCA ACCTGACAAA TTTATATTAT TCACAAGGAA AATATGAAGC AGCAGAACCG
TTGTTGCTAC AAGCTATTGA AATCTACAAA ATTGCCCTCC CTGAAAATCA TCCAAATATT
GCTGTTAGTC TGAGCAACCT GGCAGAATTA TATCGCGCAC AAGGAAAATA CGAAGCAGCA
GAACCGTTGT TGCTACAAGC AATTGAAATC TACAAAATTG CCCTTCTTGA AAATCATCCA
AATATTGCTG TTAGTCTCAA CAACCTGACA AATTTATATT ATTCACAAGG AAAATATGAA
GCAGCAGAAC CGTTGTTGCT ACAAGCTATT GAAATCTACA AAATTGCCCT TCTTGAAAAT
CATCCAAATA TTGCTGTTAG TCTCAACAAC CTGACAAATT TATATTATTC ACAAGGAAAA
TATGAAGCAG CAGAACCGTT GTTGCTACAA GCTATTGAAA TCTACAAAAT TGCCCTCCCT
GAAAATCATC CAAATATTGC TGTTAGTCTG AGCAACCTGG CAGAATTATA TCGCGCACAA
GGAAAATACG AAGCAGCAGA ACCGTTGTTG CTACAAGCAA TTGAAATCTA CAAAATTGCC
CTTCTTGAAA ATCATCCAAA TATTGCTGTT AGTCTCAACA ACCTGACAAA TTTATATTAG
 
Protein sequence
MCNDLGFLPL GLELVARYLE RKTSLSLAKT GERLGLEHRS LKVFSQDMTG RRGVAAAFEF 
SWQELDEDDQ ELGCFLSLFA CARIPWYLVE GCLPEVDEVE LEDRRDDKLV NLSLLQSLGE
NCYEFHPLIR EFFRNKNKLL EFDLVEEMKG NICGVIADAV RKIPYDEYIT VEKVKEVKVD
IPHITEVAEN LAEYLSDDDL ILPFISLGGF YHCQALYSLA QPWLDRGREI AERRLDKNNA
DIATVYNCLA ALYFAQGKYE AAEPLYLQAF KIVKTVLPEN HPNIAVSLNN LKNLYYSQGK
YEAAEPLLLQ AVEIYKIALP ENHPNIAVSL SNLAELYRAQ GKYEAAEPLL LQAIEIYKIA
LPENPPNIAV SLSNLAELYR AQGKYEAAEP LLLQAIEIYK IALLENHPNI AVSLSNLAEL
YRAQGKYEAA EPLLLQAIEI YKIALPENHP NIAVSLSNLA ELYRAQGKYE AAEPLLLQAI
EIYKIALPEN HPNIAVSLSN LAELYRAQGK YEAAEPLLLQ AIEIYKIALP ENHPNIAVSL
SNLAELYRAQ GKYEAAEPLL LQAIEIYKIA LPENHPNIAV SLSNLTNLYY SQGKYEAAEP
LLLQAIEIYK IALPENHPNI AVSLSNLAEL YRAQGKYEAA EPLLLQAIEI YKIALLENHP
NIAVSLNNLT NLYYSQGKYE AAEPLLLQAI EIYKIALLEN HPNIAVSLNN LTNLYYSQGK
YEAAEPLLLQ AIEIYKIALP ENHPNIAVSL SNLAELYRAQ GKYEAAEPLL LQAIEIYKIA
LLENHPNIAV SLNNLTNLY