Gene Tery_2116 details

Gene Information

Locus tag: Tery_2116
Symbol:
ID: 4243952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3305146
End bp: 3306342
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 39%
IMG OID: 638107223
Product: FO synthase subunit 2
Protein accession: YP_721824
Protein GI: 113475763
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
            [TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit
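
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3305146, 3306342)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```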


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACTACTT TCAATTTGAC AAACCAAAAC CTAGAAACAA ACCTGGAGAC AATTTTAAAC 
CGTGCTCTCC AAGGTTATGA TCTATCTCCA GCAGAGACAC TTTTATTGTT ATCCCCAACT
ACGAAACCTA ATTTGGGAAA ACTTGCCTTA ACTGAACTTC CAGCAGAAAT AACCGCTATC
CAGAAAACAG CAGACCAACT CCGACAACAA CAAGTAGGTG ATACGGTAAC TTACGTGATC
AACCGTAATA TAAATTTCAC CAATATCTGT GAACAACATT GTAGTTTTTG TGCATTTCGC
CGAGACGACG GGAAAACAGG TGCATTTTGG TTAGACATCA ATCAAATTTT AGCAAAAGCT
AACGATGCGG TGCAACGTGG AGCCACAGAA ATTTGTATGC AGGGAGGATT AAATCTTCAA
GCTAAAGTTG CAGGAAAATC TTTACCTTAT TATTTGCAAC TGGTAAGAGA GATTAAAAAT
GAGTTTTCCC ATTTGCACTT ACATGCTTTT TCTCCCCAGG AAGTTCAGTT TATTGCTAGG
GAGGATGGGG TGAGTTATGA ATATGTAATT GCTGCTTTAC GGGATGCAGG GGTACATTCA
ATGCCCGGTA CTGCTGCAGA AGTTTTGGAT GATGCAGTCA GACGAATTAT TTGCCCGGAA
AAAATTGATA CAGGAACTTG GTTAGAAATA GTGGGTACAG CCCACCGGTT GGGAATGCCA
ACCACAAGTA CTATGTTATG CGGTCATATT GAAACCCCTA AACAGCAGAT TTTACATTTA
GAGAGATTGC GATCGCTACA ACAAACTGCT ATTGAAAAAG ATTATCCAGC AAGAATAACA
GAATTTATTT TATTACCATT TGTGGGACAA GAAGCACCTG CACCTTTACG TCGGAGGGTA
GGGCACGACC AACCTATTTT GTTAGATGTT TTATTGTTAA CAGCGGTGTC GAGAATATTC
TTAGGAAATT GGATTATTAA TCATCAACCC AGTTGGGTGA AAATTGGTTT AGATGGAGCA
AAAGAGGCAT TAAAGTGGGG TTGTAATGAT ATTGGTGGGA CTTTAATGGA AGAACATATT
ACTACAATGG CTGGTGCTAT TGGAGGTACT TTTATGGAAG TCAAAAATTT ACAGGAAGCT
ATTACAAGTT TGGGGAGAAA CTATCAACAA AGAGATACTC TTTATAAATA TTTGTAG
 
Protein sequence
MTTFNLTNQN LETNLETILN RALQGYDLSP AETLLLLSPT TKPNLGKLAL TELPAEITAI 
QKTADQLRQQ QVGDTVTYVI NRNINFTNIC EQHCSFCAFR RDDGKTGAFW LDINQILAKA
NDAVQRGATE ICMQGGLNLQ AKVAGKSLPY YLQLVREIKN EFSHLHLHAF SPQEVQFIAR
EDGVSYEYVI AALRDAGVHS MPGTAAEVLD DAVRRIICPE KIDTGTWLEI VGTAHRLGMP
TTSTMLCGHI ETPKQQILHL ERLRSLQQTA IEKDYPARIT EFILLPFVGQ EAPAPLRRRV
GHDQPILLDV LLLTAVSRIF LGNWIINHQP SWVKIGLDGA KEALKWGCND IGGTLMEEHI
TTMAGAIGGT FMEVKNLQEA ITSLGRNYQQ RDTLYKYL
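
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 60 bases of the gene are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_60 = clean("ATGACTACTT TCAATTTGAC AAACCAAAAC CTAGAAACAA ACCTGGAGAC AATTTTAAAC")
print(translate(first_60))  # MTTFNLTNQNLETNLETILN
```

Passing the full cleaned gene sequence to `translate` and `gc_fraction` in the same way would allow the listed protein sequence and GC content to be checked against the nucleotide sequence.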