Gene Tery_4462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4462 
Symbol 
ID4246115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6880579 
End bp6882291 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content38% 
IMG OID638109345 
Productbifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II/unknown domain fusion protein 
Protein accessionYP_723922 
Protein GI113477861 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAAAC AACAAAACCA AAAATTTAAA TTTGACCCCA TTGATACTGC CCTAGCAGAC 
ATTAAAGCAG GTAAGTGCGT CGTAGTAGTT GATGACGAAC ATCGAGAAAA CGAAGGAGAT
GTTATTTGTG CAGCTCAATT TGCTACTCCA AACATGATTA ACTTTATGGC AGTAAAAGCC
AGAGGTCTAA TTTGTCTCGC ATTGACAGGC GATCGCCTCG ACCAACTAGA AGTTCCACTG
ATGGTTACCA ACAATACTGA CAGTAACCAA ACCGCTTTCA CCGTTAGTAT TGATGCCTCC
CCAGAATTGG GAGTCTCCAC AGGTATCTCA GCAGAAGACC GAGCCCGAAC TATCCAAGCT
GTTATTAATT CTAACACCAA ACCAGAAGAT TTACGTCGCC CCGGTCATGT ATTCCCCATT
CGAGCCAAAG AAGGAGGAGT CTTAAAAAGA GCAGGTCATA CAGAAGCTTC CATTGACTTA
GCTAGACTAT CAGGTTTATA TCCAGCCGGA GTTATTTGTG AAATTCAAAA CCCAGATGGT
TCCATGGCAA GATTACCTCA ACTAGTAGAA TATGCCCAAA CTCATAACCT CAAACTCATT
AGTATTGCCG ACATCATCAG TTATCGCATC AAACACGATC GCTTTGTCTT CCGAGAAGCT
GTTGCTAAAT TACCCTCTCA ATTCGGTAAC TTCCAAATTT ATGCTTACCG CGATACTCAA
AATAATTTAG AACATATAGC TATTGTTAAA GGCAACCCAG CAGAATTTTC CCAAAGAGAT
ATAATGGTGC GGGTCCACTC TGAGTGCTTA ACTGGAGATG CTTTTGGTTC TCTGCGTTGT
GACTGTAGAA TGCAATTACA AGCAGCAATG AAAATGATTG AACATGCAGG TGCAGGAGTC
GTTGTTTATT TACGACAGGA GGGCAGAGGG ATAGGGTTAG TTAATAAACT TAAAGCCTAT
TCATTGCAAG ACATAGGATT AGATACTGTG GAAGCTAATG AAAGACTAGG CTTTCCAGCA
GATTTACGCA ACTATGGTGT AGGAGCCCAA ATATTACATG ATCTAGGGGT CAATAAAATG
CGTTTAATTA CAAATAACCC CCGTAAAATA GCAGGTTTAC ATGGTTATGG TATTGAAATA
GTAGATCGAG TACCTTTGTT AATTGAAACG ACAGATTATA ATTCTGCTTA TCTAGCCACT
AAGGCTCAAA AATTAGGTCA TATTTTGTTA CGGAGTTATT TAGTAACAAT TGCTATTAAT
TGGAATAATC AAAAAATCGA AGAAGAAACT TTCGATAATT CTGATCATAT AAATATGAAG
TCTTTAGCTC AACAGCGGTA TCAATATTTA GAAAAACTAC GCAGTTTGAT CAAAGAATAT
GATTTTCTGT TGCAGGAAGA AACAAGGCCA GTAGCAACTG CAGTCTTTGC CCAAGCTCCT
TTAATCGTTA ATTTTGGTTT AGAACAAGCA ACATTAACTA CATCTAAATG GTATCAAGAA
TCAAATAATC CTTATTTGGT AGCGATCGCT AAAGTCTTAA CAGAAATAGC CCAATGGCAG
AATATGTTAA AATTAGAATT TATCATTGCT TCTGGTTTAG ATCCTATGAT CGCCTTACAG
ATAAAACTAG AACGTCAGAC CTTAGAAATT ACTGAATTAT CCACAGCTAT GGAACATTTA
GAAACACAAA AAATTTACAG TTTAAAAATT TAG
 
Protein sequence
MNKQQNQKFK FDPIDTALAD IKAGKCVVVV DDEHRENEGD VICAAQFATP NMINFMAVKA 
RGLICLALTG DRLDQLEVPL MVTNNTDSNQ TAFTVSIDAS PELGVSTGIS AEDRARTIQA
VINSNTKPED LRRPGHVFPI RAKEGGVLKR AGHTEASIDL ARLSGLYPAG VICEIQNPDG
SMARLPQLVE YAQTHNLKLI SIADIISYRI KHDRFVFREA VAKLPSQFGN FQIYAYRDTQ
NNLEHIAIVK GNPAEFSQRD IMVRVHSECL TGDAFGSLRC DCRMQLQAAM KMIEHAGAGV
VVYLRQEGRG IGLVNKLKAY SLQDIGLDTV EANERLGFPA DLRNYGVGAQ ILHDLGVNKM
RLITNNPRKI AGLHGYGIEI VDRVPLLIET TDYNSAYLAT KAQKLGHILL RSYLVTIAIN
WNNQKIEEET FDNSDHINMK SLAQQRYQYL EKLRSLIKEY DFLLQEETRP VATAVFAQAP
LIVNFGLEQA TLTTSKWYQE SNNPYLVAIA KVLTEIAQWQ NMLKLEFIIA SGLDPMIALQ
IKLERQTLEI TELSTAMEHL ETQKIYSLKI