Gene TM1040_2129 details

Gene Information

Locus tag: TM1040_2129
Symbol:
ID: 4076443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2234044
End bp: 2235168
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 60%
IMG OID: 638007449
Product: GTP cyclohydrolase II / 3,4-dihydroxy-2-butanone 4-phosphate synthase
Protein accession: YP_614123
Protein GI: 99081969
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase; [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
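
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2234044
END_BP = 2235168
GENE_LENGTH_BP = 1125
PROTEIN_LENGTH_AA = 374


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)    # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the reported fields agree with each other: the span is 1125 bp, and 1125 / 3 - 1 gives 374 residues.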


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTATG AGACGCCCGG TCCAGTCGAG TCTGAGTTGC GCGACGCCAT CAGTCCGATT 
GAAGAGATCA TCGACGCCGC GCGCGCGGGC AAGATGTATA TTCTTGTCGA TCATGAGGAC
CGCGAAAACG AGGGTGATCT GATCATCCCG GCGGAGTTTG CCGACGCGGA TGCGATCAAC
TTTATGGCCA CCTATGGGCG TGGACTCATC TGCCTCCCAA TGACGGCCGA GCGCATTGAT
CGTTTGGGCT TGCCGATGAT GGCGGTGAAC AATTCCTCGC GTCACGAGAC GGCCTTTACC
GTGTCGATCG AGGCGCGCGA AGGGGTCGAT ACCGGGATTT CCGCCGCGGA TCGTGCGCTG
ACCGTGGCCA CGGCGATCAA TGAGCAAAAT ACCATGGCGG CGATTGCAAC GCCGGGCCAT
GTTTTCCCGC TGCGCGCAAA ACGCGGCGGG GTTCTGGTCC GGGCCGGGCA CACCGAGGCC
TCTGTCGATA TTTCGCGCAT CGCGGGCTGT CACCCATCGG CCGTGATCTG CGAGATCATG
AAAGACGATG GCACCATGGC GCGACTGCCG GATCTGGTGG AATTTGCAAA AACCCACGAT
ATGAAAATCG GCACCATCTC GGATCTCATC GCCTACCGCG CCAAGAACGA CAACCTCGTG
GATGAGACCG CCCGCTCTAC CGTTACCTCG GAATATGGGG GCGACTGGGA GATGCGGATC
TTCACCGATC AGACCCATGA TGTGGAGCAT GTTGTCCTGA TCAAGGGCGA CATCACCACG
CCAGAGCCGG TGCTGACTCG CACTCATGCG CTGCATGAGG CGTCCGACTT GCTGGGGCTT
GGTCCCAAAC CCGCTGGGGA ACTGCCGCGC GCGATGGAAT TGATCGCCGA CGAGGGGCGC
GGGATCGTCT GCCTGTTCCG CCAGCCGCGC AACGCGCTCT ATGCCTCCGA CGAGGAAGGG
GTGCGCACCA TCAAACAGAC CGGTCTCGGG GCGCAAATTC TGAATAAACT CGGCGTTGAG
GAACTGATCC TGCTCACCGA CTCGCCGCAA ACCAAATATG TTGGGCTGGA TGCCTATGGG
CTGTCGATTG TCGGCACCCG TCCCATCCTG TCCCGAGACA GCTAA
 
Protein sequence
MSYETPGPVE SELRDAISPI EEIIDAARAG KMYILVDHED RENEGDLIIP AEFADADAIN 
FMATYGRGLI CLPMTAERID RLGLPMMAVN NSSRHETAFT VSIEAREGVD TGISAADRAL
TVATAINEQN TMAAIATPGH VFPLRAKRGG VLVRAGHTEA SVDISRIAGC HPSAVICEIM
KDDGTMARLP DLVEFAKTHD MKIGTISDLI AYRAKNDNLV DETARSTVTS EYGGDWEMRI
FTDQTHDVEH VVLIKGDITT PEPVLTRTHA LHEASDLLGL GPKPAGELPR AMELIADEGR
GIVCLFRQPR NALYASDEEG VRTIKQTGLG AQILNKLGVE ELILLTDSPQ TKYVGLDAYG
LSIVGTRPIL SRDS
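
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the method the database used: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the amino-acid string that NCBI publishes for translation table 11 (the same codon assignments as the standard code). Alternative start codons, which table 11 allows, are not handled here; this gene starts with ATG, so that does not matter for this record.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    first_blocks = "ATGAGCTATG AGACGCCCGG TCCAGTCGAG"
    print(translate(first_blocks))  # MSYETPGPVE
```

Pasting the full gene sequence into `translate` and `gc_content` allows the result to be compared against the Protein sequence and GC content fields of this record.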