Gene Gdia_3537 details


Gene Information

Locus tag: Gdia_3537
Symbol:
ID: 6976989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3869830
End bp: 3870804
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 74%
IMG OID: 643393056
Product: thiamine-monophosphate kinase
Protein accession: YP_002277875
Protein GI: 209545646
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
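
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3869830, 3870804)
print(length, protein_length(length))  # 975 324
```

Both results match the Gene Length (975 bp) and Protein Length (324 aa) fields of the record.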


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.719113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCG CTCTTCCCCC CGCCGGTCCG CTGCCGCCCG AATTCGGCTT CATCCGCCGC 
CATTTCCTTG CCCTGGCGGG CGAGGGCGCG CTGGGCCTGA CCGACGACGC GGCGCTGCTG
CGCGCGCCGG CGGGGCGGGA ACTGGTCGTC GCGGTCGATA CGATGGTCGA GGGCGTACAT
TTCCTGCCCG ACGACCCGGC CGACACGGTC GGCCGCAAGC TGCTGCGCTG CAACCTGTCC
GACCTGGCGG CGATGGACGC GATGCCGCTG GGCTATCTGC TGGCGGTGAC CACGCCGCCG
GCACGGGACG AGGCCTGGTT CGCCGGTTTC GCCCGGGGCC TGGCGGACGA TCAGGGGCGT
TATGGGCTCA GCCTGCTGGG GGGCGATACC ACTTCCACGC CGGGGCCATT GGTGCTGTCG
CTGACCATCC TGGGGCATGG GGCACCCGGC CGGGCCCTGC GGCGCAACGG CGCGCGCGAC
GGCGACGGGA TATGGGTGAC CGGGACGATC GGCGACGGGG CGCTGGGCCT GCGCGCCCTG
CGCGGGGAAG TGGCCGATCC CGACGGGTTC CTGGCCGGCC GCTATCGGCT GCCGCGGCCG
CGCCTGGGGC TGGGGTTGGG CGGAATCGCG TCGGCCGCCA TGGATGTCTC GGACGGGCTG
GTGCAGGATC TGGGCCACTT GGCCCGTGAA AGCGGCGTCG GCGCCCGGAT CGACGCCGGC
CGCGTTCCCC TGTCGCCGGC CGCCAGGCAG GCGGGCCCCC GCTGGCTGCC GACCTGCCTG
ACCGGCGGGG ATGATTACGA ATTGCTGCTG GCCGTGCCCC CGGCGCACGA GGGCGCCCTG
CGGGAAGCCG CCCGGACGCA TGGGGTGGCG GTCACGCGGA TCGGCGCGTT CGACGCCACG
CTATCCGGCG TCCAGGTACT GGACGGGGTA GGGGGAATCC TGGCGCTGGA GCGCACCGGA
TGGAGTCACC TGTAG
 
Protein sequence
MSAALPPAGP LPPEFGFIRR HFLALAGEGA LGLTDDAALL RAPAGRELVV AVDTMVEGVH 
FLPDDPADTV GRKLLRCNLS DLAAMDAMPL GYLLAVTTPP ARDEAWFAGF ARGLADDQGR
YGLSLLGGDT TSTPGPLVLS LTILGHGAPG RALRRNGARD GDGIWVTGTI GDGALGLRAL
RGEVADPDGF LAGRYRLPRP RLGLGLGGIA SAAMDVSDGL VQDLGHLARE SGVGARIDAG
RVPLSPAARQ AGPRWLPTCL TGGDDYELLL AVPPAHEGAL REAARTHGVA VTRIGAFDAT
LSGVQVLDGV GGILALERTG WSHL
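
The gene and protein sequences above can be cross-checked by translating the DNA and by computing its GC fraction. What follows is a minimal sketch, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard elongation table, which translation table 11 shares. Table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGTCCGCCG CTCTTCCCCC CGCCGGTCCG"))  # MSAALPPAGP
print(translate("TGGAGTCACC TGTAG"))                  # WSHL
```

The two outputs agree with the first and last residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.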