Gene Gdia_1129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1129 
Symbol 
ID6974533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1265033 
End bp1267198 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content70% 
IMG OID643390658 
Productglucosyltransferase MdoH 
Protein accessionYP_002275527 
Protein GI209543298 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.546183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCC TAGACCCAGG CGCGCGGCAG AAGCGGGCGG GACAGGCGGA CGCGTCCGGC 
GCGTTCCGCG CCCTGCCGGA CGAATCCCCG ATGCGGATGG ACGTGCAGTC CCTGTCGGCA
TGGCCCGATA CCCATGGCCG TCCGCGTACG CCGGTCACCG AACCGCCGAT GATCGCCCTG
CGGCGGCTGG CCGTGATCGG CTCGGCGTGC CTGCTGACGG GTTATGGCGC GTATGAAATG
AACCGCGTGT TGAACTCGAT GGGCCTGTCG GTGCTGGGCG CGGTGCTGCT GGTGCTGTTC
GTCCTGCTGT TCCAGTGGAT CGCCCTGGCC TTCACGTCGG CGGTGGGCGG GTTCATCTCG
CTGCTGCGGC ATGGCGGGCT GGGGCTGGGG ATCGCGCGGG ACGGGCCGTT GCCGGCACTG
TCCACGCGCA CCGCCCTGTT GCTGCCGACC TATAACGAAA GCCCGGGCCG GGTGATGGCC
GGGCTGCGCG CGATGTATGA CAGCCTGGCC GCCACGGGCC GGCTGGACGA TTTCGATTTC
TTCATCCTGT CCGACACCAC CAACCCCGAC ATCTGGGTCG AGGAGGAAGC CGCCTTCCTG
GCGCTGCGCG CCGCGACGGG CGGGCAGGGG CGGATCTTCT ATCGCCGCCG CCCGGTCAAT
ACCGAGCGCA AGGCCGGCAA CATCGCCGAA TGGGTGCGCC GCTTCGGCGG GGCCTATCCG
CAGATGGTCA CGCTGGATGC CGACAGCCTG ATGGCGGGCG AGACGGTGGT GCGCATCGTC
GCCGCGATGG AGCGCCATCC CGGCGTGGCC CTGATCCAGA CCCTGCCGGA AATCGTCAAC
GGCACCACGC TGTTCGCGCG CATGCAGCAG TTCGCGGGGC GGGTCTACGG CCCGTTGATC
GCGCATGGCA TCGCCTGGTG GCATGGGGCG GAGGGCAATT ACTGGGGGCA CAACGCCGTC
ATCCGCACCC GTGCCTTCGC CGAGCAGGCG GGCCTGCCGC ATCTGCCGGG ACGCAAGCCG
TTCGGGGGCC ATATCCTCAG CCACGATTTC GTCGAGGCCG CGCTGATGCG GCGTGGCGGC
TGGGCGATCC ACATGGTGCC GGGCCTGGCG GGGTCGTACG AGGAAAGTCC CCCGTCGCTG
ACCGACATCG CCATTCGCGA CCGGCGCTGG TGCCAGGGCA ACCTGCAGCA CGCCAAGGTG
GTGCCCACGC GCGGCCTGCA CTGGATCAGC CGGCTGCACA TGATGATGGG AATCGGCTCC
TACATCACGT CGCCGATGTG GCTGGTGTTC CTGCTGGCGG GCATCCTGAT TTCGCTGCAG
ACCCGGTTCG AGAAGCCGGA ATATTTCGGC GATACCAAGT CGCTCTATCC CAACTGGCCG
CATGTCGATC CCGAACAGGC GAAATACGTG TTCATCGGCA CGATGGCGAT CCTGCTGGCG
CCCAAGCTGA TGGCCTATGT CGCCCTGCTG TTCGACCGCG CCGCCTGCCG GGGCTGCGGC
GGGGCGCTGC GGGCGGCGCT GTCGGTGCTG GTGGAAACGC TGGTCGGCGG ACTGGTCGCG
CCGGTGGCGA TGCTGATCCA GACCTCGGGC GTGATCTCGA TCCTGCTGGG GCAGGATTCG
GGCTGGAACG CGCAACGGCG CGATGACGGG GGCGTGCCGT TCGGCGACAT CGTGCGCGCC
TACTGGCGGT ACATGCTGTT CGGCCTGGTC CTGGGCGGCA GCGCGTGGAG CGTCTCGATC
CCGCTGTTCC TGTGGATGAC CCCGGTGCTG CTGGGGCTGG TCTTCGCCAT TCCGCTGGCG
GCCGTCACCG GTAGCCGGGG CGCGGGCCAG GCCTTGCGGC GGGCCGGCCT GCTGCTGATC
CCCGAGGAAA CGGCGCCGCC GGATATCCTG CTGCGCGCCG CCGCCGCCCG TGCCGCCCTG
CCGCCGCCGC GCGCGGCCGA CGCCATCGCG CTGCTTCTGG ACGATCCCGC GCTGATGGCG
GCGCACCGCG CGATGCTGCC CCCGCCGCGC CGGGCGGGCG ACCCCATCAA TGCCGACCTG
CTGGTGGGCC TGCTGAAACT GGAGGAAAGC CCGGACCGGA CTGTCGCGGT CGCCGCGCTG
AACCGTGCGG AGAAAGCGGC GGTTCTGGCC ACCGCCGATG GGCTGGACCG CCTGACCGTC
CTCTAG
 
Protein sequence
MDTLDPGARQ KRAGQADASG AFRALPDESP MRMDVQSLSA WPDTHGRPRT PVTEPPMIAL 
RRLAVIGSAC LLTGYGAYEM NRVLNSMGLS VLGAVLLVLF VLLFQWIALA FTSAVGGFIS
LLRHGGLGLG IARDGPLPAL STRTALLLPT YNESPGRVMA GLRAMYDSLA ATGRLDDFDF
FILSDTTNPD IWVEEEAAFL ALRAATGGQG RIFYRRRPVN TERKAGNIAE WVRRFGGAYP
QMVTLDADSL MAGETVVRIV AAMERHPGVA LIQTLPEIVN GTTLFARMQQ FAGRVYGPLI
AHGIAWWHGA EGNYWGHNAV IRTRAFAEQA GLPHLPGRKP FGGHILSHDF VEAALMRRGG
WAIHMVPGLA GSYEESPPSL TDIAIRDRRW CQGNLQHAKV VPTRGLHWIS RLHMMMGIGS
YITSPMWLVF LLAGILISLQ TRFEKPEYFG DTKSLYPNWP HVDPEQAKYV FIGTMAILLA
PKLMAYVALL FDRAACRGCG GALRAALSVL VETLVGGLVA PVAMLIQTSG VISILLGQDS
GWNAQRRDDG GVPFGDIVRA YWRYMLFGLV LGGSAWSVSI PLFLWMTPVL LGLVFAIPLA
AVTGSRGAGQ ALRRAGLLLI PEETAPPDIL LRAAAARAAL PPPRAADAIA LLLDDPALMA
AHRAMLPPPR RAGDPINADL LVGLLKLEES PDRTVAVAAL NRAEKAAVLA TADGLDRLTV
L