Gene Gdia_0786 details

Gene Information

Locus tag: Gdia_0786
Symbol:
ID: 6974183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 896799
End bp: 898319
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 64%
IMG OID: 643390315
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_002275191
Protein GI: 209542962
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
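
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon does not encode a residue


length = gene_length(896799, 898319)
print(length)                  # 1521
print(protein_length(length))  # 506
```

Both values agree with the Gene Length and Protein Length fields listed in the record.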


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.21233
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCGATG GAGATCTTCG CATGAGCGAG TGGTATGGCC AGTTGCAGGA TTCGATCGGT 
CTTTCGACCG ATCCGATCAC ATACAAGACG CCTTCGGCCA GGATCGCCGA ACCCGGGACC
TATCCTTATT ATCATTCCGT TCTGGCCGGG ATGATAACCA GCGTAGATAC CCTGGCGATT
CTGGCCGCCA CGTTCATCGG CGCCTCCGTC GAACCGGCGA TCATCCACCA GGACTGGCGG
GTGGCGCAGC CGCTGGCGGA CCTGATGTCG GCGCTGGGGT TCCTGATGCT CCCCAAGAAC
CGCCGTTTGC TTCATGTCCC TACGGTACAG GATTTTTCGG CACAGGTCCG CTATCTGACG
CCGCCGCTTC TGGCCGGGGC GCTGCTGCAC GTCATCGTCC TGTACATGCT GCATCATCCC
ATCGTGCCGT CGTTCGAACT GGCGCTGACG TGGCTGGCTT TTTCGACCGG CGTGCTGGCG
GTGGTGCGCG GAACCGAAAC GGTGCTGCTG TCCCGGCCGG CAATCGCAAA CCAGCTGGCA
CGCAACGTGG CCGTCGTCGG CAGCGACGAC GCCGCCATCA GGCTGGCCGC CCGCATCGGG
GAGGAGGCCG GATCGACCTA TCGCATGATC GGCGTATTCG ACGACCATGA CACCGCGCTG
GACCCCACCG CCACCACCGG CACGCTGGAC GACCTGATCG AGCGCAGCCG CGAAACCCCG
CTGCATGCGA TCATTCTGGC CATTCCCCCC AGTACCGATC CGATCGACCA TGTAGCCGAA
ATCAACTGGC GGCTGCGCAG CGTTCTGGCG GACGTCTATG TGCTGCCCAA TATCGTGCAT
GGCATCGACG TCCTGCTGCC GATCGAACGG CTGGGGCCGT TCGCGCTGCT GGTCCTTCAG
CGTCGCCCGC TGTCGGACTG GCAGATCGTC AAGAAGACGG TCCTGGACGT CTTCCTGGGC
GCAATCGCCC TGGTGATGCT CGCGCCGCTG ATGGCGGCGG TCGCGATCAC GATCAAGGCG
ACCTCGCCCG GTCCCGTCTT CTTCCGGCAG CCGCGCCTGG GGTTCAACAA CCGGACCTTC
ATGGTCTTCA AGTTCCGCAC GATGTTCACC GACAAGTCGG ACATGATGGC CGCGCGCCAG
ACCGCGCGGG ACGATCCGCG CGTGACCCCC ATCGGCAAGT GGCTGCGCAA GCTGAGCATC
GACGAGCTGC CGCAATTGCT GAACGTTCTT CGCGGCGAGA TGTCGCTGGT CGGTCCCCGC
CCGCACGCGC CGCACACCCG CGCGGCCGGC ATGCTGCTAG ACGATGCGCT GGCGGAATAT
GTCATCCGCC ACCAGGTCAA GCCCGGCATT ACCGGATGGG CGCAGGTCAA CGGCGCGCGC
GGCCAGTTGG TCACGCTGGA CGATCTGCGC CGCCGCGTCG AACTGGACCT GGAATACATG
CAGAGATGGT CGCTGCGCTT CGATCTGAAG ATCCTGATGC TGACCGTGGT GCGAGAGGTC
TTCAGCCGTC ATGCCTTCTG A
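
The sequence above is printed in groups of 10 bases, 60 per line, so it has to be stripped of whitespace before it can be measured or compared with the Gene Length and GC content fields. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = clean_sequence(
    "TTGCCCGATG GAGATCTTCG CATGAGCGAG TGGTATGGCC AGTTGCAGGA TTCGATCGGT"
)
print(len(first_line))
print(round(gc_percent(first_line), 1))
```

Applied to the full sequence, the same two calls give values that can be compared directly with the 1521 bp length and the 64% GC content reported in the record.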
 
Protein sequence
MPDGDLRMSE WYGQLQDSIG LSTDPITYKT PSARIAEPGT YPYYHSVLAG MITSVDTLAI 
LAATFIGASV EPAIIHQDWR VAQPLADLMS ALGFLMLPKN RRLLHVPTVQ DFSAQVRYLT
PPLLAGALLH VIVLYMLHHP IVPSFELALT WLAFSTGVLA VVRGTETVLL SRPAIANQLA
RNVAVVGSDD AAIRLAARIG EEAGSTYRMI GVFDDHDTAL DPTATTGTLD DLIERSRETP
LHAIILAIPP STDPIDHVAE INWRLRSVLA DVYVLPNIVH GIDVLLPIER LGPFALLVLQ
RRPLSDWQIV KKTVLDVFLG AIALVMLAPL MAAVAITIKA TSPGPVFFRQ PRLGFNNRTF
MVFKFRTMFT DKSDMMAARQ TARDDPRVTP IGKWLRKLSI DELPQLLNVL RGEMSLVGPR
PHAPHTRAAG MLLDDALAEY VIRHQVKPGI TGWAQVNGAR GQLVTLDDLR RRVELDLEYM
QRWSLRFDLK ILMLTVVREV FSRHAF
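
The gene sequence begins with TTG, yet the protein begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where several alternative initiation codons are read as methionine in the first position. The sketch below illustrates this under stated assumptions: the codon table uses the standard amino-acid assignments, `START_CODONS` is the set of initiators listed for NCBI table 11, and `translate_cds` is a hypothetical helper rather than a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the standard assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons for table 11 (assumed from the NCBI genetic code listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text):
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative initiators still begin with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("TTGCCCGATG GAGATCTTCG CATGAGCGAG"))  # MPDGDLRMSE
```

The result matches the first 10 residues of the protein sequence shown in the record. A full-featured library such as Biopython would normally be used for this in practice; the hand-built table here only keeps the example self-contained.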