Gene Gdia_0746 details

Gene Information

Locus tag: Gdia_0746
Symbol:
ID: 6974143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 849698
End bp: 850756
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 68%
IMG OID: 643390275
Product: glycosyl transferase family 2
Protein accession: YP_002275151
Protein GI: 209542922
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
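
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS is unspliced and ends with a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(849698, 850756)
print(length)                  # 1059
print(protein_length(length))  # 352
```

Under these assumptions the computed values agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields of the record.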


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0487138
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGCGA GCGTCGTCAT TCGTTCCCGC AACGAAGCGG ACCGGCTGCG CCTGACCCTG 
GCATCGCTTG CCAGCCAGAC CGAGGCCGCG GAAGTCGTCG TCGTCAATGA CGGGTCCACC
GACCATACGG CGGAGGTCAT CGAAGACGCC AGGGCCGAAC TGGATATCGT GTCCGTTCAC
CATGCCCGCC CGGCCGGGCG ATCGGCGGCG GCCAATACCG GCGGCACCCA TGCCACGGGG
GACATCCTGA TCTTCCTGGA TGGCGACACC CTTGCCGGGC CCGACCTGGT CGCGGACCAT
CTGGCGATCC ACCGCCAACG GCCCGGCGTG GTGGTCCGTG GCGAGAACTT CCATCTGCGC
TGCACCCGCC CGTTCCTGGA CCCCGAACGC GGCACGCCCC GGCCCGGCGA GGAAGAACGG
GTCGCGCGCA TGTCGGAGGC CGAACGGGCG CGGGCGATCG TCACCCGCGC GCAGGTCACG
CAGCGGTTCG ATGAAATTGA CCATCGCGCC CAGGCCGGCG TCTATCCCGG TTTCGGCCCG
CGCAAGCTGT ACGAACTGGA AATGGAGGCC CTGCGGGCGG AAGGGGATTG CGGCGTCCTG
TGGGCTGCCG CCGCCGGTGC CAACCAGTCG GTGCCGCGCG ATGCCTTCGC CCGTGCGGGG
GGATTTCATC CCGACATATC GATCAACGAA CATCGCGAAC TGGCACTGCG CCTGTGCCAG
GCGGGGCTGA AGATGGTGGC CGGCGCGGCA CGCAGCTATC ACTTGATCCA TCGTAGCGGC
TGGCGGGACC CGCTGGAGGA CAAGGACTGG GAGGACATCT TCTACCAGGC CCATCCGCGC
GCCGACGTCG CCCTGCTGCC GCTGCTGTGG CAGAGCCTGA GCGACACCGC GATCATTCCG
GAAGATTTCC GCATTCTGTC GCTGCCGCAC CTGGCCGAGA TCGCCGGGTC CTACGAGGGC
CTGCCCAGCC GCGAGGCCGT GCGCGAGGCC TACATGGCGG CACGGGAAGC GACGCTGTCG
GAGTCCGACA TTCGTTCCAA CATTCCCTGG GGAACATGA
 
Protein sequence
MRASVVIRSR NEADRLRLTL ASLASQTEAA EVVVVNDGST DHTAEVIEDA RAELDIVSVH 
HARPAGRSAA ANTGGTHATG DILIFLDGDT LAGPDLVADH LAIHRQRPGV VVRGENFHLR
CTRPFLDPER GTPRPGEEER VARMSEAERA RAIVTRAQVT QRFDEIDHRA QAGVYPGFGP
RKLYELEMEA LRAEGDCGVL WAAAAGANQS VPRDAFARAG GFHPDISINE HRELALRLCQ
AGLKMVAGAA RSYHLIHRSG WRDPLEDKDW EDIFYQAHPR ADVALLPLLW QSLSDTAIIP
EDFRILSLPH LAEIAGSYEG LPSREAVREA YMAAREATLS ESDIRSNIPW GT
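
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGAGCGA GCGTCGTCAT TCGTTCCCGC AACGAAGCGG ACCGGCTGCG CCTGACCCTG"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base window only
```

Applying the same two functions to the full gene sequence block would give a value that can be compared with the GC content field of the record.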