Gene Gdia_0509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0509 
Symbol 
ID6973905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp559186 
End bp562413 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content66% 
IMG OID643390041 
Productglycosyl transferase family 2 
Protein accessionYP_002274918 
Protein GI209542689 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.253023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0268717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCC CGCCCGGCGA CACGGTCGTC GGACTGCTGC TGGATATCGC CTGCCTTCTG 
CTGAGCAACG GTGCGGACGC CCCGGCGATC GAAAACACCG TCGGGGCCTT CGCATCCTGC
CTGGGCCATG ACGCGGGATT GACCATTACG TACCGGGTGG ACGCCTTCCT GCTGGACATC
GGGACCAGTT CGGGCGAGCA GGCGCGGCAC GCCGTGCCGA TCGCGACCAT GCGTGTGGCG
CCGTCGGTGA TCGAGCCGCT GCTGGCGCTG GCCGGCCACG CGTCCTGCAC GCAGGACGAC
ATCCATGCCG CGCGGGCCCG CGCGCTGCGG CGGGACGCGG CGGCCGATCC CGGGCACCAG
CCGGCCTGGT ACCGGTTCGA CCCGGACTGG TATCGGGCGG CCTATCCCTT CGTCGCCGAG
CAGATGGTCT TCCTGGGCTG CGACGACGTC GTGGCCTATT TCCGGGATTT CGGCATCGGG
CTGGGCCATT CCCCCAACCC GTTCTTCGAC GAGGCATGGT ATCGCACGGC CCATCCCGAC
ATTGCGCGAC TGATCGCGGA CGGCGTGGTC CAGAACGGAT TCGTCCATTA CCTGACCACC
GGTTTTGCCG ATCGTTCTCC GCACTGGCTG TTCGATTCCG GCCTGTATCG CAGGGCGCAC
CCCGACCTGT CGCCCGAAGG CCTGGCGATA CGGGGGTACC GCAACCTTTA CGACCATTAC
CTGGAAGTCG GCGACCCGTC CGGCCTGCGG GGGCAATGGC TGTTCGACCC GTCCGGCAGG
TTCCGGCAGG TGGCGGCGTC CCTGCCCCAC GTGGCACCGA CCCTGTCGCT GTCGCCGTGT
TTCGACGCGA TCTGGTATCT GAAGACCTAT CCTGAAGTCG CCGCGCTGAT CGCCGCCGGT
GCCTATTCCT GCGCGCTGCA TCACTACCTG GCCAATCCGA CCCCGACCCG CTTCTGTGCG
ACGCCCTGGT TTTCCGAAGA TTACTACCGT GTTTTTTACG AAGACGTGGA CAGCGCGCTG
CGGAACGGCA CCTTCCGCAC GGGATACGAG CATTTTCTGG AATTCGGCCT GCGGGAATGG
CGGCGCCCCC ACCCGGATGT CGACCTGCGC GCCTTTCGCG ACCGGCTGGC GCGGACGAAC
CCGGAAATGC GCCTGGAACC GGACCCGTTC CGTTTCTGGC TGGCCGTACC GGCCGACCTG
CGCATCGCGC CGGCCCCGCC ACGCATCGAC GAGGCGATAT CCCGCGACGC CTTCCGGCAG
GCGGCCGAGG ATATGCTGCT GCTGCACGCG CACGAACCGA TCGATTTCAC GCCGGACGGG
CCGGCCGACC TGGCCGTGGT GATGGTCGCG CACAACCGGT TCAGCCTGAC GATGCAGGCC
CTGGCCGCGC TGCGCACCGG AGGGCCCGGA AACATGCAGG TCATCATTGC CGATTCCGGA
TCACATGACG AGACGCGGCA TCTGGAGCGG TACGTCGCGG GTGCCCGAAT CATCCGCTTC
GCGCGCAATG TCGGCTATAT CGAGGCCTGC AACGCGGCGC TGCGGATGGT GACGGCACCC
TGCACATTGT ACCTGAACAA CGACCTGATC GTGGAGTACG GAGCCATCGC GCGGGCGCTG
CGCCGACTGC ATGCCGCGCC CGACATCGGG GCGGTCGGTG CGAAGATTGT CCGCAGCAAC
GGCATTCTTC AGGAAGCGGG GTCGATCCTG TGGCGGGACG GCACCACCAG CGGCTATCTG
CGCGACCGCG ATCCCGCGAC GCCCGAGGCG AACTTCGTGC GCGAGGCCGA TTACTGTTCC
GGCGCGTTCC TGCTGGCGCG GACCGGGTTG CTGCATCAGC TTGACGGTTT CGATCCGGCC
TTTTCCCCGG CCTATTACGA GGAAGTCGAT CTGTGCGTGC GGATGCGCAA GGCCGGGTAC
CGGGTCGTCT ATGATCCGTC GGTCATGGTG CGGCATCTGG AATACGGATC GTCCGACACC
GATCATTCCC GTGTCCTGAT GCATCGCAAT CATCGCGTCT TCAGCGACAG GCATCGCGAT
ATCCTGCGCT ATTGCCAGCC CCGCGCCGCA GGGAACGCCA TCTTCGCGCG CTCGCCCCGC
GGCGCCCGGC GACGCATCCT GTATATCGAG GACCGGCCGC CGATCCGCCG CCATGGCGCG
GGCTATGCAC GGTCCAACGA CATCGTGCGC CTGATGGTGG AAATGGATTA TCAGGTCACC
ATCTTTCCGA TCCTGATGAC TGACACGCCC CTGCTGGACA TCTACGGCGC CCTGCCCGAC
AGCGTGGAAA TCCTGCATGA CCGCCATATC GGCATGCTGG CCGACCTGAT CCGCGAACGG
CCGGGATATT ACGATCTGGT CTGGGTGGGC CGGACCCATA ATCTGGCGCA GATCCTGCCG
ATCCTGGCCG CGTCACCGGC CGCCCTGCCG GTCGAGGGCT TCATCCTGGA CACCGAGTGC
ATCGCCGCCC CCCGGACGGC GGAGCGCGCG CGGGTGCTGG GCCTGGCATC CCCGCCGAAA
CTGGATCAGG CGGTGCGCGA CGAACTGGCC TGCGCCTATT TCTGCCAGCA GATCGTGGCA
GTCAGCGACC ATGACGCAGC ACTGGTGCGA TCGGCCGGTT ACGACAATGT CGCCGTGCTG
GGGCACATGC TGGAACCGGC CCCCACGCCC TCGGGCTGGG CGGAGCGGAG CGGCATCCTG
TTTTTGGGCG CCCTGCACGA CATGGAGTCC CCCAATTACG ACAGCATCGC GTGGTTCATC
ACGCAGGTCA TGCCGCGCAT GCCGGCGGAG ATGCATCTGA CGATCGCGGG CCATGTCGAT
CCGTCGGTTC ATTTCAGCGC CCTGGCCGGC CATGGGCGCG TCACCTTCCT GGGCGCGGTC
GATGATCCGC GGCCGCTCTA TGACCGGCAC CGTGTGTTCG TCGCCCCCAC CCGCTTCGCC
GGCGGCCTGC CCTACAAGGT TCACGAGGCA GCCGCCCATG GCCTGCCGGT GGTGGCCAGC
ACGGTACTGT GCCGGCAGGT CGGCTGGGAT GTGGGCACGG ATATCCTGTG CGGCGGATCG
GACGACCCGC AATGCTTTGC CGATGCGATC ATGGCGCTGT ACGAGGATGC CGGATTGTGG
CGTACGGTGC GCGACGGTGC GATCGGACGC ATTGCGCGGG AGAATGATCC GCATGCGTAT
CGCCGCCGGT TGGCGGACAT TTTGGAAAAA CTGTTATCCA TGGGATAA
 
Protein sequence
MPTPPGDTVV GLLLDIACLL LSNGADAPAI ENTVGAFASC LGHDAGLTIT YRVDAFLLDI 
GTSSGEQARH AVPIATMRVA PSVIEPLLAL AGHASCTQDD IHAARARALR RDAAADPGHQ
PAWYRFDPDW YRAAYPFVAE QMVFLGCDDV VAYFRDFGIG LGHSPNPFFD EAWYRTAHPD
IARLIADGVV QNGFVHYLTT GFADRSPHWL FDSGLYRRAH PDLSPEGLAI RGYRNLYDHY
LEVGDPSGLR GQWLFDPSGR FRQVAASLPH VAPTLSLSPC FDAIWYLKTY PEVAALIAAG
AYSCALHHYL ANPTPTRFCA TPWFSEDYYR VFYEDVDSAL RNGTFRTGYE HFLEFGLREW
RRPHPDVDLR AFRDRLARTN PEMRLEPDPF RFWLAVPADL RIAPAPPRID EAISRDAFRQ
AAEDMLLLHA HEPIDFTPDG PADLAVVMVA HNRFSLTMQA LAALRTGGPG NMQVIIADSG
SHDETRHLER YVAGARIIRF ARNVGYIEAC NAALRMVTAP CTLYLNNDLI VEYGAIARAL
RRLHAAPDIG AVGAKIVRSN GILQEAGSIL WRDGTTSGYL RDRDPATPEA NFVREADYCS
GAFLLARTGL LHQLDGFDPA FSPAYYEEVD LCVRMRKAGY RVVYDPSVMV RHLEYGSSDT
DHSRVLMHRN HRVFSDRHRD ILRYCQPRAA GNAIFARSPR GARRRILYIE DRPPIRRHGA
GYARSNDIVR LMVEMDYQVT IFPILMTDTP LLDIYGALPD SVEILHDRHI GMLADLIRER
PGYYDLVWVG RTHNLAQILP ILAASPAALP VEGFILDTEC IAAPRTAERA RVLGLASPPK
LDQAVRDELA CAYFCQQIVA VSDHDAALVR SAGYDNVAVL GHMLEPAPTP SGWAERSGIL
FLGALHDMES PNYDSIAWFI TQVMPRMPAE MHLTIAGHVD PSVHFSALAG HGRVTFLGAV
DDPRPLYDRH RVFVAPTRFA GGLPYKVHEA AAHGLPVVAS TVLCRQVGWD VGTDILCGGS
DDPQCFADAI MALYEDAGLW RTVRDGAIGR IARENDPHAY RRRLADILEK LLSMG