Gene Gdia_2886 details


Gene Information

Locus tag: Gdia_2886
Symbol:
ID: 6976318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3161358
End bp: 3164384
Gene Length: 3027 bp
Protein Length: 1008 aa
Translation table: 11
GC content: 76%
IMG OID: 643392394
Product: glycosyl transferase family 2
Protein accession: YP_002277232
Protein GI: 209545003
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
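
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3161358, 3164384)
print(length, protein_length(length))  # 3027 1008
```

Both results match the Gene Length and Protein Length fields of the record.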


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0431865
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGCGGT TCCGGCCGCA GGGGGTGACG GGTGCCGATC GGGCCGCGTG GCAGGCGTGG 
GCCGATGCGC GTGCCCAGGC GGCCTTTCGC CGTGCGCGCC TGCTGGAGGA TGCGGGCGAC
CCGGACGGGG CCTGGCGATG GTACGAGCGG GCGAACCGCC TGGCTCCCGA CAGCCCGAAC
GTGATGTTCC CGCTGGCGGT GGCGCGGCTG CGCGGCGGGG ACACCGCCGG CAGCGTGCGT
CTGCTGCGTG AACTGACCCG GCGTTTCGAT TTCCCCGAGG GCTGGGCGGC CCTGTCCGGC
GCCCTGCTGG CGGCGGGCGA GGACGCGGAC GCCATCGCGG CGGCGCAGCA CACCCTGTCC
CGCCACGCGC CGCCGGCCGG CATGGATGCG CTGTTCGGGC GACTGGCGGC CCGGGCGGGC
CGGCCCGGCT GGTGCGGCCT GTCGGCATCG GGCCGGCTGC ATGGCGGTGG TGTCCGCGAC
CCCGATATAT TCTTCGATGG GCAGCCGGTC CGATCGCGGG CCGGGGCGGA CGGGCCGGTG
CTGCCGCCCT GCTGGCGCCT GGCCCGGCAG GTCGAGGTGC TGGCGGGCGG CGTGCCGCTG
CTGGGCAGCC CGCTGGACCC GGCGGCGGTG CAGCGGACCG AGGGATTCGT CGAATCCGGG
CCACGGGGCC TGTCGGGCTG GCTCTGGCAC CCGGCGGACC CGGACCATGT GCCCGAACTG
CGCCTGCTGG ACGGGCGGAC CGGGCGCCTG CTGCTGCGCC AGGCGGTGCC CGAACCGGCG
ACCGAGGTCA GCAGTGCGGT GCCGCTGGCC CGGCCGCGCG GCTTTGCCAT TCCGGCCGCC
TGCCTGCCGG CCGGCCCGGT GCGCGTGCTG GGGTCCGACG GGCGCGACCT GCTGGGCAGC
CCGCTGGCCA CGGCCTGCGA ACCGTGCGGG GGCGGCTGCC CGGGCGAAGC CGATTGCCCG
AAACGCCTGC CTACCGGCCG GCCGCCCTGC CGCCCGCCGG ATGCGCCGCC GGCCCGTGGA
CGCCCGGCGA TCGTGATTCC GGTCCATGGC GACCGCGCGG CCACGCTGGC CTGCCTGGAA
TCGGTGCGCG ACAGCGTGGG GCCGGACGTG CCGGTGGTCG TGGTCGATGA CGCGACGCCG
GTCGCGGCCC TGGCCGCCGA CCTGGACCGG CTGGCGGCGG CGGGGACGAT CCGGCTGGTC
CGGCACGCGC GCACGCTGGG CTTCCCGGCC TCGGCCAATG ACGGGATGCG CGCGATGGCG
GGGCACGACG TCGTGCTGCT GAACAGCGAT ACGCTGGTGG CGGGCGGCTG GCTGGCGGAA
CTGGCCGACG TGGCCTATGG CGCGGCGGAT ATCGGCACCG TGACCCCGCT GTCGAACCAG
GCCAGCATCT TCTCGTGGCC CGGACCGGAC CGGGCGCCGG ACGCGGAATT CTGCCCGGCG
TTCGACCTGG CCCGGGTGCG GCACGTGATG GCGGCGGCGC GGGCGGCCAA TCGCGGGGTG
GCGGTCGATG TGCCGACCGG GCACGGATTC TGCCTGTTCA TCCGCCATGA CTGCCTGGAT
GCGTTCTGGC GGCTGCAGCC CGGCGCGCCG GATTCGGGCC CCTTGCGGGC CGACCTGTTC
GCCCAGGGCT ATGGCGAGGA AAACGATTTC TGCCTGCGCG CCGCCGCCGG GGGGTGGCGG
CATGTCGCCG CGCCCGGGGC GTTCGTCGGC CATGTGGGCG GGGCCTCGTT CGGTGCGGCG
CGGCCGGACC TGCTGCGGCG GAACCTGGAC ATCGTGAACG CCCTGCACCC GGGCTATGAC
GCGCGGGTCG CCGCCCATGT CGCCGCCGAC CCGCTGTTCG CGGCACGGCG GCGGCTGGAC
CTGGTGTGCT GGCGGCGCGG CGGGGCCGGG TGGGATGGGG CGGTGCTGCT GGTCACGCAC
GACCAGGGCG GCGGGGTGGA ACGCGTGGTG CGCGCCCGCG CCGCCGCCTG GCGCGCACAG
GGCGCGCGGC CGGTGGTGCT GCGCCCCGCC CCCGGCGGCT GTCGGCTGGA GGACGTGCCG
GACACGGCGT CCGGGCCGTC CGCCGCGGCG TTCGCCAGCC CGCGATTCCG CCTGCCCGGG
GAAGGGGACA TGCTGCTGGC CGTGCTGCGC GCGTCGGGGG TCGATTCGGT GGAATGGCAT
CATTCGCTGG GCCACGGGAT CGATGTCCGC ACGCTGGCCG CCTCGCTGGG CGTGCCGTAC
GAGGTGCATG TCCATGACTA TGTGTGGTTC TGCCCCCGGG TGGCGCTGGT CGGGGCGTCG
GGCCGCTATT GCGGCGAACC GGGACCGGCG GGATGCGCCG CCTGCCTCGC CCCCGGCGGC
AGCCTGCTGG AGGACGGGGA GGGGACGATC CCCGATCGGC TGGCGCGCCA CCTCGCGCGG
TCCCGCGCGG ACCTGCTGGG CGCGCGCGCC GTCATGGCGC CGTCGGATGA CGCGGCCCGG
CGCATCGGGC GGCATTTTCC CGAAATCGCG GTCCGGGTGA CGCCGCTGGA GGACGACCGG
CCCGGGATGG ACCTGGCATC CTTCGTCCGG ATCTATGGCG GGGTGGCTTG TGGCGGGGCG
GCGCCGGCGG GACGCGCGCG GCCCGCCGGC CGGGTGCGCG TGGCGGTGCC CGGCGCGATC
GGGGTCGAGA AGGGTTATGG CATCCTGCTG GAGGCCGCGC GGGACGCGGC GGCGCGGGAC
CTGAATCTGG AATTCGTGGT GGTGGGCCAC ACGCCCGACG ACGACGCGCT GATCGGCACC
GGCCGGGTGT TCGTCACCGG CCCATACCGC GAATCCGATT CCGTCGCCCT GCTGGCGGCG
CAGGGGGCGG ACCTGGCGCT GCTGCCCTCG GTGTGGCCGG AGACCTGGTG TTTCGCGCTG
GGGCTGGCGT GGCGGGCCGG CCTGCGGGCC GTGGTCTTCG ACCTGGGGGC CATGGCCGAA
CGGGTGCGGC GCACGGGCCG GGGCCGGGTC CTGCCATTGG GTATGCCGAT TCACGAATTG
AATACGTTTT TGCTCTCCTA TGCCTGA
 
Protein sequence
MGRFRPQGVT GADRAAWQAW ADARAQAAFR RARLLEDAGD PDGAWRWYER ANRLAPDSPN 
VMFPLAVARL RGGDTAGSVR LLRELTRRFD FPEGWAALSG ALLAAGEDAD AIAAAQHTLS
RHAPPAGMDA LFGRLAARAG RPGWCGLSAS GRLHGGGVRD PDIFFDGQPV RSRAGADGPV
LPPCWRLARQ VEVLAGGVPL LGSPLDPAAV QRTEGFVESG PRGLSGWLWH PADPDHVPEL
RLLDGRTGRL LLRQAVPEPA TEVSSAVPLA RPRGFAIPAA CLPAGPVRVL GSDGRDLLGS
PLATACEPCG GGCPGEADCP KRLPTGRPPC RPPDAPPARG RPAIVIPVHG DRAATLACLE
SVRDSVGPDV PVVVVDDATP VAALAADLDR LAAAGTIRLV RHARTLGFPA SANDGMRAMA
GHDVVLLNSD TLVAGGWLAE LADVAYGAAD IGTVTPLSNQ ASIFSWPGPD RAPDAEFCPA
FDLARVRHVM AAARAANRGV AVDVPTGHGF CLFIRHDCLD AFWRLQPGAP DSGPLRADLF
AQGYGEENDF CLRAAAGGWR HVAAPGAFVG HVGGASFGAA RPDLLRRNLD IVNALHPGYD
ARVAAHVAAD PLFAARRRLD LVCWRRGGAG WDGAVLLVTH DQGGGVERVV RARAAAWRAQ
GARPVVLRPA PGGCRLEDVP DTASGPSAAA FASPRFRLPG EGDMLLAVLR ASGVDSVEWH
HSLGHGIDVR TLAASLGVPY EVHVHDYVWF CPRVALVGAS GRYCGEPGPA GCAACLAPGG
SLLEDGEGTI PDRLARHLAR SRADLLGARA VMAPSDDAAR RIGRHFPEIA VRVTPLEDDR
PGMDLASFVR IYGGVACGGA APAGRARPAG RVRVAVPGAI GVEKGYGILL EAARDAAARD
LNLEFVVVGH TPDDDALIGT GRVFVTGPYR ESDSVALLAA QGADLALLPS VWPETWCFAL
GLAWRAGLRA VVFDLGAMAE RVRRTGRGRV LPLGMPIHEL NTFLLSYA
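
The protein sequence follows from the gene sequence under translation table 11. A minimal sketch of that translation is given below, assuming the first codon is a start codon and is therefore read as M (the gene begins with GTG, one of the alternative start codons table 11 allows). The names `CODON_TABLE` and `translate` are illustrative; the codon assignments are the standard ones, which table 11 shares.

```python
# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at the first stop codon.

    Spaces and newlines (as in the 10-base blocks of the record) are ignored.
    Assumption: the input starts at a valid start codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # An initiating codon such as GTG is translated as methionine.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGGGGCGGT TCCGGCCGCA GGGGGTGACG"))  # MGRFRPQGVT
```

The output matches the first ten residues of the protein sequence; applying the same function to the last nine bases preceded by a start codon would stop at the terminal TGA.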