Gene Gdia_2855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2855 
Symbol 
ID6976287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3122787 
End bp3124661 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content62% 
IMG OID643392363 
Productglycosyl transferase family 2 
Protein accessionYP_002277201 
Protein GI209544972 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.152901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.875286 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCATGG TCACGCAGCC GGACAAGCGT CTTTTCGGGC AGTTCCTGGT CAGCCGGAAC 
ATCCTGACCG AGGCGCAGCT CGACGAAGCC ATCCAGACGC AGAAGCTCTG GAAATCCCGC
CTTGGCGACA TCATCCTGGC CAAGGGGTGG CTGAAGCCGC GTCGTTTCTA CCATCTTCTG
GCCACGTTCT TCGACCTGGA GTTCGTGGAC CTGATGGGCC ACCCGCCGGA CCCGGACCTG
TTCGACCGCG CGATGATCGA CGAGTATGCG CGGCGTTCGT TCCTGCCCTG GCGGCGCAGC
GCGGACGGCG CCATCATCCT GGCGCTGGCC GACCCGTCGG AAGACACCTT CAACTGGCTG
CGCGCCCGGT ATGGGCAGAA CGCGTGCTTC GTCGGCACCG GGCGGTTCGA TACGGTGTGG
CTGCTGCAGA AGATGGGCAA CGCCGCCCTG TCCGACGACG CGCTGAATGC GTTGTCCACC
ATGACGCCGC AGCATTCCGC CCGTCAGGTG TTCACCCGGG GGCAGATCGC ATTCTTCTAT
ATCACGGCGA TGGTTTTCCT GCTGTTCCTG TACCTGGCGC CGGAACGCAC GCTGTTCGCG
GCCAACCTGG CGGCCGGACT GGTCTTCTTC GCCAGCTTCT TCCTCAAATT CATGCTGTCC
TGCGCGGCGG TGCGGCAGGA GGTGGACGTC AAGGTCAGGG ACAGCGAGAT CCGGTCGCTG
GAGGACAGGG ATTTCCCCGT CTATACCATC CTGGTTCCGA TGTACAAGGA ACCGGACGTG
CTGCCCATCC TGGTCAATGC CATCCGCAAC CTGGAATACC CCCAGTCCAA GCTGGACGTG
AAGCTGGTCC TGGAAGAGGA CGACATCGAA ACCATCGCCG CCGCCCGGAA GCTGGCGCTG
GAGGCGACGT TCGAGATCAT CTGCGTGCCG CCGTCCGAAC CGCGCACCAA GCCCAAGGCC
TGCAACTATG CGCTGCGCTT CGCGCGGGGC GAGTACCTGA CCATCTATGA CGCCGAGGAC
AAGCCGGAGG CCACGCAGCT TGAAAAGGTC CTGGTCGCGT TCCGCAAGCT GCCGGACAAT
GTGGTCTGCA TCCAGGCGCG CCTGAACTAT TACAACGCCA CCGAGAACTG GCTGACGCGC
ATGTTCACGC TGGAATATAC GGCGTGGTTC GATTTCTACC TGCCTGCGCT GGAATATATG
CGCATTCCCA TACCGCTGGG CGGTACCTCG AACCATTTCA AGATCAGCGC GCTGCGCGCC
GTCCATGCGT GGGACCCGTA CAACGTGACG GAAGACGCCG ACCTGGGCGT GCGCCTGACC
CAGCGGGGCT GGAAGGTGGC GGTGGTGGAT TCCACGACCT TCGAGGAAGC CAATGTCAGC
ATCCCGAACT GGATCCGCCA GCGCTCGCGC TGGCTGAAGG GCTACATGCA GACCTATCTG
GTGCATATGC GCAGCCCGCT GGCATTCTAC CGCAAGACCG GGGGCACGGG CTTCTGGGGC
TTCCAGTTCT TTATCGGCGG TACCTTCATG ACGGCGCTGC TGGCCCCCAT CTTCTGGGTC
TTCTTCATCC TGTTCACGCT GTTCGGCCTC AAGGCCGGAA GCGGCGTCTT CTCCGGCCGC
ATCATGGCCC TCAACGCTCT CAACCTTCTG CTGGGAAACG GATTTCTGGT CTATACCTAT
GTGCTGTGCT CGTTCAAGCG GAACTACCGC CATCTGGCGC TCTATGCGCT GACGACGCCG
GTCTACTGGG CGCTGCAGTC GATCGCCGCC TACAAGGGGC TGTTCCAGCT TCTGTACAAG
CCCTTCTACT GGGAAAAGAC CCAGCATGGC CTGAGCAAGC ACACCGCAGC GGAACTGGAA
GAACTGAAAT CATGA
 
Protein sequence
MAMVTQPDKR LFGQFLVSRN ILTEAQLDEA IQTQKLWKSR LGDIILAKGW LKPRRFYHLL 
ATFFDLEFVD LMGHPPDPDL FDRAMIDEYA RRSFLPWRRS ADGAIILALA DPSEDTFNWL
RARYGQNACF VGTGRFDTVW LLQKMGNAAL SDDALNALST MTPQHSARQV FTRGQIAFFY
ITAMVFLLFL YLAPERTLFA ANLAAGLVFF ASFFLKFMLS CAAVRQEVDV KVRDSEIRSL
EDRDFPVYTI LVPMYKEPDV LPILVNAIRN LEYPQSKLDV KLVLEEDDIE TIAAARKLAL
EATFEIICVP PSEPRTKPKA CNYALRFARG EYLTIYDAED KPEATQLEKV LVAFRKLPDN
VVCIQARLNY YNATENWLTR MFTLEYTAWF DFYLPALEYM RIPIPLGGTS NHFKISALRA
VHAWDPYNVT EDADLGVRLT QRGWKVAVVD STTFEEANVS IPNWIRQRSR WLKGYMQTYL
VHMRSPLAFY RKTGGTGFWG FQFFIGGTFM TALLAPIFWV FFILFTLFGL KAGSGVFSGR
IMALNALNLL LGNGFLVYTY VLCSFKRNYR HLALYALTTP VYWALQSIAA YKGLFQLLYK
PFYWEKTQHG LSKHTAAELE ELKS