Gene Gdia_3101 details

Gene Information

Locus tag: Gdia_3101
Symbol:
ID: 6976535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3395702
End bp: 3396811
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 75%
IMG OID: 643392609
Product: glycosyl transferase group 1
Protein accession: YP_002277446
Protein GI: 209545217
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
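
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3395702
end_bp = 3396811

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1110 369
```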


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.542778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGACCG TCCTCGTCTA TCGCGATCGG CTGCTGCCGC CCTCGGAACA GGCCTTCATG 
CGGCGGCAAT ATATGGGGTT TCGCACTCTC CGGCCCTGCT GGGTCGGGTG CCGGCGCGAT
GCCCCCGCGC CCGACTTGCC GGGTGATGTG CGTTTCCTCG GCGGCAGCGG GCCCCTGCGG
CCGCTGCGCC AGATGGCGTT CCGGCAGTTC GGCTGGGGCG CGGCGCGCGA GGTCGCGGAC
CTGGCGCCCG TGCTGGTGCA TGCCCAGTTC GGGCGCGGGG GCGCGCTGGC CCTGTCGATC
GCCCGGGCAC TGGGCGTGCC GCTGGTGGTG ACGTTCCATG GCGGCGACGC GTTCAAGGAC
CGGCATTACG CGGGCGGCTT TCCGCCATCG GTGTTCCAGC GGCGCTGGCA GGCGCTGCAA
TCCCATGCCG CGCTGTTCGT CTGCGTGTCC GAGGGCGTGC GCGACCGGTT GCTGGAACGC
GGGGTGCCGG CCCGACTGCT GGAGGTGATT CCCATCGGGG CGGAGCCCGC CCCCCTGGCC
GCCGGCCCCG CCGACCGCTT CGTCTTCGCC GGGCGCTTCG TGGACAAGAA GGGGGTGCCG
GTGCTGATCG ACGCGGTGCG GATTCTGGCC GGGCGCGGGG TGACGCCCCC GGTCGTTCTG
GCGGGGGACG GGCCGCTGCT GCCGGCGATG CGCGACCGTG CGGCGGGCCT GGCCAACCTG
CGCTTCGCCG GCTGGCTGGG GGCGGCGGAC CTGGCGGCGG AGATGGACCG GGCGATCGCG
CTGCTGGTGC CCAGCGTGGT GCCGCCCGGC GGCGACCGCG AGGGCCTGCC CAGCGTCGCG
GTGGAGGCCA TGGCGCGCGG CGTGCCGGTC GTCGCCTCCA GCCAGTCGGG GCTGGAGGGC
GCGGTGGGGC ATGCGGGGGC CGGGATCGTG GTGCCGGCCG GCGATCCGAT GGCGCTGGCG
GATGCGATGC AGGCGATGCT GGTCCCCCGG ACCCGCGATG CGATGGCGGG CGCGGCGGCG
GCGACGGCGC GGGAGTCGTT CTGCGCGCCC GTCCAGTCCG CCCGGCTGGA GGCACGGCTG
CTGTCGCTGC TGCCAGGGGC GACGGGATGA
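
The gene sequence is printed in blocks of 10 bases, 60 per row, so it has to be stripped of whitespace before any computation. The following is a minimal sketch of doing that and of computing a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as printed above (six blocks of 10 bases).
first_row = "GTGAAGACCG TCCTCGTCTA TCGCGATCGG CTGCTGCCGC CCTCGGAACA GGCCTTCATG"

print(len(clean_sequence(first_row)))    # 60
print(round(gc_fraction(first_row), 3))  # GC fraction of this row only
```

Passing the full 1110 bp sequence to `gc_fraction` in the same way gives a value that can be compared with the GC content listed in the record.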
 
Protein sequence
MKTVLVYRDR LLPPSEQAFM RRQYMGFRTL RPCWVGCRRD APAPDLPGDV RFLGGSGPLR 
PLRQMAFRQF GWGAAREVAD LAPVLVHAQF GRGGALALSI ARALGVPLVV TFHGGDAFKD
RHYAGGFPPS VFQRRWQALQ SHAALFVCVS EGVRDRLLER GVPARLLEVI PIGAEPAPLA
AGPADRFVFA GRFVDKKGVP VLIDAVRILA GRGVTPPVVL AGDGPLLPAM RDRAAGLANL
RFAGWLGAAD LAAEMDRAIA LLVPSVVPPG GDREGLPSVA VEAMARGVPV VASSQSGLEG
AVGHAGAGIV VPAGDPMALA DAMQAMLVPR TRDAMAGAAA ATARESFCAP VQSARLEARL
LSLLPGATG
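
The protein begins with M even though the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several accepted start codons, and a start codon is translated as methionine. The sketch below illustrates this on the first 30 bases of the gene; it is an illustration under stated assumptions, not the pipeline that produced this record. The `translate` function is a hypothetical helper, and the codon assignments and start-codon set are written out by hand from the published table 11.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons accepted by translation table 11 (assumption: taken from the
# published table, not from this record).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(text: str) -> str:
    """Translate a CDS, reading an accepted start codon as M and stopping at '*'."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAAGACCG TCCTCGTCTA TCGCGATCGG"))  # MKTVLVYRDR
```

The output matches the first ten residues of the protein sequence listed above.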