Gene Gdia_1102 details


Gene Information

Locus tag: Gdia_1102
Symbol:
ID: 6974506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1238088
End bp: 1240949
Gene Length: 2862 bp
Protein Length: 953 aa
Translation table: 11
GC content: 71%
IMG OID: 643390631
Product: glycosyl transferase family 2
Protein accession: YP_002275500
Protein GI: 209543271
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
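
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1238088, 1240949)
print(length, protein_length_aa(length))  # prints "2862 953"
```

Under these assumptions both results agree with the Gene Length (2862 bp) and Protein Length (953 aa) fields of the record.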


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGTG ACCTCCTCCT GGCAGAATGG ACGACCGACA TCCCGATGGA CGCTGACGAA 
CGGCTTTCGC CCTGCCCGCT GCCCGAAGGG CTTCGCCTGA TGGGCCGCGA CGCGCTCGAT
CCCTATTTCC TGGTGCCGCC CGAACTGGAC CCCGATCCCG CCATCAACGC GGTGCTGCCG
TTCCTGGGCT GGATCGTCAC GCTGGCCCGG CCCCGCCGCA TCGGCCTGAT GCCGGCCCGG
AAGGCGATCG CGGCCCTGAT GGCGGACGTG GCGCACCGGA TGCGCCTGCC GGCGGACATC
CGCGCCCTGC CGTTCCAGCC GCCGCCTGCC GGATTCGACC TGCTATGGCT GGATCTTCCG
CCGGCCAGCA CGCCGGATGC CGCCCTGGCA CCGACGCCCG ACCAGATGCT CCAGCTGCTG
GGCCAGGGGG GCATTGTCGT GCTGCACGGC CTGGATAGCG GCGGCTGGGA CGACCTTTCC
ATGGCGACCC TGAATCTGGG GCGCGGACTG GGCGTGCTGG TAGGGGGCGC GTGTCGCGGG
GGCTCCGTGG CCAGCCTGTG CGCGATGCTG AATCGTTCCG ACGACGGCAC GGCAGCCAAC
CTGGCCGCCC GCTTCGCCGC GATCGGCGCG CATTGGGCCG CCCGCCGCGC CCTGGCCGAC
ACGCAGGCGG AACTGGACCG GACCCGGCAG GCGCTCAGTC ATCTGCGGCT GGATGCCATG
GAGATGCGGC TGGCCCTGAA CCATCAGGAT GCCGCCGGGC AGGATGCGAA CCGGCAGGGG
TCGTCCCCCC CGGCCGCGCC GCCCGTCCCC GTCACGCCGG CCGCACCGCC GCCGAAGGGA
CCATCCCGGT GGCGTCGCCT TGCCCGCAGG CTGATCCGGG GGCCTGCTAC CCCTGCCCCT
GCAAGCGCCG ACCGCACGAT CCGCACGGTC CTGTTCGTAT CCGGCGAACC GGGCACCCCC
GGCACGACCT ACCGCGTCAC GCGCAACGCC GCCGCCTGCG CCGCCGCCGG ATACGCGACC
CGGTGCAGGG ACTGCGCGGC GGTCGGGCCG GACGACATCG CATGGGCCGA CATGGTCGTG
CTGTGGCGCG TGGAATATAG CGGCCATGTC GACACCTTGC TGGGCCTGGC CCGGGCGCGC
GGCGCCGTGC TGGCCTTCGA TGCCGATGAC ATCGTGTTCG AACCCGCCCT GGCGCGCACC
GACCTGATCG ACGGAATCCG CGTCAGTCCG GCCCCCGTGG CGCGGATCGA ACGGATGTAT
GCCGACATGC AGCGCACCAT GCGCCAGTGC GACCTCGGCC TGGCCACCAC CGATACGCTG
GCCGACTGGA TGCGCCCCTT CCTGAAGCTG ACGCTGGTGC TGCCGAACAC CTTCGATAAC
GCGACGCTGC AGCGTGCACG CCACGCCGTC CGCCGGCGGG CGCTGGCCGC GCCCGACGCG
GCGGATGACG TCGTGCGGAT AGGCTATGCC ACCGGATCGC GCACCCACCA GCGCGACTTC
GCCCGTGCCC TGCCCGGCCT GCTGCGGGTC ATGGACCGAC GGGCGCAGGT GCGCCTGGTC
CTGTTCCGCG AACCCGGCGG AGGGCGCCCC CTGCTGCTGA TCGAGGAATT TCCCGACCTG
CACGCGCGGT CGGCGCAGAT CGAATGGCGC GACATGGTGA CGCTGGACGC GCTGCCGGAC
GAACTGGCGC GGCTGGACAT CTCGATTGCC CCGTTGGAGG ACGGCAATCC GTTCTGCGAG
GCCAAGAGCG AACTGAAATT CTTCGAGGCC GCGCTGGCCG GCGTCTGTAC CGTCGCCTCG
CCCACCGCGC CGTTTCGCGC CGCCATCCGG CCGGGCGTGA CCGGCCTGCT GGCGGACGGT
GCGGCGGAAT GGGAAAGCGC GCTGCTGCGT CTGGTGGACG ACCCCGCCCT GCGCCGCCGC
ATGGCGCGCG ACGCGCTGCA CACGGTGCTG TGGGAATACG GCCCCCAGCG ACAGGCCGCC
CTGCTGGGGC CGGCCATCGC CGGGCTGGGC GATGCGCGGG CAGCGGCACG GGCCGGCGCC
ACCGCCCTGG CACGCGGCGC CTTCCGCGTC CGCGCCATTC CCCGGATCCC CGACAGCACG
GTCCTGTTCA CCCAGGACCA TCTGCAGGAC GCCGCCGTCA CGGTCGTCGT GACCGCGTAT
AATTATGCCG GCCACGTCAT CGAGGCCCTG GACTCCGTCC GCCGCCAGAC GCTCGACCCG
CTGGACCTGA TCGTGGTCGA TGATGCCTCG ACCGACGATA CTCCGTCGCT GCTGACGGGC
TGGGCGGCCC GGCATGGCGC ACGGTTCAAC CGGCTGCTGA TCCTGCGCGC CCGGCGCAAT
GCCGGGCTGG GCGGCGCGCG CAATATCGGC ATGGCGGCGG CCGAAACCCC CTATGTCCTG
CAACTGGACG CCGACAACCG CCTGCTGCCC GATGCCTGCG CCCGCTTGCT GGCCGCCATC
GCGGCGGAAA GAGCGGGCTA TGCCTATCCC CTGATCCGCC AGTTCGGGCG CGAGGCCAGC
GTGATGGGCG ATACCCCGTT CCATCCCGGG CGACTGGTCG GCGGCAATAC CATCGACGCC
ATGGCGCTGG TGGCCAAATG GGCTTGGGCC GCCGCCGGCG GCTATTACGT GCGGCGCGAC
GCCATGGGGT GGGAGGATTA CGACCTGTGG TGCACCCTGG CAGAACTGGG CATCGCCGGT
ACCCAGGTGC CCGAAATCCT GGCCGAATAC CGCGTGCATG ACACGGCCAT GACCGACACG
CTGACCGAAC GGCCGCACCA CAAGGACGCG GTAGTCACGC TGCTGCGAGA CCGCCATCCC
TGGATTCGCC TGACGGCCCC CGAGACACGT GCGCGTTCAT GA
 
Protein sequence
MTRDLLLAEW TTDIPMDADE RLSPCPLPEG LRLMGRDALD PYFLVPPELD PDPAINAVLP 
FLGWIVTLAR PRRIGLMPAR KAIAALMADV AHRMRLPADI RALPFQPPPA GFDLLWLDLP
PASTPDAALA PTPDQMLQLL GQGGIVVLHG LDSGGWDDLS MATLNLGRGL GVLVGGACRG
GSVASLCAML NRSDDGTAAN LAARFAAIGA HWAARRALAD TQAELDRTRQ ALSHLRLDAM
EMRLALNHQD AAGQDANRQG SSPPAAPPVP VTPAAPPPKG PSRWRRLARR LIRGPATPAP
ASADRTIRTV LFVSGEPGTP GTTYRVTRNA AACAAAGYAT RCRDCAAVGP DDIAWADMVV
LWRVEYSGHV DTLLGLARAR GAVLAFDADD IVFEPALART DLIDGIRVSP APVARIERMY
ADMQRTMRQC DLGLATTDTL ADWMRPFLKL TLVLPNTFDN ATLQRARHAV RRRALAAPDA
ADDVVRIGYA TGSRTHQRDF ARALPGLLRV MDRRAQVRLV LFREPGGGRP LLLIEEFPDL
HARSAQIEWR DMVTLDALPD ELARLDISIA PLEDGNPFCE AKSELKFFEA ALAGVCTVAS
PTAPFRAAIR PGVTGLLADG AAEWESALLR LVDDPALRRR MARDALHTVL WEYGPQRQAA
LLGPAIAGLG DARAAARAGA TALARGAFRV RAIPRIPDST VLFTQDHLQD AAVTVVVTAY
NYAGHVIEAL DSVRRQTLDP LDLIVVDDAS TDDTPSLLTG WAARHGARFN RLLILRARRN
AGLGGARNIG MAAAETPYVL QLDADNRLLP DACARLLAAI AAERAGYAYP LIRQFGREAS
VMGDTPFHPG RLVGGNTIDA MALVAKWAWA AAGGYYVRRD AMGWEDYDLW CTLAELGIAG
TQVPEILAEY RVHDTAMTDT LTERPHHKDA VVTLLRDRHP WIRLTAPETR ARS
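
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is a minimal example, not the pipeline that produced the record: it assumes the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code), and it ignores the rule that alternative start codons are read as methionine, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their
# amino acids; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and trailing stops."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("ATGACGCGTG ACCTCCTCCT GGCAGAATGG"))  # prints "MTRDLLLAEW"
```

The output matches the first ten residues of the protein sequence. Translating the final line of the gene sequence the same way yields `WIRLTAPETRARS`, the C-terminal residues, with the closing TGA read as the stop codon.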