Gene Gdia_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1100 
Symbol 
ID6974504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1233844 
End bp1236618 
Gene Length2775 bp 
Protein Length924 aa 
Translation table11 
GC content71% 
IMG OID643390629 
Productglycosyl transferase group 1 
Protein accessionYP_002275498 
Protein GI209543269 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGGA TCGCGCCCCG GCCCCGATCG GCGCCGGACG ACAGGGACCT GGACCGCGCC 
GGCGCACGGT TGCAGGCCGC GCGCGCCACC GGCCTGCAGG CGGAATGCGA CCAGTTGCGC
CGGGCGCTGG CGCTGCAGGA TCGCGCGCTG TCGGGTCGGG TGCTGCGGAT CGTCCGTGCC
ATCCGGGCGG CGATGCGGGG GCGCGACCCG TTCGGGCGTC CGCTGGGCGT CGCCGCCGCC
GCGCTGTGGC GAAAGACGCG GCGGGACGGC GTCGTGTCCG CCGCCCGCCT TGTGGCGCGC
GTGTTGTCGC CGCCTCCGGA CGGCGGGGCG GATGGACCGG TGTCCAGCCA CGATGCGTAT
GCCATGGCTG CCGGGCCGGC GGACTGGACG CCGCAGATCC TGATCATCGC GGAACTCAGC
CTGGCCCAAT GTGCGAAATA TCGTGTGTGG CAAAGGGTCG AACAGGTGCG GCACCTGGGA
TGGACCTGCC GCGTGGTGGA CTGGCGCGAC ACCGGCGAGG CGCTGACCGC GCTGCAATTC
TGCACGCGCG TCGTCTTCTA CCGGGTTCCG GCCTTTGCCT CGGTCAGGAT GCTGCTGGCC
GAAACACGAC GTCTGTCGGT GCCGTCATGG TGGGAAGTCG ATGACCTGAT CTTCGATCGG
ACGCTGTATT TTCAGAACAA CAACCTGGCG GCGTTGCCCG AGGCCGAACG TGCCGGGCTT
CTGTCCGGGG TCAGGCTGTT CCGGACCTGC ATGCTGTCCT GCGATCGCGG CATCGCCTCG
ACCCCGGTGC TGGCGCGGGC GATGCGCGAG GCCGGCCTGC CGGCGGCGTC GGTCATCGAA
AATGCGCTGG ACGAGGAAAC GCTGGCTGCC GCGGCTCTGG CCCGCGCCCG GGCAGCGGCG
GGGCACGGGG CGGCTGACGG GACGGTGGTC ATCGTCTACG GGTCCGGTAC CCGCACCCAT
GACGCCGATT TTCGCGTGGC GGTGCCGGGC CTGGTCGCGG CGATGGCGGC GGACGCGCGG
CTGCGTCTGT GGATCGTGGG CGAACTGCAG GTGCCCCGCG CCCTGCAGGC GCTGGGCACA
CGGGTGGTCG TCCTGCCCCT CCGTCCCTAT GCGGAGTACC TGGCGCTGAT GGCGCGCGCG
GATATCGTCA TCGCCCCGCT GGAAGACAGC GTCTTCAATG ATGCCAAGAG CAACATCAAA
TATCTGGAAG CCGCCAGTCT GGGCCTGCCG TCGGTCTGTT CGCCCCGCCG GGCGTTCGCG
GACGTGATCG TGGATGGAAC GACCGGCTAT CTGGCCGCCA CGGATGCCGA CTGGACGCGG
GCCCTGCTGC TGCTGTCGGG CGATGCGACG CTGCGCCGTC AGGTCGGGCG GCGGGCCCTG
GCCGACATCC TGGACCGCTA TGCCCTGGCC CATATGGCCG AACGGCAGGT GGCGGCGGTG
TTCGGCCGTC CGGCCGTGCC GGCCCTGAAT CCGGCTATGA AACCAGCCAT GAAACCGGTC
GGGGGCGCGG TCACGGGCAG GCGGCTGCGG GTGCTGTGCG TGAATGTCTA TTACCCGCCG
CGCGCCTTCG GCGGAGCCAC GCATGTGGCG GTGGAAATGG CCCAGCGGCT GCAGGCCGGC
GGCCAGGCCG ACATCGCGGT GCTGACCACG CGCCCGGCGG AACCGGGGCG CCCGGCCTCG
GCCCTGCGCT ACCGGCACCG TGGGGTGCCG GTGGTCGCGC TGGACGTGCC GGCGGAGCAT
GACGGACTCG CGATGTTCCA CAACCCCGCG GCGGCTGCGA TCTTTGCCGA TTATGTCGCG
GCTTTTCGCC CGGACGTGGT GCATGTCCAT GCTCCGCAGG GGCTGGGGGT CGGGCTGCTG
GATGTCTGCC GGCACCAGGG GATTCCCTAT GTGCTGACAT TGCATGATGC GTGGTGGCTG
TGCGACCGGC AGTTCATGGT GCGCGAGGAC GGCCAGTTCT GCGGGCAGGA ACGGATCGAT
CCGCGCACCT GCCAGCGGTG CCGCCCGCAG GCCCGCTACC TGGCCGACCG GGCGGTGCTG
GCGGGGGCCG GCCTGCGCGA TGCGGCGCTG CTGCTCAGCC CCAGTGCCGC GCATCGCCGG
CTGCACATCG CCAACGGCGT CGATCCGGCG CGGATCGTGG TGCATCGCAA CGGATTTCGC
TGGCCGAAGC GCCCGCGCAC GCCTGTGGCT CCCGGCGGCC GTGCCCTGCG GTTCGGCTAT
GTCGGGGGCA GCGACGCGGT CAAGGGGTAT CCGGTGATCC GCGCGGCGTT CGAGGGGCTG
GCGCGTGCCG ACTGGGTCCT GCGCCTGGTG GACAACAAGA CGGCGCTTGG CCTGCGATCC
ATCGAAGTCG GCGACTGGCG GGTACAGGGC AAGCTGGAGG TTCTTCCGGC CTATGACGGC
GAGACGGTCG ATGCGTTCTT CGATTCCATC GACGTTCTGC TGTTTCCCTC GCGCTGGCCG
GAAAGTTACG GCCTGACGGT GCGCGAGGCC CTGGCCCGCG ACGTCTGGGT CGTCGCATCC
GCGCCCGGCG GCCAGGCGGA GGATATCGTA CCCGGGGTGA ACGGCACATT GATCGGCCTG
TCGGCGCCGG CATCCGATCT GGCGGCGGCG GTGACGGACC TGCTGGACCG CCCGGATCGC
CTGGCCGGTT ATGTCAATCC GTGCAAGGAC CGGCTGGCGA CATGGGACGG GCAGGCGCGC
GAACTGCTCG ATCTGCTGCG CGCGGCATCG GGATGGGTGC AGGCGGGCGA CGATCCCGCC
ACCGCGTGTG GCTGA
 
Protein sequence
MNRIAPRPRS APDDRDLDRA GARLQAARAT GLQAECDQLR RALALQDRAL SGRVLRIVRA 
IRAAMRGRDP FGRPLGVAAA ALWRKTRRDG VVSAARLVAR VLSPPPDGGA DGPVSSHDAY
AMAAGPADWT PQILIIAELS LAQCAKYRVW QRVEQVRHLG WTCRVVDWRD TGEALTALQF
CTRVVFYRVP AFASVRMLLA ETRRLSVPSW WEVDDLIFDR TLYFQNNNLA ALPEAERAGL
LSGVRLFRTC MLSCDRGIAS TPVLARAMRE AGLPAASVIE NALDEETLAA AALARARAAA
GHGAADGTVV IVYGSGTRTH DADFRVAVPG LVAAMAADAR LRLWIVGELQ VPRALQALGT
RVVVLPLRPY AEYLALMARA DIVIAPLEDS VFNDAKSNIK YLEAASLGLP SVCSPRRAFA
DVIVDGTTGY LAATDADWTR ALLLLSGDAT LRRQVGRRAL ADILDRYALA HMAERQVAAV
FGRPAVPALN PAMKPAMKPV GGAVTGRRLR VLCVNVYYPP RAFGGATHVA VEMAQRLQAG
GQADIAVLTT RPAEPGRPAS ALRYRHRGVP VVALDVPAEH DGLAMFHNPA AAAIFADYVA
AFRPDVVHVH APQGLGVGLL DVCRHQGIPY VLTLHDAWWL CDRQFMVRED GQFCGQERID
PRTCQRCRPQ ARYLADRAVL AGAGLRDAAL LLSPSAAHRR LHIANGVDPA RIVVHRNGFR
WPKRPRTPVA PGGRALRFGY VGGSDAVKGY PVIRAAFEGL ARADWVLRLV DNKTALGLRS
IEVGDWRVQG KLEVLPAYDG ETVDAFFDSI DVLLFPSRWP ESYGLTVREA LARDVWVVAS
APGGQAEDIV PGVNGTLIGL SAPASDLAAA VTDLLDRPDR LAGYVNPCKD RLATWDGQAR
ELLDLLRAAS GWVQAGDDPA TACG