Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1102 |
Symbol | |
ID | 6974506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 1238088 |
End bp | 1240949 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643390631 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002275500 |
Protein GI | 209543271 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGTG ACCTCCTCCT GGCAGAATGG ACGACCGACA TCCCGATGGA CGCTGACGAA CGGCTTTCGC CCTGCCCGCT GCCCGAAGGG CTTCGCCTGA TGGGCCGCGA CGCGCTCGAT CCCTATTTCC TGGTGCCGCC CGAACTGGAC CCCGATCCCG CCATCAACGC GGTGCTGCCG TTCCTGGGCT GGATCGTCAC GCTGGCCCGG CCCCGCCGCA TCGGCCTGAT GCCGGCCCGG AAGGCGATCG CGGCCCTGAT GGCGGACGTG GCGCACCGGA TGCGCCTGCC GGCGGACATC CGCGCCCTGC CGTTCCAGCC GCCGCCTGCC GGATTCGACC TGCTATGGCT GGATCTTCCG CCGGCCAGCA CGCCGGATGC CGCCCTGGCA CCGACGCCCG ACCAGATGCT CCAGCTGCTG GGCCAGGGGG GCATTGTCGT GCTGCACGGC CTGGATAGCG GCGGCTGGGA CGACCTTTCC ATGGCGACCC TGAATCTGGG GCGCGGACTG GGCGTGCTGG TAGGGGGCGC GTGTCGCGGG GGCTCCGTGG CCAGCCTGTG CGCGATGCTG AATCGTTCCG ACGACGGCAC GGCAGCCAAC CTGGCCGCCC GCTTCGCCGC GATCGGCGCG CATTGGGCCG CCCGCCGCGC CCTGGCCGAC ACGCAGGCGG AACTGGACCG GACCCGGCAG GCGCTCAGTC ATCTGCGGCT GGATGCCATG GAGATGCGGC TGGCCCTGAA CCATCAGGAT GCCGCCGGGC AGGATGCGAA CCGGCAGGGG TCGTCCCCCC CGGCCGCGCC GCCCGTCCCC GTCACGCCGG CCGCACCGCC GCCGAAGGGA CCATCCCGGT GGCGTCGCCT TGCCCGCAGG CTGATCCGGG GGCCTGCTAC CCCTGCCCCT GCAAGCGCCG ACCGCACGAT CCGCACGGTC CTGTTCGTAT CCGGCGAACC GGGCACCCCC GGCACGACCT ACCGCGTCAC GCGCAACGCC GCCGCCTGCG CCGCCGCCGG ATACGCGACC CGGTGCAGGG ACTGCGCGGC GGTCGGGCCG GACGACATCG CATGGGCCGA CATGGTCGTG CTGTGGCGCG TGGAATATAG CGGCCATGTC GACACCTTGC TGGGCCTGGC CCGGGCGCGC GGCGCCGTGC TGGCCTTCGA TGCCGATGAC ATCGTGTTCG AACCCGCCCT GGCGCGCACC GACCTGATCG ACGGAATCCG CGTCAGTCCG GCCCCCGTGG CGCGGATCGA ACGGATGTAT GCCGACATGC AGCGCACCAT GCGCCAGTGC GACCTCGGCC TGGCCACCAC CGATACGCTG GCCGACTGGA TGCGCCCCTT CCTGAAGCTG ACGCTGGTGC TGCCGAACAC CTTCGATAAC GCGACGCTGC AGCGTGCACG CCACGCCGTC CGCCGGCGGG CGCTGGCCGC GCCCGACGCG GCGGATGACG TCGTGCGGAT AGGCTATGCC ACCGGATCGC GCACCCACCA GCGCGACTTC GCCCGTGCCC TGCCCGGCCT GCTGCGGGTC ATGGACCGAC GGGCGCAGGT GCGCCTGGTC CTGTTCCGCG AACCCGGCGG AGGGCGCCCC CTGCTGCTGA TCGAGGAATT TCCCGACCTG CACGCGCGGT CGGCGCAGAT CGAATGGCGC GACATGGTGA CGCTGGACGC GCTGCCGGAC GAACTGGCGC GGCTGGACAT CTCGATTGCC CCGTTGGAGG ACGGCAATCC GTTCTGCGAG GCCAAGAGCG AACTGAAATT CTTCGAGGCC GCGCTGGCCG GCGTCTGTAC CGTCGCCTCG CCCACCGCGC CGTTTCGCGC CGCCATCCGG CCGGGCGTGA CCGGCCTGCT GGCGGACGGT GCGGCGGAAT GGGAAAGCGC GCTGCTGCGT CTGGTGGACG ACCCCGCCCT GCGCCGCCGC ATGGCGCGCG ACGCGCTGCA CACGGTGCTG TGGGAATACG GCCCCCAGCG ACAGGCCGCC CTGCTGGGGC CGGCCATCGC CGGGCTGGGC GATGCGCGGG CAGCGGCACG GGCCGGCGCC ACCGCCCTGG CACGCGGCGC CTTCCGCGTC CGCGCCATTC CCCGGATCCC CGACAGCACG GTCCTGTTCA CCCAGGACCA TCTGCAGGAC GCCGCCGTCA CGGTCGTCGT GACCGCGTAT AATTATGCCG GCCACGTCAT CGAGGCCCTG GACTCCGTCC GCCGCCAGAC GCTCGACCCG CTGGACCTGA TCGTGGTCGA TGATGCCTCG ACCGACGATA CTCCGTCGCT GCTGACGGGC TGGGCGGCCC GGCATGGCGC ACGGTTCAAC CGGCTGCTGA TCCTGCGCGC CCGGCGCAAT GCCGGGCTGG GCGGCGCGCG CAATATCGGC ATGGCGGCGG CCGAAACCCC CTATGTCCTG CAACTGGACG CCGACAACCG CCTGCTGCCC GATGCCTGCG CCCGCTTGCT GGCCGCCATC GCGGCGGAAA GAGCGGGCTA TGCCTATCCC CTGATCCGCC AGTTCGGGCG CGAGGCCAGC GTGATGGGCG ATACCCCGTT CCATCCCGGG CGACTGGTCG GCGGCAATAC CATCGACGCC ATGGCGCTGG TGGCCAAATG GGCTTGGGCC GCCGCCGGCG GCTATTACGT GCGGCGCGAC GCCATGGGGT GGGAGGATTA CGACCTGTGG TGCACCCTGG CAGAACTGGG CATCGCCGGT ACCCAGGTGC CCGAAATCCT GGCCGAATAC CGCGTGCATG ACACGGCCAT GACCGACACG CTGACCGAAC GGCCGCACCA CAAGGACGCG GTAGTCACGC TGCTGCGAGA CCGCCATCCC TGGATTCGCC TGACGGCCCC CGAGACACGT GCGCGTTCAT GA
|
Protein sequence | MTRDLLLAEW TTDIPMDADE RLSPCPLPEG LRLMGRDALD PYFLVPPELD PDPAINAVLP FLGWIVTLAR PRRIGLMPAR KAIAALMADV AHRMRLPADI RALPFQPPPA GFDLLWLDLP PASTPDAALA PTPDQMLQLL GQGGIVVLHG LDSGGWDDLS MATLNLGRGL GVLVGGACRG GSVASLCAML NRSDDGTAAN LAARFAAIGA HWAARRALAD TQAELDRTRQ ALSHLRLDAM EMRLALNHQD AAGQDANRQG SSPPAAPPVP VTPAAPPPKG PSRWRRLARR LIRGPATPAP ASADRTIRTV LFVSGEPGTP GTTYRVTRNA AACAAAGYAT RCRDCAAVGP DDIAWADMVV LWRVEYSGHV DTLLGLARAR GAVLAFDADD IVFEPALART DLIDGIRVSP APVARIERMY ADMQRTMRQC DLGLATTDTL ADWMRPFLKL TLVLPNTFDN ATLQRARHAV RRRALAAPDA ADDVVRIGYA TGSRTHQRDF ARALPGLLRV MDRRAQVRLV LFREPGGGRP LLLIEEFPDL HARSAQIEWR DMVTLDALPD ELARLDISIA PLEDGNPFCE AKSELKFFEA ALAGVCTVAS PTAPFRAAIR PGVTGLLADG AAEWESALLR LVDDPALRRR MARDALHTVL WEYGPQRQAA LLGPAIAGLG DARAAARAGA TALARGAFRV RAIPRIPDST VLFTQDHLQD AAVTVVVTAY NYAGHVIEAL DSVRRQTLDP LDLIVVDDAS TDDTPSLLTG WAARHGARFN RLLILRARRN AGLGGARNIG MAAAETPYVL QLDADNRLLP DACARLLAAI AAERAGYAYP LIRQFGREAS VMGDTPFHPG RLVGGNTIDA MALVAKWAWA AAGGYYVRRD AMGWEDYDLW CTLAELGIAG TQVPEILAEY RVHDTAMTDT LTERPHHKDA VVTLLRDRHP WIRLTAPETR ARS
|
| |