Gene Information |
Locus tag | Gdia_1100 |
Symbol | |
ID | 6974504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1233844 |
End bp | 1236618 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643390629 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002275498 |
Protein GI | 209543269 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
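The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the database record: the function names `gene_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1233844, 1236618)
print(length_bp)                  # 2775
print(protein_length(length_bp))  # 924
```

Both results match the Gene Length and Protein Length fields listed in the record.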
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGGA TCGCGCCCCG GCCCCGATCG GCGCCGGACG ACAGGGACCT GGACCGCGCC GGCGCACGGT TGCAGGCCGC GCGCGCCACC GGCCTGCAGG CGGAATGCGA CCAGTTGCGC CGGGCGCTGG CGCTGCAGGA TCGCGCGCTG TCGGGTCGGG TGCTGCGGAT CGTCCGTGCC ATCCGGGCGG CGATGCGGGG GCGCGACCCG TTCGGGCGTC CGCTGGGCGT CGCCGCCGCC GCGCTGTGGC GAAAGACGCG GCGGGACGGC GTCGTGTCCG CCGCCCGCCT TGTGGCGCGC GTGTTGTCGC CGCCTCCGGA CGGCGGGGCG GATGGACCGG TGTCCAGCCA CGATGCGTAT GCCATGGCTG CCGGGCCGGC GGACTGGACG CCGCAGATCC TGATCATCGC GGAACTCAGC CTGGCCCAAT GTGCGAAATA TCGTGTGTGG CAAAGGGTCG AACAGGTGCG GCACCTGGGA TGGACCTGCC GCGTGGTGGA CTGGCGCGAC ACCGGCGAGG CGCTGACCGC GCTGCAATTC TGCACGCGCG TCGTCTTCTA CCGGGTTCCG GCCTTTGCCT CGGTCAGGAT GCTGCTGGCC GAAACACGAC GTCTGTCGGT GCCGTCATGG TGGGAAGTCG ATGACCTGAT CTTCGATCGG ACGCTGTATT TTCAGAACAA CAACCTGGCG GCGTTGCCCG AGGCCGAACG TGCCGGGCTT CTGTCCGGGG TCAGGCTGTT CCGGACCTGC ATGCTGTCCT GCGATCGCGG CATCGCCTCG ACCCCGGTGC TGGCGCGGGC GATGCGCGAG GCCGGCCTGC CGGCGGCGTC GGTCATCGAA AATGCGCTGG ACGAGGAAAC GCTGGCTGCC GCGGCTCTGG CCCGCGCCCG GGCAGCGGCG GGGCACGGGG CGGCTGACGG GACGGTGGTC ATCGTCTACG GGTCCGGTAC CCGCACCCAT GACGCCGATT TTCGCGTGGC GGTGCCGGGC CTGGTCGCGG CGATGGCGGC GGACGCGCGG CTGCGTCTGT GGATCGTGGG CGAACTGCAG GTGCCCCGCG CCCTGCAGGC GCTGGGCACA CGGGTGGTCG TCCTGCCCCT CCGTCCCTAT GCGGAGTACC TGGCGCTGAT GGCGCGCGCG GATATCGTCA TCGCCCCGCT GGAAGACAGC GTCTTCAATG ATGCCAAGAG CAACATCAAA TATCTGGAAG CCGCCAGTCT GGGCCTGCCG TCGGTCTGTT CGCCCCGCCG GGCGTTCGCG GACGTGATCG TGGATGGAAC GACCGGCTAT CTGGCCGCCA CGGATGCCGA CTGGACGCGG GCCCTGCTGC TGCTGTCGGG CGATGCGACG CTGCGCCGTC AGGTCGGGCG GCGGGCCCTG GCCGACATCC TGGACCGCTA TGCCCTGGCC CATATGGCCG AACGGCAGGT GGCGGCGGTG TTCGGCCGTC CGGCCGTGCC GGCCCTGAAT CCGGCTATGA AACCAGCCAT GAAACCGGTC GGGGGCGCGG TCACGGGCAG GCGGCTGCGG GTGCTGTGCG TGAATGTCTA TTACCCGCCG CGCGCCTTCG GCGGAGCCAC GCATGTGGCG GTGGAAATGG CCCAGCGGCT GCAGGCCGGC GGCCAGGCCG ACATCGCGGT GCTGACCACG CGCCCGGCGG AACCGGGGCG CCCGGCCTCG GCCCTGCGCT ACCGGCACCG TGGGGTGCCG GTGGTCGCGC TGGACGTGCC GGCGGAGCAT GACGGACTCG CGATGTTCCA CAACCCCGCG GCGGCTGCGA TCTTTGCCGA TTATGTCGCG 
GCTTTTCGCC CGGACGTGGT GCATGTCCAT GCTCCGCAGG GGCTGGGGGT CGGGCTGCTG GATGTCTGCC GGCACCAGGG GATTCCCTAT GTGCTGACAT TGCATGATGC GTGGTGGCTG TGCGACCGGC AGTTCATGGT GCGCGAGGAC GGCCAGTTCT GCGGGCAGGA ACGGATCGAT CCGCGCACCT GCCAGCGGTG CCGCCCGCAG GCCCGCTACC TGGCCGACCG GGCGGTGCTG GCGGGGGCCG GCCTGCGCGA TGCGGCGCTG CTGCTCAGCC CCAGTGCCGC GCATCGCCGG CTGCACATCG CCAACGGCGT CGATCCGGCG CGGATCGTGG TGCATCGCAA CGGATTTCGC TGGCCGAAGC GCCCGCGCAC GCCTGTGGCT CCCGGCGGCC GTGCCCTGCG GTTCGGCTAT GTCGGGGGCA GCGACGCGGT CAAGGGGTAT CCGGTGATCC GCGCGGCGTT CGAGGGGCTG GCGCGTGCCG ACTGGGTCCT GCGCCTGGTG GACAACAAGA CGGCGCTTGG CCTGCGATCC ATCGAAGTCG GCGACTGGCG GGTACAGGGC AAGCTGGAGG TTCTTCCGGC CTATGACGGC GAGACGGTCG ATGCGTTCTT CGATTCCATC GACGTTCTGC TGTTTCCCTC GCGCTGGCCG GAAAGTTACG GCCTGACGGT GCGCGAGGCC CTGGCCCGCG ACGTCTGGGT CGTCGCATCC GCGCCCGGCG GCCAGGCGGA GGATATCGTA CCCGGGGTGA ACGGCACATT GATCGGCCTG TCGGCGCCGG CATCCGATCT GGCGGCGGCG GTGACGGACC TGCTGGACCG CCCGGATCGC CTGGCCGGTT ATGTCAATCC GTGCAAGGAC CGGCTGGCGA CATGGGACGG GCAGGCGCGC GAACTGCTCG ATCTGCTGCG CGCGGCATCG GGATGGGTGC AGGCGGGCGA CGATCCCGCC ACCGCGTGTG GCTGA
|
Protein sequence | MNRIAPRPRS APDDRDLDRA GARLQAARAT GLQAECDQLR RALALQDRAL SGRVLRIVRA IRAAMRGRDP FGRPLGVAAA ALWRKTRRDG VVSAARLVAR VLSPPPDGGA DGPVSSHDAY AMAAGPADWT PQILIIAELS LAQCAKYRVW QRVEQVRHLG WTCRVVDWRD TGEALTALQF CTRVVFYRVP AFASVRMLLA ETRRLSVPSW WEVDDLIFDR TLYFQNNNLA ALPEAERAGL LSGVRLFRTC MLSCDRGIAS TPVLARAMRE AGLPAASVIE NALDEETLAA AALARARAAA GHGAADGTVV IVYGSGTRTH DADFRVAVPG LVAAMAADAR LRLWIVGELQ VPRALQALGT RVVVLPLRPY AEYLALMARA DIVIAPLEDS VFNDAKSNIK YLEAASLGLP SVCSPRRAFA DVIVDGTTGY LAATDADWTR ALLLLSGDAT LRRQVGRRAL ADILDRYALA HMAERQVAAV FGRPAVPALN PAMKPAMKPV GGAVTGRRLR VLCVNVYYPP RAFGGATHVA VEMAQRLQAG GQADIAVLTT RPAEPGRPAS ALRYRHRGVP VVALDVPAEH DGLAMFHNPA AAAIFADYVA AFRPDVVHVH APQGLGVGLL DVCRHQGIPY VLTLHDAWWL CDRQFMVRED GQFCGQERID PRTCQRCRPQ ARYLADRAVL AGAGLRDAAL LLSPSAAHRR LHIANGVDPA RIVVHRNGFR WPKRPRTPVA PGGRALRFGY VGGSDAVKGY PVIRAAFEGL ARADWVLRLV DNKTALGLRS IEVGDWRVQG KLEVLPAYDG ETVDAFFDSI DVLLFPSRWP ESYGLTVREA LARDVWVVAS APGGQAEDIV PGVNGTLIGL SAPASDLAAA VTDLLDRPDR LAGYVNPCKD RLATWDGQAR ELLDLLRAAS GWVQAGDDPA TACG
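The GC content field can in principle be recomputed from the gene sequence. The sketch below is only an illustration under assumptions: `gc_percent` is a hypothetical helper, it ignores the spaces that group the sequence into blocks of ten, and it counts only unambiguous A, C, G, and T bases. How the database itself rounds or computes the reported value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in a sequence string.

    Whitespace and any other characters are skipped, so the space-grouped
    layout used in the record can be passed in directly.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence, used as a short example.
print(gc_percent("ATGAACCGGA TCGCGCCCCG"))  # 70.0
```

Passing the full gene sequence string instead of the short example gives a figure that can be compared with the GC content field.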