Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0737 |
Symbol | |
ID | 6974134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 838675 |
End bp | 840789 |
Gene Length | 2115 bp |
Protein Length | 704 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643390266 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002275142 |
Protein GI | 209542913 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.875761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGC TTCAATTATC TCCCTTGGCC GGCCTTTCGG GCATGCCGAC GGACAGTCAG CATGTTCCCC CCACGCAGGT ATTCCGCATC CTTGGGCGCC ATCGCCTGGC GGTCGTGCTG GTGACGGTGG GGATGTTCGC TCCGGCGGTG GCGTTCATCG AGACGATGAA ACCCTATTAC ACCGCGACGG CGATGCTGAT GGTGGGCACG CGGCAGGCGT CCTTCCGCGA CCTGCAGGCC ACGGTATCGA CCCCCGACAT CGATGCCGTC GGCATCAATA CGCAGGTCGG CGTCCTGCGC AGTTCGACCA TTGCCCGGGC GGTGACCGAA CGCCTGAACC TGGTCGACGA TCCGGAATTC CGCAAGGTCC TCGATACCGT GCCGCTCAAG GCCCGGATCG TCCTTGCCGT GCAGAAGCTG TTCGGCATCG CGCCGCCGCC CGCGCCCCCC ATGACGCCGG CGGCGCGGCT GCAGGCGACG GCGCTCGTGC TGGCCGACAA GGTCAATATC CTCAATGACG GCCGGTCCTA TATCATCACG ATCACGGCGA AGACCGACAA TCCGGCATCC AGCGCCAGGA TCGCGAATGC CTATGCGGAC GCCTATTTCG AATCCCGGCG TCACATGAAG GTCGCTGCGA CATGGCGGGC CAATGCCCTG CTGGATGAAC AGATCGTCCC GTTGCGCGAA AGGCTGCGTC ATGCGGAACA GGCCGTCGAG GCATTTCGCG AACAGAATGG CCTGATCTCG GCCCGCCTGG AACATATGCC CGGCACGCCG GAGGCCGATG CGACCACGGT CGCCGACGAG CAGTTGATGC GCATGAACCA GGAACTGGTC ACCGCCCAGG CCGCGTTGGA GGAAAAGCGC GCACGCTATT CGGAAGTCCG GACCGCGGAA CGCAACGGCA CGGTGGGGAA CCTGCCGGAC GTGGTCTCGG CCCCGCTGAT CCAGCAATTG CAGAACCAGC AGGCCCAGTT GAGCAGCCGC GTGTCCAGCC TCAGCCGCTC CGTCCTGGAC GGCAATCCCG AGATGCAGGC CGCCCAGGCC GCCGCCGCCC AGGTGCGGCG CCAGATCGGC GCCGAGACGG CGCGGATCGC CGACAGCATC GCCAAGGATG TCAGCGGGGC GCAGGCGCGC GTCGCGGCGC TGAGCCACGC CGTGGACAGC CTGCAGAAGC AGGTGACCGC CGAAAACCAG GCCAATGTCA CCCTGCGCCA GCTGGAAAGC GAGGCCAACG CCGCACGCGT CGTCTACCAG GATTATCTGG GCCGCTTTGC CCAGACCTCG ACCCAGGCGC AGTTGCAGGA ACCGGAAGCG GAACTGATTT CGCGCGCCGA GATCCCGCTG GGCTTTTCGG GGCCGCCGCG GACCCAGTAC CTGGCCATCG CCCTGCTGTT CTCGATGCTG TGCGGGACCG GCGCGGCCCT GCTGGCGGAT CGCATTCGCA AGGGAATCCG CAGCACGTCG CAACTGGATT CGGTCCCCGG CCTGTTCACG CTGGGCATGG TCCCGGTCTT CAACGGCGCT CTGGTGCGGC ATTACCGGTC CGCTGCCGCA GGCGTCTCCT CGGCCTATGT CGAGACGATC GAGAACATCC GCAGCATCCT GTGCTTCGGT CACAGCCGCT TCCGCGCCAA GGTGGTCCTG GTCACCTCCG CCCGCCCCGG GGAAGGCAAG ACGACCTTTG CCGTGTCCCT GGCCGCCAAT GCGGGCCGCG ACCTGCAACG CGCGCTGGTG ATCGACTGCG ATTCCCGCAA CCCCTCGGCA TTGGGCGCGC TGGGCAAGAC GGACCAGGCC AGCGACAATT CCCTGCCCGG TGAGCGTGGC CACGGGCAGG CTGGCGCGCG ACGTGATGCC GGGGGTCGAT ATCCTGACCA TGCGGCCGCC GGGCGAACGC AATTACGCCA TGGTGTCGCC GATGGAACTG AGCCGCGTCC TGACGCAGTT TTCACCCCAT TACGACATGA TCGTCCTGGA CACCCCCCCG ATCCTGGCCT TCCCCGATGC CGCCGTCCTG GCCCAGCAGA CCGACGGCAT CGTCATGGTC GTCAAATGGG GGTTGACCGG GTCCATGGAA CTGACCGAGG CGATGCGGAT CCTGCATGCC TATGA
|
Protein sequence | MNQLQLSPLA GLSGMPTDSQ HVPPTQVFRI LGRHRLAVVL VTVGMFAPAV AFIETMKPYY TATAMLMVGT RQASFRDLQA TVSTPDIDAV GINTQVGVLR SSTIARAVTE RLNLVDDPEF RKVLDTVPLK ARIVLAVQKL FGIAPPPAPP MTPAARLQAT ALVLADKVNI LNDGRSYIIT ITAKTDNPAS SARIANAYAD AYFESRRHMK VAATWRANAL LDEQIVPLRE RLRHAEQAVE AFREQNGLIS ARLEHMPGTP EADATTVADE QLMRMNQELV TAQAALEEKR ARYSEVRTAE RNGTVGNLPD VVSAPLIQQL QNQQAQLSSR VSSLSRSVLD GNPEMQAAQA AAAQVRRQIG AETARIADSI AKDVSGAQAR VAALSHAVDS LQKQVTAENQ ANVTLRQLES EANAARVVYQ DYLGRFAQTS TQAQLQEPEA ELISRAEIPL GFSGPPRTQY LAIALLFSML CGTGAALLAD RIRKGIRSTS QLDSVPGLFT LGMVPVFNGA LVRHYRSAAA GVSSAYVETI ENIRSILCFG HSRFRAKVVL VTSARPGEGK TTFAVSLAAN AGRDLQRALV IDCDSRNPSA LGALGKTDQA SDNSLPGERG HGQAGARRDA GGRYPDHAAA GRTQLRHGVA DGTEPRPDAV FTPLRHDRPG HPPDPGLPRC RRPGPADRRH RHGRQMGVDR VHGTDRGDAD PACL
|
| |