Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2640 |
Symbol | |
ID | 6976070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2909493 |
End bp | 2910848 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643392155 |
Product | putative L-sorbosone dehydrogenase |
Protein accession | YP_002276996 |
Protein GI | 209544767 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.532913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGTT CGCGCATCTG GGTACTGGGG GGCGGTGCCG CCCTCGTTCT GCTGCTGCTG GCCGGCTGCT ACCGCATGGC GGCGCGGCCG GAACAGGCGA CGCTGACCCT TGCGGCCGGA ACGGGCGCGC ATCCGCTGCT GCCGCCGCCC AACCCGACGC TGCTGCCCAC GGTGAATATC GCCACCCCGG TGGGCTGGGC GGCGGGGGCC ACGCCCCGCG TGGTGCCCGG GCTGGCGGTG GCGGCGTTCG CCACCGGCCT GGACCATCCG CGCTGGCTGT ACCGCCTGCC CAACGGCGAT ATCCTGGTCG CGGAATCGAA TTCGCCCATG ACCGATATCA CGACGCTGAA GAACCGGATC GCCCGCTTCG TCATGGGGGC GGTCGGGGCG GGGGATAAAA GCCCGGACCG GATCATCCTG CTGCGCGACT CGACGGGGCG CGGGGTGGCC GACCAGCGGA CGGTGTTCCT GGATCATCTG CGCTCGCCGT TCGGCATGGC GCTGGTGGGC GATACGCTGT ACGTCGCCAA TGCCGATTCG CTGGTGCGCT TTCCCTACCA AACGGGCCAG ACCCGGATCG ATGCGCCGGG GACCAGGGTG ATCGACCTGC CAGCGGGCTA TAACCATCAC TGGACCAAAA ACATCCTGGC CAGCCCGGAC GGCAGCGCGC TGTACATCAC CGTGGGGTCG AACAGCAATG TGGCCGAGCA CGGCATGGCG GTCGAGGAAG GACGGGCGCG GATCGACCGG TTCGACATCG CCACCGGCAC GCTGCGCCCC TTCGCCACCG GCCTGCGCAA CCCCAACGGC CTGGCCTGGA ACCCGCAGAC CGGCGCGTTG TGGACCGCCG TCAACGAACG CGACGAGATC GGCAGCGACC TGGTGCCCGA CTACATCACC GCCGTGCAGG AGGGCGCGTT CTATGGCTGG CCCTACAGCT ATTACGGGCA GCATGTGGAC GTGCGCGTGA CGCCGCAGCG GCCCGACCTG GTGGCGCAGG CCATTGCACC CGACTATGCG CTGGGGCCGC ATACAGCGTC GCTGGGGATC GCGTTTTCGC AGGCCAGCAC GCTGCCCGAA GCCTGGCGGC ACGGGCTGTT CGTGGCGCAG CACGGATCGT GGAACCGCTG GCCCAAGAGC GGCTATCGCG TGATCTACGT CCCGTTCGTC GACGGTGCCC CGGTGGAAGA GCCGCGCGAG GTGCTGACCG GCTTCCTGAC CGCCGACGAA AAGCACGTCC ACGGCCGCCC CGTTGGCCTG GCGCTGGACG GCACGGGCGC GCTGCTGGTG GCCGACGACG TGGGCAACAC CGTCTGGCGC GTGACCGGCG CCCCGGGCAT GCCGCAGGGA CAATAA
|
Protein sequence | MRRSRIWVLG GGAALVLLLL AGCYRMAARP EQATLTLAAG TGAHPLLPPP NPTLLPTVNI ATPVGWAAGA TPRVVPGLAV AAFATGLDHP RWLYRLPNGD ILVAESNSPM TDITTLKNRI ARFVMGAVGA GDKSPDRIIL LRDSTGRGVA DQRTVFLDHL RSPFGMALVG DTLYVANADS LVRFPYQTGQ TRIDAPGTRV IDLPAGYNHH WTKNILASPD GSALYITVGS NSNVAEHGMA VEEGRARIDR FDIATGTLRP FATGLRNPNG LAWNPQTGAL WTAVNERDEI GSDLVPDYIT AVQEGAFYGW PYSYYGQHVD VRVTPQRPDL VAQAIAPDYA LGPHTASLGI AFSQASTLPE AWRHGLFVAQ HGSWNRWPKS GYRVIYVPFV DGAPVEEPRE VLTGFLTADE KHVHGRPVGL ALDGTGALLV ADDVGNTVWR VTGAPGMPQG Q
|
| |