Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1271 |
Symbol | |
ID | 4077431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1369443 |
End bp | 1370549 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006579 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_613266 |
Protein GI | 99081112 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACGCC CGCTCGCCGT GATCGCCAGT TTGATGTTTA GTGCCGCGAT GGCCGTGCCC TCGCTGGCGC AGCAAATGGA CTCAAGCCAA GGCACGCTCC GCGTTGAAAA AATGGCCGAT GGGTTTGATG TTCCTTGGGG TTTTACGTTT TTGCCCGGAC GCGCACTGCT GGTCACGGAG CGCTCCGGAC AGCTTTGGTA TCTGAACGGC GCGCGACGCC AACAGGTCGA TGGCGTCCCT GAGATCGCTG CGGACGGGCA GGGTGGCCTT CTGGACGTTG TTGCCGCGCG GGATTTCGTT CAGAGCCGCA CCGTATATCT CACGTTTGCC CGTCCGCAGG GGCGCGGCGC GGGGACCGCT GTGGCACGGG CTGAACTCTC TGAAGACGGC AGCCGATTCG ACTCGCTTGA GGTGATTTTT GAAGCCACGC CAGGCGCGCG GGGCGGGCGG CACTTTGGCT CGCGGCTGGT CGAAGCCCCC GACGGCAGTC TTTATGTCAG CCTCGGAGAA CGTGGTGACC GTCCCAGCGC ACAGGATCTG TCACGCGAGC AGGGTTCGAT TATTCGCATC CTTCCGGATG GCAGCATTCC CTCGGACAAT CCTTTTGTAA ATTCTGAGGA CGCGCGTCCG GCGATCTGGT CCTACGGCCA CCGCAACCCG CAGGGCATGG CGCTTGATGC GGCCGGCGAC ATCTGGGCCG TTGAACATGG CGCGCGCGGC GGCGATGAGA TCAACCGGAT CACACGGGGC GCCAACTATG GTTGGCCGGT CATTTCCTAC GGGCGCCACT ATTCGGGGCT GAAGATCGGC GAGGGCACCG AAAAGCCGGG GCTGCAACAG CCGGAGTGGT ATTGGGATCC CTCCATCGCG CCCTCGGGTA TGATGATCTA CTCGGGCAAG CTCTGGCCCA ACTGGCGCGG AGACATCTTT GTGGGATCCC TGAAATTTGA TTACATCTCA AGGCTCTCGG GGGCACCCCT GCAGGAGGTC GAGCAGATGA AATCGCCCGA AACCGCAAGG GTGCGCGATA TCCGCGAAGC CCCCGATGGC AGCATCTGGT TTGCCTCGGA ATACGAGGGC GCCCTCTTTC GGATCACCCC GAACTGA
|
Protein sequence | MLRPLAVIAS LMFSAAMAVP SLAQQMDSSQ GTLRVEKMAD GFDVPWGFTF LPGRALLVTE RSGQLWYLNG ARRQQVDGVP EIAADGQGGL LDVVAARDFV QSRTVYLTFA RPQGRGAGTA VARAELSEDG SRFDSLEVIF EATPGARGGR HFGSRLVEAP DGSLYVSLGE RGDRPSAQDL SREQGSIIRI LPDGSIPSDN PFVNSEDARP AIWSYGHRNP QGMALDAAGD IWAVEHGARG GDEINRITRG ANYGWPVISY GRHYSGLKIG EGTEKPGLQQ PEWYWDPSIA PSGMMIYSGK LWPNWRGDIF VGSLKFDYIS RLSGAPLQEV EQMKSPETAR VRDIREAPDG SIWFASEYEG ALFRITPN
|
| |