Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1872 |
Symbol | |
ID | 5712864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1951375 |
End bp | 1952643 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641267796 |
Product | putative soluble aldose sugar dehydrogenase |
Protein accession | YP_001533215 |
Protein GI | 159044421 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.413591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.509919 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGAC TGTCTGCCCT GTCGCTCGGC GCGGCACTGG CCTCTGCAAC CGCGCTTGCA GCTCCGGCCT TCGCGGCCAA CACCACGTCC CTGAGCCACG AGATTGTGCT CGAAGGGCTC GAGAATCCGT GGGACGTCGC CTTTCTCGAA GACGGCACGA TGTTCTTCAC CGAGAAATGC CTCGGCCTGT CCGTGCGCCT GCCCGATGGC TCGGTCAACA AGCTCTTGGG CATGAAGGGC ACCGACGACG ACTATGCCTC CACCGCCGAG GACCTGTTTT GCGAAGGCCA GGCCGGGATG CAGGGCGTCG CGGTTGACCC GGACTTCGCC GAGAATCGGC AGATCTATGT CTATTCGACC TCCGACCTGA CCGCGCCCGG CTCCAACCGC CTGTTGCGGA TGACCGTGGG CGAGGATCTC GCCAGCGTGG CGGACCGCAC CGACATCGTC GAGGACGTGC CCTACAAGCC CGCCGCGACC GACCACCCCT TCGGTGGCCC CGGCGCCCAT AACGGCGGCC GCGTGCGCTT CGGCCCGGAC GGGTTCATCT ACCTGACCAC CGGCGACACC CATAACGGCG AAGGCCCGCA AAGCCCGACC CTGCTGGCGG GCAAGGTGCT GCGCATCGAC CGCGACGGCA ATGCCGCCGA GGGCAACGCG CCCCCCGAAG GCTTCGATCC GCGGATCTAC ACCTACGGGC ACCGCAACAC CCAGGGCATC ACCTTCCACC CGGAAACCGG TGCTGCCATC ACCGCCGAGC ACGGCCCCTG GCACTCCGAC GAGATCACCG TTCTGCAGAA CGGCGGCAAT GCCGGCTGGG ATCCGCGCCC GAACGTGGGC GGCCGAGGCG AATGCCCGGA TGGCTACTGC GGCTACTCCC CGAACCAGAT GGAGGGCATG GACCGCTACG AGCGCGCGGC CTTCATGCCG ATGACCGATC TGGCAACCTA TCCGGACGCG ATGCAGCCGA TCTGGGACAA TAACGGCTGG TCCCAGGGCA CGTCCTCTGC CGAGTTCCTG ACCGGGGACC AGTGGGGCGA CTGGGAAAAC CACCTGGTCG TCGGCATCAT GGGCATTGGC TTCGGCGGCA CGCCCATCGG CCAGCGCATC GACGTGATCG AGCTCAACGA GGCGGGCACC GAGGTGGTCG ACGTCACCGA GATGACCCTG CCGATGGAGC CCGGTCGCTT CCGCTCCCTC GTCCAGGGCC CCGATGGCGC GCTCTACGCG GTGGTGGATC AGGGGATGAT TCACAAGATG ATGCCGTAA
|
Protein sequence | MTRLSALSLG AALASATALA APAFAANTTS LSHEIVLEGL ENPWDVAFLE DGTMFFTEKC LGLSVRLPDG SVNKLLGMKG TDDDYASTAE DLFCEGQAGM QGVAVDPDFA ENRQIYVYST SDLTAPGSNR LLRMTVGEDL ASVADRTDIV EDVPYKPAAT DHPFGGPGAH NGGRVRFGPD GFIYLTTGDT HNGEGPQSPT LLAGKVLRID RDGNAAEGNA PPEGFDPRIY TYGHRNTQGI TFHPETGAAI TAEHGPWHSD EITVLQNGGN AGWDPRPNVG GRGECPDGYC GYSPNQMEGM DRYERAAFMP MTDLATYPDA MQPIWDNNGW SQGTSSAEFL TGDQWGDWEN HLVVGIMGIG FGGTPIGQRI DVIELNEAGT EVVDVTEMTL PMEPGRFRSL VQGPDGALYA VVDQGMIHKM MP
|
| |