Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3352 |
Symbol | |
ID | 4075251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 364070 |
End bp | 365059 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004860 |
Product | myo-inositol 2-dehydrogenase |
Protein accession | YP_611586 |
Protein GI | 99078328 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.977173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.304593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGGA TCGGACTTCT CGGCTGCGGC CGGATTGGTC AAGTTCACGC GCGCTCGATC AGCCAGATTG AAGGTGCCAC CGTGACGGCA GTTGCAGATG CCTTTGCAGA GCCCGCACAG GCCTTGGCCG ACAGTACTGG CGCGCAAGTT CTGGACCCTT TGGCGCTGAT CGAAAGCACA GAGGTGGATG CCGTCGTGAT CGGCACCCCA ACAGACACGC ATTATGATCT CATCCACGCG GCAGCCCGCG CTGGCAAAGC AATCTTCTGT GAAAAACCAG TGGATCTGTC GTCTGATCGC ATTCGCGATT GTATTGCTGC AGTGGAACGC GCAGGCGTCC CCTTTCTGAC AGCGTTCAAT CGACGGTTTG ACCCGAACTT TGCAGACCTA CAAACGCGGC TCCGCCAGAA GCAGATCGGC GAGGTCGAGA TCGTGACGAT CCAGTCGCGA GATCCCTCTC CGCCACCCGT CAACTACATC CAGAGCTCGG GCGGGCTGTT TCGTGACATG ATGATCCACG ATCTCGATAT GGCGCGGTTC TTGCTGGGCG AAGAAATGGT ACGGGTCTAC GCGGTTGGCT CGGCGCTGAT CGACCCCGAG ATTGGCAAGG CTGGCGATGT CGACACAGCC GCCGTCACGC TCACCACCGC AAGCGGCAAG ATCTGTCAGA TCACCAACTC GCGGCGGGCA AGCTATGGAT ATGACCAGAG GATCGAAGTC CACGGCTCTG GCGGTATGCT GCGCGCGGAA AACGTGCATG AGACAACCGT GGAAATCGCA ACACAGTCCG GGTTCACCAG AGCCCCGGTT CAGCACTTCT TTCTGGAGCG CTATAAGGCC GCCTATCATG CGGAGATGTC TCATTTCGTC GCGGCAATCG AAACAGGCAG TGCGCCGACC CCCAGCCTGT TTGATGGCTT GCAGGCCCAG CTTCTGGCGG ATGCCGCAAC GCGATCATGG GTCGAGGGCG GACCGGTCGA CCTGACCTGA
|
Protein sequence | MARIGLLGCG RIGQVHARSI SQIEGATVTA VADAFAEPAQ ALADSTGAQV LDPLALIEST EVDAVVIGTP TDTHYDLIHA AARAGKAIFC EKPVDLSSDR IRDCIAAVER AGVPFLTAFN RRFDPNFADL QTRLRQKQIG EVEIVTIQSR DPSPPPVNYI QSSGGLFRDM MIHDLDMARF LLGEEMVRVY AVGSALIDPE IGKAGDVDTA AVTLTTASGK ICQITNSRRA SYGYDQRIEV HGSGGMLRAE NVHETTVEIA TQSGFTRAPV QHFFLERYKA AYHAEMSHFV AAIETGSAPT PSLFDGLQAQ LLADAATRSW VEGGPVDLT
|
| |