Gene TM1040_3352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3352 
Symbol 
ID4075251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp364070 
End bp365059 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content60% 
IMG OID638004860 
Productmyo-inositol 2-dehydrogenase 
Protein accessionYP_611586 
Protein GI99078328 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.977173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.304593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGGA TCGGACTTCT CGGCTGCGGC CGGATTGGTC AAGTTCACGC GCGCTCGATC 
AGCCAGATTG AAGGTGCCAC CGTGACGGCA GTTGCAGATG CCTTTGCAGA GCCCGCACAG
GCCTTGGCCG ACAGTACTGG CGCGCAAGTT CTGGACCCTT TGGCGCTGAT CGAAAGCACA
GAGGTGGATG CCGTCGTGAT CGGCACCCCA ACAGACACGC ATTATGATCT CATCCACGCG
GCAGCCCGCG CTGGCAAAGC AATCTTCTGT GAAAAACCAG TGGATCTGTC GTCTGATCGC
ATTCGCGATT GTATTGCTGC AGTGGAACGC GCAGGCGTCC CCTTTCTGAC AGCGTTCAAT
CGACGGTTTG ACCCGAACTT TGCAGACCTA CAAACGCGGC TCCGCCAGAA GCAGATCGGC
GAGGTCGAGA TCGTGACGAT CCAGTCGCGA GATCCCTCTC CGCCACCCGT CAACTACATC
CAGAGCTCGG GCGGGCTGTT TCGTGACATG ATGATCCACG ATCTCGATAT GGCGCGGTTC
TTGCTGGGCG AAGAAATGGT ACGGGTCTAC GCGGTTGGCT CGGCGCTGAT CGACCCCGAG
ATTGGCAAGG CTGGCGATGT CGACACAGCC GCCGTCACGC TCACCACCGC AAGCGGCAAG
ATCTGTCAGA TCACCAACTC GCGGCGGGCA AGCTATGGAT ATGACCAGAG GATCGAAGTC
CACGGCTCTG GCGGTATGCT GCGCGCGGAA AACGTGCATG AGACAACCGT GGAAATCGCA
ACACAGTCCG GGTTCACCAG AGCCCCGGTT CAGCACTTCT TTCTGGAGCG CTATAAGGCC
GCCTATCATG CGGAGATGTC TCATTTCGTC GCGGCAATCG AAACAGGCAG TGCGCCGACC
CCCAGCCTGT TTGATGGCTT GCAGGCCCAG CTTCTGGCGG ATGCCGCAAC GCGATCATGG
GTCGAGGGCG GACCGGTCGA CCTGACCTGA
 
Protein sequence
MARIGLLGCG RIGQVHARSI SQIEGATVTA VADAFAEPAQ ALADSTGAQV LDPLALIEST 
EVDAVVIGTP TDTHYDLIHA AARAGKAIFC EKPVDLSSDR IRDCIAAVER AGVPFLTAFN
RRFDPNFADL QTRLRQKQIG EVEIVTIQSR DPSPPPVNYI QSSGGLFRDM MIHDLDMARF
LLGEEMVRVY AVGSALIDPE IGKAGDVDTA AVTLTTASGK ICQITNSRRA SYGYDQRIEV
HGSGGMLRAE NVHETTVEIA TQSGFTRAPV QHFFLERYKA AYHAEMSHFV AAIETGSAPT
PSLFDGLQAQ LLADAATRSW VEGGPVDLT