Gene TM1040_1191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1191 
Symbol 
ID4077800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1281161 
End bp1282156 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content61% 
IMG OID638006497 
Productaldo/keto reductase 
Protein accessionYP_613186 
Protein GI99081032 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0223212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCATG AAATGCTGAA ACGTGAGATC GGCCGCTCCG GGATCGAGGC TTCGGCCATC 
GGGCTTGGCA CCTGGGCCAT TGGCGGCTGG ATGTGGGGTG GCACGGATGA GGCGCGCTCG
ATCGCCGCTA TTCAGGCCTC GATTGAGGCC GGAGTGAGCC TCATCGACAC CGCGCCCGCC
TATGGTCAGG GCGTCGCCGA GGAGATCGTC GGCAAGGCCA TCAAGGACCG TCGCGACAAG
GTGGTGCTGG CCACGAAATG CGGGCTCGTC TGGCACACGC AAAAGGGCAA TCACTTCTTT
GATTACGACG GCGCGCCGGT GCATCGCTAT CTTGGCAAGG ATGCGATCAT CTATGAGGTC
GAACAAAGCC TCACGCGTCT CGGCACCGAT TACATCGATC ACTACATCAC CCATTGGCAG
GATCCCACGA CGCCGATTGC CGAAACGATG GAGGCGCTGG AGCAGCTGAA AACACAGGGC
AAAATCCGCT CGATTGGTGC CAGCAATACC ACGCCTTTGG ACGTGCGTGC CTATCTCGAG
GCGGGACAGC TTGATGCGGT TCAAGAAGAA TATTCGATGG TGAACCGCGC GGTTGAGGCC
GAGATGGCGC CGCTCTGTCA TGAAAACGGG GTGTCCATCC TCAGCTATTC CTCGCTCGCG
CTGGGGCTTC TGACCGGCAA GATCGGCCCC GACCGGGTGT TTGAAGGCGA CGATCAGCGC
AAGGACAACC CACGGTTTTC AATTGCCAAT CGCGAAAAAG TGGCCCGCTT GATGGAGGCC
ATCGCGCACA TTGCCGAAGT ACACGGTGCC ACCAAGGCCC AGGTGGTGAT CGCCTGGACG
CTGCAGCAGC CGGGGATAAC CTTCTCGCTC TGCGGGGCGC GCGATGCGAC ACAAGCAGTT
GAAAACGCCA AGGCGGGTCT GCTGCGTCTC AGCGCGGATG ACATTGCCCG GATAAGCGGT
GCCGCCAGCA CGCATCTCAG CGACCTCGAC GGCTGA
 
Protein sequence
MSHEMLKREI GRSGIEASAI GLGTWAIGGW MWGGTDEARS IAAIQASIEA GVSLIDTAPA 
YGQGVAEEIV GKAIKDRRDK VVLATKCGLV WHTQKGNHFF DYDGAPVHRY LGKDAIIYEV
EQSLTRLGTD YIDHYITHWQ DPTTPIAETM EALEQLKTQG KIRSIGASNT TPLDVRAYLE
AGQLDAVQEE YSMVNRAVEA EMAPLCHENG VSILSYSSLA LGLLTGKIGP DRVFEGDDQR
KDNPRFSIAN REKVARLMEA IAHIAEVHGA TKAQVVIAWT LQQPGITFSL CGARDATQAV
ENAKAGLLRL SADDIARISG AASTHLSDLD G