Gene TM1040_0491 details

Gene Information

Locus tag: TM1040_0491
Symbol:
ID: 4078237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 513650
End bp: 514654
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 63%
IMG OID: 638005787
Product: aldo/keto reductase
Protein accession: YP_612486
Protein GI: 99080332
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
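
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about the database's conventions, not something the record states.

```python
# Values copied from the record above.
start_bp = 513650
end_bp = 514654

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1005 334
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.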


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00588566
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.160086
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAGGC CAAACTCTGC GTTGCGCGCG CCCGGTCTGC GTGTCAGTCT CCGGCGCATG 
ACTGCTCAGA CTTGCCCCCC GCTCACCACC CTTGATGGCC ATGACGTTGG CCGGTTCGCC
TTTGGCTGCA TGCAATTTGG CGGACGCGCC GATGCGCAGG CCTCTGCCGA AATGTATGAG
GCCTGCCGCG CAGCAGGTCT GCGCCATTTT GATACCGCGT GGCTCTATAC CGAGGGCGCC
AGCGAAGAGA TCCTCGGTCA GTTGATCGCC AAGGACCGCG AGAGCCTCTA TGTCGCGACC
AAGGTTGGCT TCACCGGCGG CGCCAGCGCG GCGAATATGC GAGCGCAGTT CGATCAGTGC
CGACAGCGCC TGAAGCTCGA TCAGGTGGAT CTTCTGTATT TGCACCGGTT TGACCCTGAA
ACCCCGCTCG AGGACACGCT GACCTGTTTT GCCGAACTTA AACAGGAAGG GCACATCCGC
CATGTTGGCC TGTCAAACTT CGCCGCCTGG CAGGTCATGA AAGCCGTTGC TCTCGCGGCG
CGACTGGGCC TCCGGATTGA CGTGCTACAG CCGATGTACT CGCTGGTGAA ACGACAGGCC
GAGGTGGAAA TCCTGCCGAT GTGTGCCGAC CAGGGGATTT TGCCCGTACC CTATTCGCCG
CTGGGCGGCG GGCTCTTGAC CGGGAAATAT GCGCAAGGCG GCACAGGTCG GTTGAGCGAG
GATGAAAACT ATCGTGCCCG CTATGGCCAG GATTGGATGC ACCGGACAGC CTCCGATCTT
CTGCACTTGG CCGAGGATCT TGGCACCGAT CCCGCGACGC TGGCAGTCGC ATGGGCCGCA
GGCCACCCCG CGCGCCCGGC TCCGATCCTC TCGGCACGTT CCGCAACCCA GCTTGCGCCC
TCGCTCAAGG CCACGGAATT TGACATGTCT CCAGAACTTT ATGCGCGTAT CGAAGCCCTG
AGCCCCCGCC CGGCCCCCGC CACGGACCGG CTCGAAGAAG CATGA
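
A GC content figure like the one in the Gene Information section is a simple base count over the sequence. The following is a minimal sketch of one way to compute it; `gc_fraction` is a hypothetical helper name, and the whitespace stripping is there only because the sequence is displayed in blocks of ten bases. How the database itself computes or rounds the value is not stated in the record.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used for display, and normalise case.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First block of the gene sequence above: 5 of its 10 bases are G or C.
print(gc_fraction("ATGATCAGGC"))  # 0.5
```

Passing the full gene sequence as one string (spaces and line breaks included) gives the gene-wide fraction, which can be compared with the GC content field.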
 
Protein sequence
MIRPNSALRA PGLRVSLRRM TAQTCPPLTT LDGHDVGRFA FGCMQFGGRA DAQASAEMYE 
ACRAAGLRHF DTAWLYTEGA SEEILGQLIA KDRESLYVAT KVGFTGGASA ANMRAQFDQC
RQRLKLDQVD LLYLHRFDPE TPLEDTLTCF AELKQEGHIR HVGLSNFAAW QVMKAVALAA
RLGLRIDVLQ PMYSLVKRQA EVEILPMCAD QGILPVPYSP LGGGLLTGKY AQGGTGRLSE
DENYRARYGQ DWMHRTASDL LHLAEDLGTD PATLAVAWAA GHPARPAPIL SARSATQLAP
SLKATEFDMS PELYARIEAL SPRPAPATDR LEEA
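
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with the codon assignments of NCBI translation table 11, which are the same as the standard code for the codons involved here. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases; it is an illustration, not the pipeline the database uses.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    # Drop the spaces and line breaks used for display, and normalise case.
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATCAGGC CAAACTCTGC GTTGCGCGCG"))  # MIRPNSALRA
# Last 15 bases, which are in frame and end with the stop codon TGA.
print(translate("CTCGAAGAAG CATGA"))  # LEEA
```

Both fragments reproduce the matching ends of the protein sequence listed above.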