Gene TM1040_3686 details


Gene Information

Locus tag: TM1040_3686
Symbol:
ID: 4075655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 744740
End bp: 746095
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 67%
IMG OID: 638005206
Product: L-carnitine dehydratase/bile acid-inducible protein F
Protein accession: YP_611915
Protein GI: 99078657
COG category: [C] Energy production and conversion
COG ID: [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID:
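
The coordinates and lengths listed above are related by simple arithmetic. The sketch below is only a consistency check, not part of the record: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon.

```python
# Values copied from the record above.
START_BP = 744740
END_BP = 746095


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming both coordinates are 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1356
print(protein_length(length_bp))  # 451
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.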


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.355974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.187253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATGT TTGACGCCCT GCCCCCTCTG GACGCCTATC ACGTCGTAGG CGAGGGTCAG 
CGTCCAAGCG TCTATGCGGT CTCGGAGCTT GCCAACGACT GCCTCGGCGC GGTTGGACTG
GAAATGGCAA AGCTGATCGA GGTACTGGGG TTGGCCCCCG GCGCGCCCGA CGTCACGGTG
GACCAGCGCC TTGCCTCGCT CTGGTTTGGC TATTCCTTTC GCCCCGTGGG TTGGGAGATG
CCTTCGCTTT GGGATCCGAT CGCGGGGGAT TATCCCTGCG CAGATGGCTG GATCCGCCTG
CACACCAACC TGCCACATCA CCGCGCGGCG GCCCTGTCGG TGCTCGGCTG TAATGCGGAT
CGCGAAGGCG TCGCCAAGGC GGTGCTGACC TGGCAGGGCG ACGCGCTGGA GGCAGCGGTT
GTGGGCGCAG GTGGTGTCGC GGCCGCGATG CGCAGCCGTG AAGAATGGCT GGCGCATCCA
CAGGGCGCGG CGGTCTGCCA AGAGCCTCTG GTGGATTGGA TCAAGCCGCG CCGCGTGGTG
CTGCGCGCGC GCCCGGAGGC CAGCGCAGCG CGACCTCTGA TGGGGGTGCG GGTGCTCGAT
CTGACGCGCG TGTTGGCCGG ACCGGTCAGC ACCCGCACGC TGGCCGGGTT CGGGGCCGAG
GTGCTGCGGA TCGACCCGCC CGATTGGGAC GAGCCGGGCG TGTTGCAGGA CATCTCGCTG
GGCAAACGCA TGGCAAGGCT CAATCTGCGC ACAGAGGCCG GCCGCGCCCA CCTGCGCGCA
CTTCTGGCCG AGGCAGATGT GTTGGTGCAT GGCTTCAGAC CAGGCGCGCT CGACAATCTG
GGGCTGGATT TGGCTACACG CGACGCGATT GCGCCCAACC GGATCGAGGT CACACTCAAC
GCCTATGGCT GGACTGGCCC CTGGGCAAAT CGGCGCGGGT TTGACAGCCT TGTTCAGATG
AGCGCCGGGA TCGCTGATGC GGGGCGGGAC TGGGCGGGCG CACAAAAGCC GACCCCCTTG
CCGGTGCAGG CCCTCGATCA CGCAACCGGC TACCTGATGG CCGCGGCGGT TTTGTCTGCG
CTCTCGGCGG CAGCACGGCA AGAGCCGGTC GGGGTTGCAC GCTTGTCGCT TGCCCGCACG
GCAGAAGCGC TGGTCGCGAT CCCGAAACGG CTATCAGGGC CGGAGATTTC AACTGCCGAG
CCCTGCGACT TTGCCACGTG TGAGGAGGCG AGCGGCTGGG GGGCTGGGTT GCGTCTGAGC
CCGGCGGTGA AGATCAACGG CTGCGAGATG GGTTGGGATA TGCCGGCACA GCCAAGCGGC
ACGCATCCTC CGCAATGGAA CGAGCCGCAA ACCTGA
 
Protein sequence
MSMFDALPPL DAYHVVGEGQ RPSVYAVSEL ANDCLGAVGL EMAKLIEVLG LAPGAPDVTV 
DQRLASLWFG YSFRPVGWEM PSLWDPIAGD YPCADGWIRL HTNLPHHRAA ALSVLGCNAD
REGVAKAVLT WQGDALEAAV VGAGGVAAAM RSREEWLAHP QGAAVCQEPL VDWIKPRRVV
LRARPEASAA RPLMGVRVLD LTRVLAGPVS TRTLAGFGAE VLRIDPPDWD EPGVLQDISL
GKRMARLNLR TEAGRAHLRA LLAEADVLVH GFRPGALDNL GLDLATRDAI APNRIEVTLN
AYGWTGPWAN RRGFDSLVQM SAGIADAGRD WAGAQKPTPL PVQALDHATG YLMAAAVLSA
LSAAARQEPV GVARLSLART AEALVAIPKR LSGPEISTAE PCDFATCEEA SGWGAGLRLS
PAVKINGCEM GWDMPAQPSG THPPQWNEPQ T
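
The gene and protein sequences above can be cross-checked with a few lines of Python. This is a minimal sketch with hypothetical helper names (`clean`, `gc_fraction`, `translate`), not the method used to produce the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in the start codons they permit), and it strips the spacing that the record uses to group bases in tens.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); identical for
# translation tables 1 and 11. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
prefix = "ATGAGTATGT TTGACGCCCT GCCCCCTCTG"
print(translate(prefix))  # MSMFDALPPL
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field of the record.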