Gene TM1040_3639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3639 
Symbol 
ID4075067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp696015 
End bp697565 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content59% 
IMG OID638005159 
ProductTRAP C4-dicarboxylate transport system permease DctM subunit 
Protein accessionYP_611868 
Protein GI99078610 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4664] TRAP-type mannitol/chloroaromatic compound transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.633734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.269363 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCCT CCCTCGACCT GATCATGTTT GCCGCCTTGA TGGCGGCCAT CCTCATGGGA 
TTTCCGGTCG CCTTCTCGAT TGCGGGCATC GCTGTGTTTT TTGCCTATCT GGGCTGGATG
CTGGGGGTGA TGGATATTTC GCTCCTTGGC GCCTTTGGCC AGCGGGTGTT TGGACTTCTG
AGCAATGAGG TGCTGATCGC CATTCCGCTG TTTGTGCTGA TGGGTGCAAT CCTCGAAAAG
AGCCGTATCG CAGAAGAGCT TCTGGACACG ATGGGGCGTC TTTTTGGGCA GCTCAAAGGC
GGTCTCGGAA TTTCCGTTGT ATTGGTTGGA GCGCTTCTGG CGGCGTCCAC CGGCATTGTG
GGCGCAACCG TTGTCGCAAT GGGCATGATC GCGCTGCCGG CGATGCTGCG GGCTGGTTAT
GACCCAAGAG TCGCCTCGGG CATCGTCTGC ACGGCGGGCA CGCTGGGACA GATCATTCCA
CCCTCGACGC TGCTGATCAT CCTCGCCGAT GTTATGTCCA ACGCCTATCA ACAGGTCCAA
TACGAACAGG GAAAGTTCGC TGTCGAAGCC TTGTCAGTGG GCCAATTCTT TGCCGCGGCC
CTGATCCCAG GGCTTGTACT GGTGGTACTT TATCTTCTGT ATATCCTGAT CCGGGGCCTT
CTGCGCCCCG AGGACATGCC TTCGGCTCCG GCTGGCATTG CGCGACCTCA TCGCACCGAA
GTGTTGCGGG CGGTGGTGCC CCCGATCCTT TTGATCTTTG CCGTGCTGGG CGCGATCCTT
GGTGGCGTTG CCACCCCGAC CGAGGCCGCC TCGGTCGGAG CCATCGGCGC GTTGTTGATG
GCCGGTCTCA GGACCGGAGG CCCGCTTCGC AGCATCTTGC TTGGCGCGGG CGCGCTGATT
GCTCTCGGCA TCTTGTCGGG CCTGCACCCC GTCCGCCTGC AGCGCAACGA TCTGAGCAGC
ATGGATCTGG TGATAGGCTT GTTTTATGCG CTGCTCGCAG CTGCCGGCGC CATTGCGGTG
CTCCTGTCTC TGCGCGCGGG TCTGAAGAAG CGCATCCTGC ATGATGCGGT CACCTCTACG
GCCACCATGA CCTCGATGAT CTTTGCCACC ATGCTCGCAG CGAGCATGTT CTCTCTTGTT
TTCATTGGCC TTGGAGGAGA GGACCGCGTT GCGCATATCC TGAGCGAGTT ACCCGGCGGT
CCCTCTGGGG CGCTCTTGTT CTCCATGTTG TTTATATTTG TGCTGGGATT CTTTCTCGAC
TTTGTCGAGA TCTCCGTAAT CGTCTTGCCA CTGGTGACGC CAACCTTGAT CCTGATGGGA
CATGATCCGG TCTGGCTCGG GATCTTGATC GCGATCAACC TTCAGACGTC CTTCCTGACA
CCACCCTTCG GCTTTTCGCT GTTTTACCTG CGAGGAGCCG CCCCAAAGGA GATCACAACG
CGCCATATCT ATCAGGGCGT CATCCCCTTC ATAGGTTTGC AGGTGCTTGG AGTGATCCTG
GTCTGGTTCA TCCCCGGTCT GGCCACATGG CTTCCCGAGG CCATCTTCTG A
 
Protein sequence
MTSSLDLIMF AALMAAILMG FPVAFSIAGI AVFFAYLGWM LGVMDISLLG AFGQRVFGLL 
SNEVLIAIPL FVLMGAILEK SRIAEELLDT MGRLFGQLKG GLGISVVLVG ALLAASTGIV
GATVVAMGMI ALPAMLRAGY DPRVASGIVC TAGTLGQIIP PSTLLIILAD VMSNAYQQVQ
YEQGKFAVEA LSVGQFFAAA LIPGLVLVVL YLLYILIRGL LRPEDMPSAP AGIARPHRTE
VLRAVVPPIL LIFAVLGAIL GGVATPTEAA SVGAIGALLM AGLRTGGPLR SILLGAGALI
ALGILSGLHP VRLQRNDLSS MDLVIGLFYA LLAAAGAIAV LLSLRAGLKK RILHDAVTST
ATMTSMIFAT MLAASMFSLV FIGLGGEDRV AHILSELPGG PSGALLFSML FIFVLGFFLD
FVEISVIVLP LVTPTLILMG HDPVWLGILI AINLQTSFLT PPFGFSLFYL RGAAPKEITT
RHIYQGVIPF IGLQVLGVIL VWFIPGLATW LPEAIF