Gene TM1040_3197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3197 
Symbol 
ID4075301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp191776 
End bp193065 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content57% 
IMG OID638004706 
ProductTRAP dicarboxylate transporter- DctM subunit 
Protein accessionYP_611433 
Protein GI99078175 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.165915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATAA TCGGAAGTCT CGTCATCGTG TGCGCCATGC TCCTATTGGG AGTCAGTGTC 
TTTGTGGCCT TTGGCGCGGT GTTGATGTTC ATCGCGGTCT TTGGCGATCA GTCCATCACC
GGCTATCTTC CAACCGGGGC GACGGGTCTG CGGTCATTGG TGTTGCTCGC CATTCCGCTC
TTTATGATTG CAGGCGGCGT GATGGAGCGC GGGCGGATCG CAGCCCCCCT CGTGGCCATC
GCAGAGATGT TTGTCGGCCA CCTCAAGGGC GGACTGAGCG CGGCCGCCGT CATGGCCTGT
GGCATCTTCG GCTCCATCGC GGGCAGTGCC AACTCAACGC TGACCTGCAT TGGTGGCATC
ATGCTGCCGC ATCTGAAAAA GGCGAACTAC CCCGAAGGCA AATCCGCAGC GCTCCTTGTG
GCGGCAAGCC CACTGGGCCT TCTGATCCCG CCAAGTGCCA ACCACATCCT CTATGGCTGG
GTGGCGCAAC AGCACGTTTT GAAATGCTTT TTGTCGACGG TCATCCCGGC CTTCATCCTG
ATCACGCTGC TGATCATCAC CAACCATTTC ATGTTGCGCA AACATGACGA CATCAAGCTC
TCTGCCCCAC CCTCCCCGTT TTTCCCGGCG CTCAGGGCCC AGGGACGTGT CGCCGGCCCG
GCGCTGATGA TGCCGATCAT CATTCTTGGC GGTATCTACG GCGGGATCAT GACGCCGACC
GAGGCTGCGG GCATCGCCGT TGTCTATGCG ATCCCGGTTG CGATCTACTT CTATCGCGGC
TTGACGTGGA AGGGTTTGAT CGAAACGTTT CAGAAGTCGG GGATCACGAT TGGCGTCGTT
ATGATCATGA TCTTCATGGT CCTCATAGTC TCTGACAACC TCATTGCACA GGGCGCACCG
CAGATTGCGC AGCAGATGGT CTACTCGGTC TCTGACAATC CCATTGTGAT CTTGCTGATG
ATCAATGTGG TCATGATCCT GATCGGGATG CTGATGGACG ATATTTCGGG ACTGCTGCTG
TCGACACCAA TCCTGTTGCC AATCGCGCAA AGCGTGGGCA TGGATCCGAT CCATTTTGCG
GCCGTGATCG GCGTTAATCT GGGTATGGCC AATATCACGC CGCCGACCGC ACCGCTGTTG
TATCTGGGCG CTCAGGTCTC GGAGACGCCG GTTGCAAAGA TGCTCATACC CACGCTGATG
TTCATCATCT TTGCTTGGTT GCCGACCCTG ATGCTCACGA CCTTTGTGCC CTCGGTTGCA
CTTTGGCTGC CGGAACTCCT ACTGGGCTAA
 
Protein sequence
MIIIGSLVIV CAMLLLGVSV FVAFGAVLMF IAVFGDQSIT GYLPTGATGL RSLVLLAIPL 
FMIAGGVMER GRIAAPLVAI AEMFVGHLKG GLSAAAVMAC GIFGSIAGSA NSTLTCIGGI
MLPHLKKANY PEGKSAALLV AASPLGLLIP PSANHILYGW VAQQHVLKCF LSTVIPAFIL
ITLLIITNHF MLRKHDDIKL SAPPSPFFPA LRAQGRVAGP ALMMPIIILG GIYGGIMTPT
EAAGIAVVYA IPVAIYFYRG LTWKGLIETF QKSGITIGVV MIMIFMVLIV SDNLIAQGAP
QIAQQMVYSV SDNPIVILLM INVVMILIGM LMDDISGLLL STPILLPIAQ SVGMDPIHFA
AVIGVNLGMA NITPPTAPLL YLGAQVSETP VAKMLIPTLM FIIFAWLPTL MLTTFVPSVA
LWLPELLLG