Gene TM1040_1811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1811 
Symbol 
ID4076957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1902842 
End bp1904218 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content58% 
IMG OID638007126 
ProductTRAP dicarboxylate transporter- DctM subunit 
Protein accessionYP_613806 
Protein GI99081652 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0867737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTGG TTCTTCTTTT TACACTGGTC ATTGGCCTTT TGCTGATCGG GGTGCCGATT 
GCGGTTTCTC TGGGTCTGTC GTCCACCATT TTCCTGCTGA TCTATTCCGA CAGCTCGCTC
GGCTCGGTTG CAGGGACGTT GTTTCAGGCC TTTGAAGGGC ATTTCACCCT CTTGGCGATC
CCGTTCTTCA TCCTTGCGTC GAGCTTTATG ACCACGGGTG GTGTTGCACG CCGGATCATC
CGTTTCTCGA TTGCCTGTGT GGGCCACCTG CCGGGTGGTC TGGCGATTGC GGGGGTCTTT
GCCTGTATGC TCTTTGCTGC GCTGTCCGGC TCGTCGCCAG CAACCGTGGT TGCGATCGGG
ACAATCGTGA TCGCGGGCAT GCGCCAGGTG GGCTATTCCA AGGAGTTCGC CGCAGGTGTG
ATCTGTAACG CGGGCACGCT CGGTATCCTG ATCCCGCCGT CCATCGTGAT GGTGGTCTAC
GCCGCAGCGG TTGAGGTCTC GGTCGGGCGG ATGTTCCTTG CTGGTGTCAT TCCGGGCCTG
ATGGCGGGCA TCATGCTGAT GGTCACGATT TATGTGATGG CCAAGGTCAA GAACCTGCCC
AAAGGCGAGT GGAACGGCTG GGGAGAGGTC TTTGCCTCTG CGCGTGAAGC TGGCTGGGGT
CTGTTCCTGA TCGTGATCAT CCTGGGCGGC ATTTATGGCG GGATCTTCAC CCCGACCGAA
GCGGCAGCGG TTGCGGCTGT CTATGCCTTC TTCATCGCGT GTTTTGTCTA TAAGGACATG
GGGCCGCTTT CCAATGGAGA GGGGCAGCCC AAGGACTCGC TTCTGAAGAA ACCCTATGCG
CTGATCACCG CGTTTTTCCA CAGCGACACC AAACACACGC TGTTTGAGGC TGGCAAACTC
ACCGTGACGC TGTTGTTTGT GATCGCCAAC GCGCTGATCC TGAAGCATGT TCTGACCGAC
GAGCAGGTGC CGCAGCATAT TGCAAACGCG ATGCTCTCGG CAGGTTTTGG CCCAGTGATG
TTCCTGATCG TGGTGAACGT GATCCTGCTG ATTGGCGGTC AGTTCATGGA GCCCTCGGGC
CTCCTGGTGA TCGTGGCGCC CCTGGTGTTC CCGATTGCAA TCGAGCTGGG GATCGACCCC
ATTCACCTCG GGATCATCAT GGTTGTGAAC ATGGAGATCG GGATGATCAC ACCGCCGGTG
GGGCTCAACC TCTTTGTGAC CTCCGGGGTT GCGGGGATGC CGATGATGGC GGTCGTGAAG
GCTGCGCTGC CTTTCCTTGC GGTGCTCTTT GTGTTCCTCA TCATGGTCAC CTACATCCCG
GCGATCTCGA CCTTCCTGCC CAATATGATC ATGGGGCCTG AGATCATAAC CAACTGA
 
Protein sequence
MEVVLLFTLV IGLLLIGVPI AVSLGLSSTI FLLIYSDSSL GSVAGTLFQA FEGHFTLLAI 
PFFILASSFM TTGGVARRII RFSIACVGHL PGGLAIAGVF ACMLFAALSG SSPATVVAIG
TIVIAGMRQV GYSKEFAAGV ICNAGTLGIL IPPSIVMVVY AAAVEVSVGR MFLAGVIPGL
MAGIMLMVTI YVMAKVKNLP KGEWNGWGEV FASAREAGWG LFLIVIILGG IYGGIFTPTE
AAAVAAVYAF FIACFVYKDM GPLSNGEGQP KDSLLKKPYA LITAFFHSDT KHTLFEAGKL
TVTLLFVIAN ALILKHVLTD EQVPQHIANA MLSAGFGPVM FLIVVNVILL IGGQFMEPSG
LLVIVAPLVF PIAIELGIDP IHLGIIMVVN MEIGMITPPV GLNLFVTSGV AGMPMMAVVK
AALPFLAVLF VFLIMVTYIP AISTFLPNMI MGPEIITN