Gene TM1040_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2067 
Symbol 
ID4077994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2170824 
End bp2172110 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content60% 
IMG OID638007386 
ProductTRAP dicarboxylate transporter- DctM subunit 
Protein accessionYP_614061 
Protein GI99081907 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATTC TCGGCAGTCT TTTTATCCTG TGTGTGCTGC TCCTCATTGG GGTGAGCGTG 
CCTCTGGCCT TTGGTGGCGT GCTGGTCTTT ATCGGCGTCT TTGGCGGCCA TGATGTGACC
GGGTTCCTGC CCACGGGGCA CTGGAAGATG AATTCCATCG TGCTGCTTGC GATTCCGCTG
TTCATTCTGG CAGGTGCTAT CATGGAGCGG GGGCGGATCG CGGCGCCGCT GGTGTCGGTG
GCGGAGCTTC TGGTGGGGCG CATCCACGGG GGGCTCAGCG CGGCGGCGGT GTTCGCCAGC
GGTATCTTCG GCTCGATCTC GGGCTCTGCG GCGGCGACGC TGACCTGTAT CGGGTCGATT
ATGATGCCGC ACCTGAAGGC CGCGAATTAC CCGCGCGGCC CGGCGGCGGC GCTGATTGTG
GCGGCCTGTC CCTTGGGGCT CCTGATCCCG CCGTCGTCGT CGCAGATCCT TTATGCGTGG
GTGGCGCAGC AATCGGTGCT GAAGTGTTTC CTTTCGACCG TGGTGCCGGG GCTTATCCTG
ATTACGCTTT TGTGCATGGT GAACTACGTA CTGATGCGCA AGGCGGACCT GAAACTGCTC
GAACGCCCGG CAAGCTACCC GCAGGAATTC GTGCGCCGCG GTGGGCGGGC CTTTCCGGCG
CTGTTGATGC CGATCATCAT TCTTGGCGGT ATCTACGGCG GCATCATGAC CCCGACCGAG
GCCGCAGGCG TGGCGGTGAT CTATGCCATT CCCGTTGGCC TGTTCATCTA TCGCGGCCTT
ACGCCTCAGA ATATCTGGCC GACCCTGCGC TATGCGGGCA CCACCATCGG TGTGGTGATG
CTGATGGTCT TTGTGGTAGT GATCGTCAGC CGCTTTCTGG TCTTTGAAGA CATCCCCGGG
ATGGCCAAGG ATCTGATCTT CTCGATCTCG GACAACCCGA TCGTGATCTT GCTGATGGTC
AATCTGGTGA TGATCCTCAT CGGTATGCTG ATGGATGATA TTTCAGGGCT GTTGCTGTCA
GCACCGCTCC TGTTGCCCAT CGTACAAAGC GTCGGAATGG ACCCGGTGCA TTTTGCCGCC
GTCCTTGGCG TCAACCTCGG CATGGCCAAC ATCACGCCGC CCACGGCACC GCTGTTGTAT
CTAGGTGCAA AGGTCACCGA CACACCCGTG AGCGAGATGC TGAAGCCCAC CTTCATCATG
ATCCTGTTTG CATGGCTGCC GACGCTGCTG ATCACCACAT TTGTGCCCGA GGTGGCGCTG
TGGCTGCCCA ATTTTGTCTT TGGCTAA
 
Protein sequence
MIILGSLFIL CVLLLIGVSV PLAFGGVLVF IGVFGGHDVT GFLPTGHWKM NSIVLLAIPL 
FILAGAIMER GRIAAPLVSV AELLVGRIHG GLSAAAVFAS GIFGSISGSA AATLTCIGSI
MMPHLKAANY PRGPAAALIV AACPLGLLIP PSSSQILYAW VAQQSVLKCF LSTVVPGLIL
ITLLCMVNYV LMRKADLKLL ERPASYPQEF VRRGGRAFPA LLMPIIILGG IYGGIMTPTE
AAGVAVIYAI PVGLFIYRGL TPQNIWPTLR YAGTTIGVVM LMVFVVVIVS RFLVFEDIPG
MAKDLIFSIS DNPIVILLMV NLVMILIGML MDDISGLLLS APLLLPIVQS VGMDPVHFAA
VLGVNLGMAN ITPPTAPLLY LGAKVTDTPV SEMLKPTFIM ILFAWLPTLL ITTFVPEVAL
WLPNFVFG