Gene TM1040_1813 details


Gene Information

Locus tag: TM1040_1813
Symbol:
ID: 4076959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start (bp): 1905025
End (bp): 1906029
Gene length: 1005 bp
Protein length: 334 aa
Translation table: 11
GC content: 59%
IMG OID: 638007128
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_613808
Protein GI: 99081654
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
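
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon; the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1905025
END_BP = 1906029
GENE_LENGTH_BP = 1005
PROTEIN_LENGTH_AA = 334


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1005
print(expected_protein_length(GENE_LENGTH_BP))  # 334
```

Under these assumptions both derived values agree with the reported gene length and protein length.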


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.24254
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTTT TGACCGCTGC CGCAACCGCA CTGGCTCTGA GCGTGACCGC AGGTGTGGCG 
CAGGCCGATG CCTGTGATGA TGGCGAAATC GTCGTCAAGT TCAGCCACGT TACCAACACC
GACAAGCACC CCAAGGGGAT CGCGGCGTCC TTGCTGGAAA AGCGTGTAAA CGAAGAGATG
AACGGCACCA TGTGCATGGT CGTCTATCCG AACTCCACGC TGTATGACGA CAACAAGGTT
CTCGAAGCGA TGCTGCAGGG CGACGTGCAG CTGGCGGCGC CTTCGCTGTC GAAATTCGAG
AAGTTCACCA AGCAGTTCCG CCTGTTTGAC CTGCCGTTCA TGTTCAAGAA CATCGACGCC
GTGGACGCAT TCCAGGCTTC TGAAAATGGT CAGGCCATGC TCGACAGCAT GCAGCGCCGC
GGCCTGCAGG GTCTTGGCTA CTGGCACAAC GGCATGAAGC AGATGTCTGC CAACAAGCCG
CTCGTGATGC CCGAAGACGC CAATGGCCTG AAGTTCCGCG TGCAGTCTTC GGACGTGCTG
GTGGCGCAGA TGGAAGCGAT CGGTGGCAGC CCGCAGAAAA TGGCCTTCTC CGAAGTCTAT
GGCGCGCTGC AGCAGGGCGT TGTGGATGGC CAGGAGAACA CCTGGTCCAA CATCTACGGC
AAGAAGTTCT TTGAAGTTCA GGACGGTATC ACAGAAACCA ACCACGGCGT GCTCGACTAT
CTGGTTGTGG CTTCGGTGGA CTGGCTCGAC AGCCTTGAGC CTGAGGTGCG TGACCAGTTC
ATGACCATCA TGACCGAAGT GACCGCAACC CGGAACGCCG AATCCACCCG CGTCAACAAC
GAAGCCAAAG AGGCCATCGT TGCGGCAGGT GGCGAAGTGC GCCAGCTTAC CGCTGAGCAG
CGTCAGGCTT GGGTCGACGT GATGAAGCCC GTCTGGGAGC AGTTCTCCGG TGACGTGGGT
CAGGACATGA TCGACGCTGC ACAGTCGATC AACGCCGGCT TCTAA
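
The gene sequence is printed in blocks of 10 bases, six blocks per row, so the whitespace has to be stripped before measuring length or GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed GC fraction describes that row alone, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence shown above.
first_row = "ATGAAGTTTT TGACCGCTGC CGCAACCGCA CTGGCTCTGA GCGTGACCGC AGGTGTGGCG"

print(len(clean_sequence(first_row)))    # 60
print(round(gc_fraction(first_row), 3))  # 0.617
```

Applying the same functions to all rows joined together would give the values to compare against the reported gene length and GC content.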
 
Protein sequence
MKFLTAAATA LALSVTAGVA QADACDDGEI VVKFSHVTNT DKHPKGIAAS LLEKRVNEEM 
NGTMCMVVYP NSTLYDDNKV LEAMLQGDVQ LAAPSLSKFE KFTKQFRLFD LPFMFKNIDA
VDAFQASENG QAMLDSMQRR GLQGLGYWHN GMKQMSANKP LVMPEDANGL KFRVQSSDVL
VAQMEAIGGS PQKMAFSEVY GALQQGVVDG QENTWSNIYG KKFFEVQDGI TETNHGVLDY
LVVASVDWLD SLEPEVRDQF MTIMTEVTAT RNAESTRVNN EAKEAIVAAG GEVRQLTAEQ
RQAWVDVMKP VWEQFSGDVG QDMIDAAQSI NAGF
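
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds a codon table from the NCBI-style amino acid string, which is the same for translation table 11 and the standard code (the two differ in permitted start codons, not in amino acid assignments). Because this gene begins with ATG, no special start-codon handling is included; that simplification and the function name are assumptions of this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# are identical for translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a (possibly blocked) DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row and final bases of the gene sequence shown above.
print(translate("ATGAAGTTTT TGACCGCTGC CGCAACCGCA CTGGCTCTGA GCGTGACCGC AGGTGTGGCG"))
# MKFLTAAATALALSVTAGVA
print(translate("AACGCCGGCT TCTAA"))
# NAGF
```

Both outputs match the corresponding start and end of the protein sequence listed above.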