Gene TM1040_2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2069 
Symbol 
ID4077996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2172766 
End bp2173767 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content59% 
IMG OID638007388 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_614063 
Protein GI99081909 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.828242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGT CACTTTTGAC CACCGCAGCC ACGGCGGCTG CTTTCACCCT GTCGCTCGGC 
GCTGCAGACG CGGCAGACAT GCGGCTCAAA CTCGCCGGGG TCGTCCCGGT CGAACACTTC
GGCAATGACA TTCTGAAGCA GATCGAAGCG GATATCGAAG GCGCTGATGT GGGCCTCTCG
GTGACCTTGT TTGAGGCGGG GCAGCTGGGC TCTGGCGAAG AGCTGTTCGA GGACGCCGCG
CGCGGCAACG TCGATCTGGT GCATTCCGTG ATCTACGCGC ATCGTGACCC GGTGCTGGAG
ATCAACTCCT TGCCTTATCT GGTGTCGAGC TTTGATGAGA TGGAAGACAT CTATCTCAAC
AAGGACAGCG CCTTTAACGA GATTTTTGCC GAGCGTCTGG AGGGGCTGGG GCTGAAACTT
CTGGCCAATG CACCCGAAGG TTTCATCGGC GTTGTGGCCG AGAACCTTCC TGAAAACGCC
ACCTCGGTCG GCGACAAGGA CGTCAATATT CGCGTCTGGT CGAGCCAGGT GATCAAAAAC
ACCGTCGAGG CCATGGGCTT TAACGCCACC ACGATGAACT GGGGTGAGGT TTTCCCCGCG
ATCCAGTCCG GCGTCGTGGA CGGGGCCATC TGCTGCACCG CGCAGCTGGC CTATAGCGCC
TTTGCCACCT CGGATGTGGG CAAGTATTTC ATCCCCTATG GCGCAGTGGT CGAGAACACG
ACCTATTACG CCTCCATGGA AACATGGGAA GAGATGAACG ACGAACAGCG CGCCGCCGTA
CAGGCCGCCT TCGACAAGGC CGCACAGACC TATTTTGCCG AGGCCAAGGC GAATGAGGCG
GGCTATATCG ATAAGCTTAA AGAGACCGGT TACGAGGTCG TTGAAGTTTC TGACGCCGAA
CGCAGCGCGA TTGCTGAAAC CGTGCGTAAG GACGTCTGGC CCGGCATTGC CGAGATCGTT
GGTCAGGACG TCATCGACCG CCTGATGACC GCCAAGAACT GA
 
Protein sequence
MRKSLLTTAA TAAAFTLSLG AADAADMRLK LAGVVPVEHF GNDILKQIEA DIEGADVGLS 
VTLFEAGQLG SGEELFEDAA RGNVDLVHSV IYAHRDPVLE INSLPYLVSS FDEMEDIYLN
KDSAFNEIFA ERLEGLGLKL LANAPEGFIG VVAENLPENA TSVGDKDVNI RVWSSQVIKN
TVEAMGFNAT TMNWGEVFPA IQSGVVDGAI CCTAQLAYSA FATSDVGKYF IPYGAVVENT
TYYASMETWE EMNDEQRAAV QAAFDKAAQT YFAEAKANEA GYIDKLKETG YEVVEVSDAE
RSAIAETVRK DVWPGIAEIV GQDVIDRLMT AKN