Gene TM1040_3195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3195 
Symbol 
ID4075299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp190067 
End bp191065 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content57% 
IMG OID638004704 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_611431 
Protein GI99078173 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.166744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.659935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTGT TTTTGAAAGC CGTGGCCGTC GGCACGATGG CCCTCCACCT TGCCACGCCG 
GCCTTCGCCG ACGACATCAA GCTGCGCTTT GCAGGCGTGT TCCCCATCGA CCATCAAGGC
ACAAAGATGA TGGAACAGGT CGCTGCAGAG GTGAACGCAG CGAATGTTGG TCTCGACATG
ACGGTTTTTC CCGCCAGCCA GCTCGGCTCC GGTGAAGCCC TGTTCGAAGA CGTTGCGCGC
GGCAACATCG ATTTTGCATC GGCTTTCATT TACTCCGATA CGGATCCCCG TCTGGAATTC
CTGAACATGC CGTTCCTTGT CAGCAGCTAT GATGACATGG ACCGCGTCCT GCGCGACATG
GATTCAGATT ACAATCGCAT CCTGCAGGAC ATTACCGCCG AATATGGTGT GCGCGTGATG
GCCGCGAACC CCGAGGGCTT TGTCGGCATC GTGGCCTCCA AGGAGCCCGA CAACTGGAAC
ACCTTCGACG ACAAAGGCAT GAACATCCGC GTCTGGTCGT CAAACGCTGT AAAGGCCACC
GTCGAGTCCC TCGGCTATCG TGCGACCACA ATGGCATGGG GTGACATCTT CCCGGCGCTT
CAGTCCGGCA TCGTCGACGG CGCGATCTGC TGCACAAAAA CCGCGACATA CTCGATCTTT
GCTAAATCCG ACGTCGGCAG CCACTTCATC GAGTATAACT CTTTGCTGGA ACAGACATTC
TACTATGGCT CCGAGCGCAC CCTCGCCAAG CTGAACGACG AGCAGCGCGA CGTCATTCAA
GCTGCGATGA GCAAAGCCTC GGCCGACTTC TTCGCCTACA ACCGCGAAAA CGACGCAGCC
TTCGGTCAAA AGCTGATCGA CAGCGGCTAC ACCATTTTGA AGCTCAACGA CGAGGATCAA
CAGGCGATGG CCGAGTATGT GCGCAAAACC ATCTGGCCGA CAATGGAAAG CGCAGTCGGC
AAAGACGTCA TCGATCGCGT GCTGGCGGCT GTTCAATAA
 
Protein sequence
MNVFLKAVAV GTMALHLATP AFADDIKLRF AGVFPIDHQG TKMMEQVAAE VNAANVGLDM 
TVFPASQLGS GEALFEDVAR GNIDFASAFI YSDTDPRLEF LNMPFLVSSY DDMDRVLRDM
DSDYNRILQD ITAEYGVRVM AANPEGFVGI VASKEPDNWN TFDDKGMNIR VWSSNAVKAT
VESLGYRATT MAWGDIFPAL QSGIVDGAIC CTKTATYSIF AKSDVGSHFI EYNSLLEQTF
YYGSERTLAK LNDEQRDVIQ AAMSKASADF FAYNRENDAA FGQKLIDSGY TILKLNDEDQ
QAMAEYVRKT IWPTMESAVG KDVIDRVLAA VQ