Gene TM1040_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3131 
Symbol 
ID4075003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp105191 
End bp106237 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content58% 
IMG OID638004634 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_611367 
Protein GI99078109 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.722243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTC GCAGAGCACT ACTTGGTGCA GCCACCGCAC TGGCATTTTC CGCAATGGGC 
GCAGTGCCCG CCTTCGCGCA GGAGGTGACG CTGAAGCTGC ACCAGTTCCT GCCCGCACAG
GCCAATGTGC CAAAGCTCAT TCTGGATGTC TGGGCAGACA AGATCGAAGA CGCATCGGGC
GATCGTATCA AGATTGACCG CTACCCCTCG ATGCAGCTGG GCGGCAAGCC GCCAGAATTG
ATTGATCAGG TTCAGGACGG CGTTGCCGAT ATCGTCTGGA CCGTGGTGGG CTACACGCCG
GGTCGTTTCC CATCGACCGA AGTGTTTGAG CTGCCCTTTA TGATGACCAA TGCACGTGCC
GCAAGCCACG CCTATTGGGA CATGATGGAA GATCATTGGC TGGACACCGA ATTCAAGGAC
TTCAAGATCC TTGCAGGGTG GGTGCATGGT CCGGGCATCT TCCACACCTC TGATCCGGTC
GAAGTACCAA AGGATCTTGA GGGCATGAAA ATTCGCGGTG GTGGGCGCTC TGTAAACGCC
TTGCTGACCG AGCTGGGCGC AACACCTGTC GGCATGCCTG TGCCGTCCAT TCCCGAAGCG
CTCTCGAAGG GCGTGATTGA TGGGACCACC ATCCCATGGG AGGTGACCAC CGCCCTGAAA
GTGCCGGAAC TTGTTGAAAA CCATACCGAA TTCTCGGGCC GCGCGCTGTA CACGCTGACC
TTTGTTCTGG CGATGAACAA GGAAAAATAC GACAGCCTGC CTGATGACCT GAAGAAGGTG
ATCGACGACA ACTCCGGTGT CGAGATGTCT GTCTTTGCAG GCGGCACGAT GGCAGATTCG
GACATGCCCG CGCGTGAAAA CGCGCTGGAT CTCGGCAACA ATGTGATCAC GCTCGACGCG
GATCAGACGG CCGTGTGGCG CGAGCGCTCT CAGCCGATCT ACGACAAGTG GCTCGCCGAT
ATGTCGGAGC GCGGCATCGA CGGTCAGGCG CTTCTGGATG AGGCGACCAT GCTGATCGAC
AAATATACGC CGCAGTACGA AAACTGA
 
Protein sequence
MTTRRALLGA ATALAFSAMG AVPAFAQEVT LKLHQFLPAQ ANVPKLILDV WADKIEDASG 
DRIKIDRYPS MQLGGKPPEL IDQVQDGVAD IVWTVVGYTP GRFPSTEVFE LPFMMTNARA
ASHAYWDMME DHWLDTEFKD FKILAGWVHG PGIFHTSDPV EVPKDLEGMK IRGGGRSVNA
LLTELGATPV GMPVPSIPEA LSKGVIDGTT IPWEVTTALK VPELVENHTE FSGRALYTLT
FVLAMNKEKY DSLPDDLKKV IDDNSGVEMS VFAGGTMADS DMPARENALD LGNNVITLDA
DQTAVWRERS QPIYDKWLAD MSERGIDGQA LLDEATMLID KYTPQYEN