Gene TM1040_3220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3220 
Symbol 
ID4075362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp216964 
End bp217980 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID638004729 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_611456 
Protein GI99078198 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00317601 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TCTTGGGACT TACCGCTGTT GCATCCGTGA GCTTTGCCTT TGCGGCAGAA 
GCCATGGCCA CAGAGTGGAA TGTCTCGGTT TGGGGCAAAC GCCGCGCCTT TACCGAGCAC
GTCGAAAAGC TCGCTGAACT GGTGTCCGAG AAAACCGACG GCGAATTCAC CATGAACATC
AGCTATGGTG GACTGTCCAA AAACCGCGAG AACCTCGACG GGATCTCGAT TGGCGCGTTT
GAGATGGCGC AGTTCTGCGC CGGCTATCAC CGCGACAAAA ACCGCGTGAT TACCGTTCTT
GAATTGCCCT TCCTGGGCAT TTCCAACCTC GAAGAGGAGG TTGCGGTCTC TAGCGCGGTC
TACAACCACC CGGCCGCAGC CGAGGAAATG GCGCAGTGGA ACGCAAAGCT GCTCATGACC
TCGCCGATGC CGCAATACAA TATCGTCGGC ACCGGTGATG TGCGTGATGA TCTGGCGGAA
TTTGAAGGCA TGCGCGTGCG GGCAACCGGC GGTATCGGCG AAGCCTTCAA GGCTGTTGGC
GCCGTTCCGA CCTCCGTCAC CGCGACCGAG GCCTATCAGG CGATGGAATC CGGTGTGGTC
GACACCGTAG CGTTCGCACA ACATGCGCAT CTGAGCTTTG GCACCATCAA CCGCGCTGAC
TGGTGGACCG CTAACCTCAA CCCCGGCACC GTGAACTGCC CGGTTGTGGT CAATATTGAC
GCTTACGAAA GCCTCTCTGA CGCCGAGCGC GAAGCGCTGG ACAGCTCGGT TGCCGAAGCG
CTGGATCACT ACCTGGCGAA CTACGGCGAG CTGCTGAAGA AGTGGGATAG TGTTCTCGAG
GAAAAAGGCG TCGAAAAGGT CGAGATTTCG GAAGAGGTGC TCGCAGAATT CCGCTCTACT
GCGGCTGAGC CGATCCGCGA CGCTTGGATC AAGGATATGG AAGCACAGGG CCTGCCGGGT
CAGGAGCTCT ATGATCTAGT TCAGAAAACG CTCGCAGATC ACCGCAACGG CAGCTGA
 
Protein sequence
MKKFLGLTAV ASVSFAFAAE AMATEWNVSV WGKRRAFTEH VEKLAELVSE KTDGEFTMNI 
SYGGLSKNRE NLDGISIGAF EMAQFCAGYH RDKNRVITVL ELPFLGISNL EEEVAVSSAV
YNHPAAAEEM AQWNAKLLMT SPMPQYNIVG TGDVRDDLAE FEGMRVRATG GIGEAFKAVG
AVPTSVTATE AYQAMESGVV DTVAFAQHAH LSFGTINRAD WWTANLNPGT VNCPVVVNID
AYESLSDAER EALDSSVAEA LDHYLANYGE LLKKWDSVLE EKGVEKVEIS EEVLAEFRST
AAEPIRDAWI KDMEAQGLPG QELYDLVQKT LADHRNGS