Gene TM1040_3144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3144 
Symbol 
ID4075016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp123368 
End bp124948 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content60% 
IMG OID638004647 
Productextracellular solute-binding protein 
Protein accessionYP_611380 
Protein GI99078122 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000116165 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCA CGACTGTATT GACACTGACG TCGCTGCTGC TGGCGTCTGC GGCGCCGCTG 
AGCGCGGAAA CCCTGCGCTG GGCGCGTTCT GGCGATGCGC TGACGCTGGA TCCGCATGCC
CAGAACGAAG GGCCCACGCA TACCGTCCGT CACCAGATGT ATGAGCCGCT CATCATCCGC
GACGTGACCG GCGCGTTTGA GCCCGCATTG GCGACCGAAT GGGCACCCAA GGAAGGCGAT
CCCAACGTTT GGGTGTTCAA GCTGCGTGAG GGCGTGAAGT TCCACGGCGG CGAGGATTTC
ACCGCTGAGG ATGTGGTGTT CTCTTTTGAA CGCGCCAAGC AGGCCAACTC CGACATGAAG
GAGCTTATTG GCTCCATCAC CGAGGTGCGC GCTGTTGATG ATCTGACCGT CGAGATCGTC
ACCGATGGTC CGAACCCGAT CCTGCCGTCG AACCTCACCA ACCTGTTCAT CATGGACAAG
GGCTGGACCG AGGCCAACAA CACCGTGAAC GTGCAGGATT TTGAGGGCGG CGAAATCACC
TATGCCACCA CCAATGCCAA CGGCACCGGT CCCTATGTGC TGCAAAGCCG CGAGCCGGAC
GTCAAAACCG TGATGACGCT CAACGAGAAC TACTGGGGCA AGGACCAGTT CCCGCTCGAA
GTGACCGAGA TCGTCTACAC GCCGATCCAG AATCCCGCGA CCCGCGTGGC AGCGCTCTTG
TCGGGTGAGA TCGACTTCCT TCAGGACATG CCGGTGCAGG ATCTTGACCG CGTCAGCGGT
GCAGATGGTC TGATGGTGCG CAAGGCGCCG CAGAACCGCG TGATCTTCTT TGGCATGAAC
ATGGGTGCCG ATGACATCGA AGCCGACAAC GTTGATGGCA AGAACCCGCT CGCTGATGTG
CGCGTGCGCA AGGCGATGTC GATGGCGATC AACCGCGATG CAATCCAGAA GGTCGTTATG
CGCGGCCAGT CGCAGCCGGC AGGCATGATC GCGCCGCCGT TTGTCAACGG CTGGACCGAA
GAGATGGACT CGGAATCCAA GACAGACATC GAAGGCGCCA AGGCGCTGAT GGCCGAAGCG
GGCTACGCGG ATGGCTTCTC GATCCGTCTG GACTGTCCCA ACGACCGTTA CGTCAACGAC
GAGCCGATCT GTCAGGCCGC CGTGGGCATG CTGGGTCAGA TCGGGATTAC CGTGAACCTC
GACGCCAAAC CCAAGGCGCA GCACTTCCCG CTGATCACCG ATGGCAAGAC CGACTTCTAC
ATGCTGGGCT GGGGCGTGCC GACATACGAC TCCGAGTATA TCTTCAACTT CCTCGTGCAT
GGTCGTGAGA GCGACATCGG CACCTGGAAC GGCACCGGCT TTGACAATGA CGAGCTGGAC
GCGAAGATCA AATCTCTGGC GTCGAACACC GATCTTGAAG CGCGCAACCA GGACATCGCA
GATATCTGGC GTGTGGTTCA GGACGAGCAG CTCTATATCC CGATCCACCA TCAGGTGCTG
AACTGGGGCA TGTCCGAGAA GGTCGACATC GCTGTCGATC CCGAGGATCA GCCGAAGGTC
AAATACTTCA AGATGAACTG A
 
Protein sequence
MKTTTVLTLT SLLLASAAPL SAETLRWARS GDALTLDPHA QNEGPTHTVR HQMYEPLIIR 
DVTGAFEPAL ATEWAPKEGD PNVWVFKLRE GVKFHGGEDF TAEDVVFSFE RAKQANSDMK
ELIGSITEVR AVDDLTVEIV TDGPNPILPS NLTNLFIMDK GWTEANNTVN VQDFEGGEIT
YATTNANGTG PYVLQSREPD VKTVMTLNEN YWGKDQFPLE VTEIVYTPIQ NPATRVAALL
SGEIDFLQDM PVQDLDRVSG ADGLMVRKAP QNRVIFFGMN MGADDIEADN VDGKNPLADV
RVRKAMSMAI NRDAIQKVVM RGQSQPAGMI APPFVNGWTE EMDSESKTDI EGAKALMAEA
GYADGFSIRL DCPNDRYVND EPICQAAVGM LGQIGITVNL DAKPKAQHFP LITDGKTDFY
MLGWGVPTYD SEYIFNFLVH GRESDIGTWN GTGFDNDELD AKIKSLASNT DLEARNQDIA
DIWRVVQDEQ LYIPIHHQVL NWGMSEKVDI AVDPEDQPKV KYFKMN