Gene TM1040_3140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3140 
Symbol 
ID4075012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp118725 
End bp119699 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content63% 
IMG OID638004643 
Productoligopeptide/dipeptide ABC transporter, ATP-binding protein-like 
Protein accessionYP_611376 
Protein GI99078118 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.141396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACC CTGTTCTGTC CATCCGCAAC CTTTGTGTGG AAATCCCGAC CCGCCACGGC 
ATCCTGAAAC CGGTCGATGG CGTCTCTTAT GACATTGCCA AGGGCGAAAT CCTTGGCATC
GTTGGCGAAA GTGGCGCCGG CAAGTCGATG GCGGGCAATG CCGTCATTGG TCTCCTGAAC
CCGCCGGCGC ATGTGTCTTC CGGTGAGATC TGGCTCAACG GCAAACGTAT CGACACCCTG
AAAGGGGACG CGCTGCGCCG CCTGCGGGGC AAGGAAATCG GTATGGTCTT TCAGGACCCA
CTGACCTCCA TCAATCCGCT GTTGCGGATC GGGGATCAAC TGGTGGAGAC CATGCTGACC
CACCTGCCGA TCAGCAAATC CGAGGCCGAA AAACGCGCCG TGGCCGCCCT AGAAGAAGTG
GGCATTCCCG GTGCTGCGAA ACGTGTGAAC AGCTACCCGC ACGAGTTTTC CGGCGGTATG
CGCCAGCGGG TGGTGATCGC CTTGGCGCTT TGTGCGGAGC CTTCGCTGGT CATCGCGGAT
GAGCCGACAA CGGCGCTGGA TGTGTCTGTG CAGGCACAGA TCATCGCGTT GCTGAAACGG
CTCTGTCGTG AGCGCGGCAC GGCTGTCATG CTGATCACGC ACGACATGGG CGTGATTGCC
GAAGCAGCAG ATCGTGTGGC GGTGATGTAT GCCGGGCGCC TCGCAGAGCT CGGCCCGGTG
CGCGATGTGA TCACCGCGCC TGAGCACCCC TATACGCATG GGCTGATGGC CTCGACACCA
CTCGCGTCGC GTGGCCAGAA ACGTCTGCAC CAGATCCCCG GCGCAATGCC GCGTCTGGAT
GCGGTGCCGG ATGGTTGCGC CTTCAACCCA CGCTGCCCGC ATGCGGCCGA CAAATGCCGC
GCGGCCCCCG CACCCAAGGT CGACGGAGGT TCCGCCGCGT GCTGGTTCCC ACTTCAACAT
GAGGAGGCCT CCTGA
 
Protein sequence
MADPVLSIRN LCVEIPTRHG ILKPVDGVSY DIAKGEILGI VGESGAGKSM AGNAVIGLLN 
PPAHVSSGEI WLNGKRIDTL KGDALRRLRG KEIGMVFQDP LTSINPLLRI GDQLVETMLT
HLPISKSEAE KRAVAALEEV GIPGAAKRVN SYPHEFSGGM RQRVVIALAL CAEPSLVIAD
EPTTALDVSV QAQIIALLKR LCRERGTAVM LITHDMGVIA EAADRVAVMY AGRLAELGPV
RDVITAPEHP YTHGLMASTP LASRGQKRLH QIPGAMPRLD AVPDGCAFNP RCPHAADKCR
AAPAPKVDGG SAACWFPLQH EEAS