Gene TM1040_3600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3600 
Symbol 
ID4075027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp649135 
End bp650130 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content62% 
IMG OID638005119 
Productoligopeptide/dipeptide ABC transporter, ATP-binding protein-like 
Protein accessionYP_611829 
Protein GI99078571 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA CGCTTCTTTC CGTAAAGAAC CTGACCCTCG ACATTCCCAC CGGCAGCGGC 
ACGCTGCACG CCGTGCGCGG CATCGATTTT GACCTAATGC GCGGCGAGAC CCTCTCTATC
GTGGGCGAAA GCGGCTCCGG AAAGTCACTG ACCTCGCTGG CTGTCATGGG GCTGTTGGGC
AAGTCAATCA AACGCCGCGC CGATGAGATG CGGTTTGAAA ACATCGACCT TCTGACGGCG
AACCGCCGGG TGATGCGCGA TCTGCGCGGC AACCGCATGG CGATGATCTT CCAAGAGCCG
ATGACATCGC TCAACCCAGC CTACACCATC GGCGATCAGT TGACCGAAAC GCTGCTGCTG
CATCGCAAGG TCAGCAAGTC TGCCGCTCGC GCGCGGGCCA TTGAGCTGCT TGAAAAGGTC
GGCATCACCG CAGCCGAGAG CCGCCTGTCG CAATATCCGC ACCAGTTGTC CGGTGGTTTG
CGTCAGCGGG TGATGATTGC CTTGGCCCTG ATGTGCGAGC CGGACCTGCT GATCGCGGAT
GAGCCGACGA CCGCGCTGGA TGTGACCATT CAGGCGCAGA TCCTACGTCT CTTGGTCGAC
CTCACGCGGG AAATGAACAT GGCCATGATC TTGATCACCC ATGATCTCGG CGTGGTGGCG
CGCGTCGCCG ACAAAGTGGC CGTGATGTAT GCGGGAGAGC TTGTGGAGAC CGGCCCTGCA
GCGGACGTGT TTGCGACTCC AAGCCATCCC TACACACGCG GACTTTTGCG CTGCATCCCG
CAACCCGGCA AGACCGAGCG CGGGGCCGCT CTGGGGACCA TTCCGGGCAT CGTGCCGTCC
CTCATTGGCG AGGTGAGCGG CTGCGCCTTC CGCACCCGCT GCCTGCATGC GCGTCCAGAG
TGCCGCGCCG ACATTCCCCT TCGAGGCGAA GCAAGCCATG AGTTCAAATG CATTCACCCA
GACGGGGCTC TATCCCATGA AGGAGAGGCG GTATGA
 
Protein sequence
MSDTLLSVKN LTLDIPTGSG TLHAVRGIDF DLMRGETLSI VGESGSGKSL TSLAVMGLLG 
KSIKRRADEM RFENIDLLTA NRRVMRDLRG NRMAMIFQEP MTSLNPAYTI GDQLTETLLL
HRKVSKSAAR ARAIELLEKV GITAAESRLS QYPHQLSGGL RQRVMIALAL MCEPDLLIAD
EPTTALDVTI QAQILRLLVD LTREMNMAMI LITHDLGVVA RVADKVAVMY AGELVETGPA
ADVFATPSHP YTRGLLRCIP QPGKTERGAA LGTIPGIVPS LIGEVSGCAF RTRCLHARPE
CRADIPLRGE ASHEFKCIHP DGALSHEGEA V