Gene TM1040_3174 details

Gene Information

Locus tag: TM1040_3174
Symbol:
ID: 4075344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 155096
End bp: 156124
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 63%
IMG OID: 638004677
Product: D-xylose ABC transporter, substrate-binding protein
Protein accession: YP_611410
Protein GI: 99078152
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4213] ABC-type xylose transport system, periplasmic component
TIGRFAM ID: [TIGR02634] D-xylose ABC transporter, substrate-binding protein
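
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(155096, 156124)
print(length, expected_protein_length(length))
```

With the values in this record, the computed lengths agree with the listed 1029 bp and 342 aa.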


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.285981
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.43626
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTT TGAACGGACT GTCCGTCATC GCGCTGAGCG CCTTTGCTGC ATCTTCGGCT 
TTTGCCGCTG ATCTGACCGT AGGCGTCAGC TGGTCGAACT TTCAGGAAGA GCGCTGGAAA
ACCGACGAAG CCGCGATTGT TGCCGCACTG GAGGAAGCAG GCGCCGAGTA TATCTCTGCC
GATGCGCAGT CTTCCTCTGC CAAGCAGCTT TCCGACATTG AGAGCCTGAT CGCACAGGGT
GTCGATGCGC TGATCATCCT CGCACAGGAC GCACAGGCCA TCGGCCCCGC CGTGCAAGCG
GCTGCGGACG AGGGTATCCC GGTTGTGGCT TACGACCGCC TGATCGAGGA CAATCGCGCC
TTTTACCTGA CCTTCGACAA CGTCGAAGTG GGCCGGATGC AGGCCCGCGC CGTGCTCGAG
GCGATGCCCA AGGGCAACTA CGTCATGATC AAAGGCTCGC CCACCGACCC CAACGCTGAC
TTCCTGCGGG GCGGTCAGCA AGAGGTGATC CAGGCGGCAG TCGACGCCGG TGACATCAAG
ATCGTTGGCG AAGCCTACAC GGACGGCTGG CTGCCCGCCA ACGCACAGCG CAACATGGAG
CAGATCCTGA CCGCCAACGA CAACAAGGTC GACGCGGTTG TGGCCTCCAA CGACGGCACC
GCTGGCGGTG TGGTTGCAGC GCTCACCGCG CAGGGCATGG ACGGCATCCC GGTGTCCGGT
CAGGACGGCG ACCACGCCGC GCTGAACCGC GTCGCCAAGG GCACCCAGAC CGTTTCCGTC
TGGAAAGACG CCCGTGACCT CGGCAATGCG GCGGGTGAGA TCGCCGTGGC TCTGGCCGGT
GGCACCGCGA TGGAAGAAAT CAGCGGCGCG ACCTCCTGGA CCTCTCCGGC AGGCACCACC
ATGACCGCAC GGTTCCTCGC ACCGATGCCG GTGACCAAAG AGAACCTCTC TGCGGTTGTG
GATGCAGGCT GGATCACACA GGAAGCGCTT TGCCAGGGTG TCGAAAACGG CCCGGCCCCC
TGCAACTGA
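
The GC content field can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming the input contains only A, C, G and T plus whitespace; `gc_content` is a hypothetical helper name, and the record does not say how its own value was calculated or rounded.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above; paste the full sequence
# to compare against the GC content listed in the record.
first_line = "ATGAAATTTT TGAACGGACT GTCCGTCATC GCGCTGAGCG CCTTTGCTGC ATCTTCGGCT"
print(f"{gc_content(first_line):.1f}%")
```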
 
Protein sequence
MKFLNGLSVI ALSAFAASSA FAADLTVGVS WSNFQEERWK TDEAAIVAAL EEAGAEYISA 
DAQSSSAKQL SDIESLIAQG VDALIILAQD AQAIGPAVQA AADEGIPVVA YDRLIEDNRA
FYLTFDNVEV GRMQARAVLE AMPKGNYVMI KGSPTDPNAD FLRGGQQEVI QAAVDAGDIK
IVGEAYTDGW LPANAQRNME QILTANDNKV DAVVASNDGT AGGVVAALTA QGMDGIPVSG
QDGDHAALNR VAKGTQTVSV WKDARDLGNA AGEIAVALAG GTAMEEISGA TSWTSPAGTT
MTARFLAPMP VTKENLSAVV DAGWITQEAL CQGVENGPAP CN
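
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the forward reading frame starts at the first base and that the gene begins with ATG; `translate` and `CODON_TABLE` are hypothetical names. Translation table 11 assigns the same amino acids to codons as the standard code during elongation and differs only in which start codons it permits, which this sketch does not handle.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1, ignoring whitespace.

    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAAATTTT TGAACGGACT GTCCGTCATC"))  # MKFLNGLSVI
print(translate("TGCAACTGA"))                         # CN
```

The two fragments correspond to the first ten and last two residues of the protein sequence listed above.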