Gene Information |
Locus tag | TM1040_3174 |
Symbol | |
ID | 4075344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 155096 |
End bp | 156124 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638004677 |
Product | D-xylose ABC transporter, substrate-binding protein |
Protein accession | YP_611410 |
Protein GI | 99078152 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
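The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the reported Gene Length) and that the unspliced CDS ends in a stop codon that is not counted in the Protein Length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 155096
end_bp = 156124
gene_length_bp = 1029
protein_length_aa = 342


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the ones reported in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Both comparisons hold for this record: 156124 - 155096 + 1 = 1029 bp, and 1029 / 3 - 1 = 342 aa.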
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.285981 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.43626 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTT TGAACGGACT GTCCGTCATC GCGCTGAGCG CCTTTGCTGC ATCTTCGGCT TTTGCCGCTG ATCTGACCGT AGGCGTCAGC TGGTCGAACT TTCAGGAAGA GCGCTGGAAA ACCGACGAAG CCGCGATTGT TGCCGCACTG GAGGAAGCAG GCGCCGAGTA TATCTCTGCC GATGCGCAGT CTTCCTCTGC CAAGCAGCTT TCCGACATTG AGAGCCTGAT CGCACAGGGT GTCGATGCGC TGATCATCCT CGCACAGGAC GCACAGGCCA TCGGCCCCGC CGTGCAAGCG GCTGCGGACG AGGGTATCCC GGTTGTGGCT TACGACCGCC TGATCGAGGA CAATCGCGCC TTTTACCTGA CCTTCGACAA CGTCGAAGTG GGCCGGATGC AGGCCCGCGC CGTGCTCGAG GCGATGCCCA AGGGCAACTA CGTCATGATC AAAGGCTCGC CCACCGACCC CAACGCTGAC TTCCTGCGGG GCGGTCAGCA AGAGGTGATC CAGGCGGCAG TCGACGCCGG TGACATCAAG ATCGTTGGCG AAGCCTACAC GGACGGCTGG CTGCCCGCCA ACGCACAGCG CAACATGGAG CAGATCCTGA CCGCCAACGA CAACAAGGTC GACGCGGTTG TGGCCTCCAA CGACGGCACC GCTGGCGGTG TGGTTGCAGC GCTCACCGCG CAGGGCATGG ACGGCATCCC GGTGTCCGGT CAGGACGGCG ACCACGCCGC GCTGAACCGC GTCGCCAAGG GCACCCAGAC CGTTTCCGTC TGGAAAGACG CCCGTGACCT CGGCAATGCG GCGGGTGAGA TCGCCGTGGC TCTGGCCGGT GGCACCGCGA TGGAAGAAAT CAGCGGCGCG ACCTCCTGGA CCTCTCCGGC AGGCACCACC ATGACCGCAC GGTTCCTCGC ACCGATGCCG GTGACCAAAG AGAACCTCTC TGCGGTTGTG GATGCAGGCT GGATCACACA GGAAGCGCTT TGCCAGGGTG TCGAAAACGG CCCGGCCCCC TGCAACTGA
|
Protein sequence | MKFLNGLSVI ALSAFAASSA FAADLTVGVS WSNFQEERWK TDEAAIVAAL EEAGAEYISA DAQSSSAKQL SDIESLIAQG VDALIILAQD AQAIGPAVQA AADEGIPVVA YDRLIEDNRA FYLTFDNVEV GRMQARAVLE AMPKGNYVMI KGSPTDPNAD FLRGGQQEVI QAAVDAGDIK IVGEAYTDGW LPANAQRNME QILTANDNKV DAVVASNDGT AGGVVAALTA QGMDGIPVSG QDGDHAALNR VAKGTQTVSV WKDARDLGNA AGEIAVALAG GTAMEEISGA TSWTSPAGTT MTARFLAPMP VTKENLSAVV DAGWITQEAL CQGVENGPAP CN
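As a rough illustration of how the gene sequence maps to the protein sequence, the sketch below translates the first three 10-base blocks and the final 9 bases of the gene sequence shown above. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons; table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments in the usual TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    # The record prints the sequence in space-separated blocks of 10.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 9 bases, copied from the Gene sequence row.
first_blocks = "ATGAAATTTT TGAACGGACT GTCCGTCATC"
last_codons = "TGCAACTGA"

print(translate(first_blocks))  # MKFLNGLSVI
print(translate(last_codons))   # CN*
```

The two outputs match the first ten residues (MKFLNGLSVI) and the last two residues (CN) of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.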