Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3890 |
Symbol | xylG |
ID | 6146741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3958359 |
End bp | 3959900 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641618716 |
Product | xylose transporter ATP-binding subunit |
Protein accession | YP_001745855 |
Protein GI | 170683069 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | [TIGR02633] D-xylose ABC transporter, ATP-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTATC TACTTGAAAT GAAAAATATT ACCAAAACCT TCGGCAGTGT GAAGGCGATT GATAACGTCA GCTTGCGGTT GAATGCTGGC GAAATCGTCT CACTCTGTGG GGAAAATGGG TCAGGTAAAT CAACGCTGAT GAAAGTGCTG TGTGGTATTT ATCCCCATGG TTCCTACGAA GGCGAAATTA TTTTTGCGGG AGAAGAGATT CAGGCGAGTC ACATCCGCGA TACCGAACGC AAAGGTATCG CCATTATTCA CCAGGAATTG GCGCTGGTGA AAGAACTGAC CGTGCTGGAA AATATCTTCC TGGGTAACGA AATAACCCAT AATGGCATTA TGGATTATGA CCTGATGACG CTACGCTGTC AGAAGCTGCT CGCACAGGTC AGTTTATCCA TTTCACCTGA TACCCGCGTT GGCGATTTAG GACTTGGGCA ACAACAACTG GTTGAAATTG CCAAGGCACT TAATAAACAG GTGCGTTTGT TAATTCTCGA TGAGCCGACA GCCTCATTAA CTGAGCAGGA AACGTCGGTT TTACTGGATA TTATTCGCGA TCTACAACAG CACGGTATCG CCTGTATTTA TATTTCGCAC AAACTCAACG AAGTCAAAGC GATTTCCGAT ACGATTTTCG TTATTCGCGA CGGTCAGCAC ATTGGCACGC GTGATGCTGC TGGAATGAGT GAAGACGATA TTATCACCAT GATGGTCGGG CGAGAGTTAA CCGCGCTTTA CCCTAATGAA CCACATACCA CCGGAGATGA AATATTACGT ATTGAACATC TGACGGCATG GCATCCGGTT AATCGTCATA TTAAACGAGT TAATGATGTC TCGTTTTCCC TGAAACGTGG CGAAATACTG GGTATCGCCG GACTCGTTGG TGCCGGACGT ACCGAGACTA TTCAGTGCCT GTTTGGTGTG TGGCCCGGAC AATGGGAAGG GAAAATTTAT ATTGACGGCA AACAGGTAGA TATTCGTAAC TGTCAGCAAG CCATCGCCCA GGGGATTGCG ATGGTACCCG AAGACAGAAA GCGCGACGGC ATCGTTCCGG TAATGGCGGT TGGTAAAAAT ATTACCCTCG CCGCACTCAA TAAATTTACC GGCGGTATTA GCCAGCTTGA TGACGCGGCA GAGCAAAAAT GTATTCTGGA ATCAATCCAG CAACTCAAAG TTAAAACGTC GTCCTCCGAC CTTGCTATTG GACGTTTGAG CGGCGGCAAT CAACAAAAAG CGATCCTCGC TCGCTGTCTG TTACTTAACC CGCGCATTCT CATTCTTGAT GAACCCACCA GGGGTATCGA TATTGGCGCG AAATACGAGA TCTACAAATT AATTAACCAA CTCGTCCAGC AGGGTATTGC CGTTATTGTC ATCTCTTCCG AATTACCTGA AGTGCTCGGC CTTAGCGATC GTGTACTGGT GATGCATGAA GGGAAACTAA AAGCCAACCT GATAAATCAT AACCTGACTC AGGAGCAGGT GATGGAAGCC GCATTGAGGA GCGAACATCA TGTCGAAAAG CAATCCGTCT GA
|
Protein sequence | MPYLLEMKNI TKTFGSVKAI DNVSLRLNAG EIVSLCGENG SGKSTLMKVL CGIYPHGSYE GEIIFAGEEI QASHIRDTER KGIAIIHQEL ALVKELTVLE NIFLGNEITH NGIMDYDLMT LRCQKLLAQV SLSISPDTRV GDLGLGQQQL VEIAKALNKQ VRLLILDEPT ASLTEQETSV LLDIIRDLQQ HGIACIYISH KLNEVKAISD TIFVIRDGQH IGTRDAAGMS EDDIITMMVG RELTALYPNE PHTTGDEILR IEHLTAWHPV NRHIKRVNDV SFSLKRGEIL GIAGLVGAGR TETIQCLFGV WPGQWEGKIY IDGKQVDIRN CQQAIAQGIA MVPEDRKRDG IVPVMAVGKN ITLAALNKFT GGISQLDDAA EQKCILESIQ QLKVKTSSSD LAIGRLSGGN QQKAILARCL LLNPRILILD EPTRGIDIGA KYEIYKLINQ LVQQGIAVIV ISSELPEVLG LSDRVLVMHE GKLKANLINH NLTQEQVMEA ALRSEHHVEK QSV
|
| |