Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0488 |
Symbol | |
ID | 6144306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 493926 |
End bp | 495626 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615382 |
Product | solute-binding family 5 protein |
Protein accession | YP_001742589 |
Protein GI | 170683749 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.189151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTGC TCAACCGTCT TAACCAGTAT CAACGTCTGT GGCAACCTTC CGCCGGAAAG CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT CGTACGCTGT TGCGTCAGGC ACAGGAGGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCAG AATCGCTACG CAATGCGATG ATGGAACAGG CGCTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACG CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCGGG CTTTTTGCCC GGCCGTGCCG AGCAGCATCT GGCCGGGCAG ATATTTTCCG GCCTGACCCG CTTCGATAAT AATACCCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTAC TGACGGGTTA CGCTGGGATT TTTATCTTCG CTCAACCTTA CACTGGCATA ACGGCGATGC AGTAAAAGCC TCCCACTTAC ACCAGCGGTT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT ATTAGCGTGA AGCGTATTGA AGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC CCCGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGC GCATCCGCAA TTCCCCCTGA TCGGCACGGG TCCTTTTCGC TTAACACAAT TCACAGCGGA ACTGGTGCGC CTGGAAAGCC ATGATTATTA CCATTTACGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGTGG CATCAGTTTA GGTTTTTGCT ATTTAACGTT GCGCAAAAGT CCCCGACTGT CCCTCTGGCA GGCGCGAAAA GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG ATCACCGCCA GTCATGCATT ACTGCCAGGC TGGACTATTC CGCAATGGCA AGTACCGGAT GAGGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACC TACCGATAGA ACTTCATACC ATGGCAGAAC GTCTACAGGC GACACTGGCA GCGGAAGGCT GTGAACTCAC AATTATTTTT CATAACGCAA AAAACTGGGA CGACACGACC CTACTGGCAC ACGCAGACCT CATGATGGGC GACAGATTAA TTGGCGAAGC ACCGGAATAT ACCCTGGAGC AATGGCTGCG TTGCGATCCA CTGTGGCCAC ATGTTTTCGA CGCTCCAGCA TATGCACATT TGCAATCGAC ACTGGACGCG GTGCAAGTAA TGCCTGATGA GGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG TTAATGGCAG ACGCGACGCT GACGCCGCTA TTCAACTATC ACTATCGCAT TAGTGCCCCT CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG CTTCCCGCGC CGTCGCAATG A
|
Protein sequence | MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG RGKRGQLRFL VTPESLRNAM MEQALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSTDGL RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR PDYWLAHRLA SYCSHLAHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK VISIIHQSGL LQTLEVGENL ITASHALLPG WTIPQWQVPD EVKLPKTLTL VYHLPIELHT MAERLQATLA AEGCELTIIF HNAKNWDDTT LLAHADLMMG DRLIGEAPEY TLEQWLRCDP LWPHVFDAPA YAHLQSTLDA VQVMPDEENR FNALKAVFSQ LMADATLTPL FNYHYRISAP PGVNGVRLTP RGWFEFTEAW LPAPSQ
|
| |