Gene EcSMS35_0488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0488 
Symbol 
ID6144306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp493926 
End bp495626 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content53% 
IMG OID641615382 
Productsolute-binding family 5 protein 
Protein accessionYP_001742589 
Protein GI170683749 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.189151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTGC TCAACCGTCT TAACCAGTAT CAACGTCTGT GGCAACCTTC CGCCGGAAAG 
CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT
CGTACGCTGT TGCGTCAGGC ACAGGAGGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA
CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCAG AATCGCTACG CAATGCGATG
ATGGAACAGG CGCTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC
CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACG
CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCGGG CTTTTTGCCC
GGCCGTGCCG AGCAGCATCT GGCCGGGCAG ATATTTTCCG GCCTGACCCG CTTCGATAAT
AATACCCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTAC TGACGGGTTA
CGCTGGGATT TTTATCTTCG CTCAACCTTA CACTGGCATA ACGGCGATGC AGTAAAAGCC
TCCCACTTAC ACCAGCGGTT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT
ATTAGCGTGA AGCGTATTGA AGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC
CCCGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGC GCATCCGCAA
TTCCCCCTGA TCGGCACGGG TCCTTTTCGC TTAACACAAT TCACAGCGGA ACTGGTGCGC
CTGGAAAGCC ATGATTATTA CCATTTACGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG
ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC
ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGTGG CATCAGTTTA
GGTTTTTGCT ATTTAACGTT GCGCAAAAGT CCCCGACTGT CCCTCTGGCA GGCGCGAAAA
GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG
ATCACCGCCA GTCATGCATT ACTGCCAGGC TGGACTATTC CGCAATGGCA AGTACCGGAT
GAGGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACC TACCGATAGA ACTTCATACC
ATGGCAGAAC GTCTACAGGC GACACTGGCA GCGGAAGGCT GTGAACTCAC AATTATTTTT
CATAACGCAA AAAACTGGGA CGACACGACC CTACTGGCAC ACGCAGACCT CATGATGGGC
GACAGATTAA TTGGCGAAGC ACCGGAATAT ACCCTGGAGC AATGGCTGCG TTGCGATCCA
CTGTGGCCAC ATGTTTTCGA CGCTCCAGCA TATGCACATT TGCAATCGAC ACTGGACGCG
GTGCAAGTAA TGCCTGATGA GGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG
TTAATGGCAG ACGCGACGCT GACGCCGCTA TTCAACTATC ACTATCGCAT TAGTGCCCCT
CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG
CTTCCCGCGC CGTCGCAATG A
 
Protein sequence
MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG 
RGKRGQLRFL VTPESLRNAM MEQALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT
PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSTDGL
RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR
PDYWLAHRLA SYCSHLAHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW
ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK
VISIIHQSGL LQTLEVGENL ITASHALLPG WTIPQWQVPD EVKLPKTLTL VYHLPIELHT
MAERLQATLA AEGCELTIIF HNAKNWDDTT LLAHADLMMG DRLIGEAPEY TLEQWLRCDP
LWPHVFDAPA YAHLQSTLDA VQVMPDEENR FNALKAVFSQ LMADATLTPL FNYHYRISAP
PGVNGVRLTP RGWFEFTEAW LPAPSQ