Gene EcHS_A0522 details


Gene Information

Locus tag: EcHS_A0522
Symbol:
ID: 5591046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 531677
End bp: 533377
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 53%
IMG OID: 640919705
Product: solute-binding family 5 protein
Protein accession: YP_001457290
Protein GI: 157159972
COG category: [R] General function prediction only
COG ID: [COG4533] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
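
The coordinate and length fields in the table are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 531677
END_BP = 533377


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1701 566
```

Under those assumptions the 1701 bp gene holds 567 codons, of which 566 encode amino acids, matching the reported protein length.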


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.0612149
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGATTGC TCAACCGTCT TAACCAGTAT CAACGTCTGT GGCAACCTTC AGCCGGAAAG 
CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT
CGTACGCTGT TGCGTCAGGC ACAGGAGGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA
CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCGG AATCGCTACG CAATGCGATG
ATGGAACAGG CACTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC
CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACA
CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCAGG CTTTTTGCCC
GGCCGTGCCG AGCAGCATCT CGCCGGGCAG ATATTTTCCG GCCTGACCCG CTTCGATAAT
AATACTCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTAC TGACGGGTTA
CGCTGGGACT TTTATCTTCG TTCAACCCTA CACTGGCATA ACGGCGATGC AGTAAAAGCC
TCACACTTAC ACCAGCGATT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT
ATTAGCGTGA AGCGTATTGA AGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC
CCCGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGT GCATCCGCAA
TTCCCACTGA TCGGCACTGG TCCTTTTCGC TTAACACAAT TCACAGCGGA GCTGGTGCGC
CTGGAGAGCC ATGATTATTA CCATTTGCGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG
ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC
ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGTGG CATCAGTTTA
GGTTTTTGCT ATTTAACGTT GCGCAAAAGT CCCCGACTCT CCCTCTGGCA GGCGCGAAAA
GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG
ATCACCGCCA GTCATGCATT ACTGCCAGGC TGGACTATTC CGCACTGGCA AGTACCGGAT
GAAGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACC TACCGATAGA ACTTCATACC
ATGGCAGAAC GCCTACAGGC GACACTGTCA GCGGAAGGCT GTGAACTCAC AATTATTTTT
CATAACGCAA AAAACTGGGA CGACACGACC CTACTGGCAC ACGCAGACCT CATGATGGGC
GACAGATTAA TTGGCGAAGC ACCGGAATAT ACTCTGGAGC AATGGCTACG TTGCGATCCA
CTGTGGCCAC ATGTTTTCGA CGCTCCAGCA TATGCACATC TGCAATCGAC GCTGGACGCG
GTGCAAGTAA TGCCTGATGA GGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG
TTAATGGCAG ACGCGACGCT GACGCCGCTG TTCAACTATC ACTATCGCAT TAGTGCCCCT
CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG
CTTCCCGCGC CGTCGCAATG A
 
Protein sequence
MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG 
RGKRGQLRFL VTPESLRNAM MEQALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT
PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSTDGL
RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR
PDYWLAHRLA SYCSHLVHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW
ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK
VISIIHQSGL LQTLEVGENL ITASHALLPG WTIPHWQVPD EVKLPKTLTL VYHLPIELHT
MAERLQATLS AEGCELTIIF HNAKNWDDTT LLAHADLMMG DRLIGEAPEY TLEQWLRCDP
LWPHVFDAPA YAHLQSTLDA VQVMPDEENR FNALKAVFSQ LMADATLTPL FNYHYRISAP
PGVNGVRLTP RGWFEFTEAW LPAPSQ
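
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be analysed. The following is a minimal sketch of how the two listings relate. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares for internal codons, and it uses only the first line of the gene sequence as input. The helper names are hypothetical.

```python
# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence, copied from the listing.
FIRST_LINE = "ATGCGATTGC TCAACCGTCT TAACCAGTAT CAACGTCTGT GGCAACCTTC AGCCGGAAAG"
dna = clean_sequence(FIRST_LINE)
print(translate(dna))
```

Passing the full gene sequence to `clean_sequence` and `translate` in the same way should reproduce the protein listing, and `gc_percent` can be compared against the GC content reported in the table.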