Gene EcE24377A_0481 details

Gene Information

Locus tag: EcE24377A_0481
Symbol:
ID: 5586781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 500760
End bp: 502460
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 53%
IMG OID: 640924205
Product: solute-binding family 5 protein
Protein accession: YP_001461632
Protein GI: 157155849
COG category: [R] General function prediction only
COG ID: [COG4533] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
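
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the reported gene length) and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of gene_len_bp bases, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(500760, 502460)
print(length, protein_length(length))  # 1701 566
```

Both values agree with the Gene Length and Protein Length fields of this record.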


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGATTGC TCAACCGTCT TAACCAGTAT CAACGTCTGT GGCAACCTTC AGCCGGAAAG 
CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT
CGTACGCTGT TGCGTCAGGC ACAGGAGGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA
CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCGG AATCGCTACG CAATGCGATG
ATGGAACAGG CACTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC
CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACA
CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCAGG CTTTTTGCCC
GGCCGTGCCG AGCAGCATCT CGCCGGGCAG ATATTTTCCG GCCTGACCCG CTTCGATAAT
AATACTCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTAC TGACGGGTTA
CGCTGGGACT TTTATCTTCG TTCAACCCTA CACTGGCATA ACGGCGATGC AGTAAAAGCC
TCACACTTAC ACCAGCGATT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT
ATTAGCGTGA AGCGTATTGA AGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC
CCCGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGC GCATCCGCAA
TTCCCACTGA TCGGCACTGG TCCTTTTCGC TTAACACAAT TCACAGCGGA GCTGGTGCGC
CTGGAGAGCC ATGATTATTA CCATTTGCGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG
ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC
ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGTGG CATCAGTTTA
GGTTTTTGCT ATTTAACGTT GCGCAAAAGT CCCCGACTCT CCCTCTGGCA GGCGCGAAAA
GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG
ATCACCGCCA GTCATGCATT ACTGCCAGGC TGGACTATTC CGCACTGGCA AGTACCGGAT
GAAGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACT TACCGATAGA ACTTCATACC
ATGGCAGAAC GCCTACAGGC GACACTGGCA GCGGAAGGCT GTGAACTCAC AATTATTTTT
CATAACGCAA AAAACTGGGA CGACACGACC CTACTGGCAC ACGCAGACCT CATGATGGGC
GACAGATTAA TTGGCGAAGC ACCGGAATAT ACTCTGGAGC AATGGCTACG TTGCGATCCA
CTATGGACAC ATGTTTTCGA CGCTCCAGCA TATGCACATC TGCAATCGAC GCTGGACGCG
GTGCAAGTAA TGCCTGATGA GGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG
TTAATGGCAG ACGCGACGCT GACGCCGCTG TTCAACTATC ACTATCGCAT TAGTGCCCCT
CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG
CTTCCCGCGC CGTCGCAATG A
 
Protein sequence
MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG 
RGKRGQLRFL VTPESLRNAM MEQALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT
PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSTDGL
RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR
PDYWLAHRLA SYCSHLAHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW
ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK
VISIIHQSGL LQTLEVGENL ITASHALLPG WTIPHWQVPD EVKLPKTLTL VYHLPIELHT
MAERLQATLA AEGCELTIIF HNAKNWDDTT LLAHADLMMG DRLIGEAPEY TLEQWLRCDP
LWTHVFDAPA YAHLQSTLDA VQVMPDEENR FNALKAVFSQ LMADATLTPL FNYHYRISAP
PGVNGVRLTP RGWFEFTEAW LPAPSQ
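
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be de-blocked, its GC fraction computed, and its reading frame translated. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases of the gene sequence listed above.
print(translate("ATGCGATTGC TCAACCGT"))  # MRLLNR
```

Passing the full gene sequence to `translate` should give a string that can be compared against the protein sequence after applying `clean` to it, and `gc_fraction` can be compared against the GC content field.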