Gene EcDH1_3164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3164 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3403643 
End bp3405343 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content53% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionACX40790 
Protein GI260450368 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTGC TCAACCGTCT TAACCAGTAT CAACGTCTGT GGCAACCTTC CGCCGGAAAG 
CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT
CGTACGCTGT TGCGTCAGGC ACAGGAGGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA
CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCGG AATCGCTACG CAATGCGATG
ATGGAACAGG CACTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC
CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACA
CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCAGG CTTTTTGCCC
GGCCGTGCCG AGCAGCATCT CGCCGGGCAG ATATTTTCCG GCCTGACCCG CTTCGATAAT
AATACTCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTAC TGACGGGTTA
CGCTGGGACT TTTATCTTCG TTCAACCCTA CACTGGCATA ACGGCGATGC AGTAAAAGCC
TCACACTTAC ACCAGCGATT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT
ATTAGCGTGA AGCGTATTGA AGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC
CCTGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGC GCATCCGCAA
TTCCCACTGA TCGGCACGGG TCCTTTTCGC TTAACACAAT TCACAGCAGA GCTGGTGCGC
CTGGAAAGCC ATGATTATTA CCATTTACGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG
ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC
ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGCGG CATCAGTTTA
GGTTTTTGCT ATTTGACGTT GCGCAAAAGT CCCCGACTCT CCCTCTGGCA GGCGCGAAAA
GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG
ATCACCGCCA GTCATGCATT ACTGTCAGGC TGGACTATTC CGCATTGGCA GGTACCGGAT
GAAGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACC TACCGATAGA ACTTCATACC
ATGGCAGAAC GCCTACAGGC GACACTGGCA GCGGAAGGCT GTGAACTCAC AATTATTTTT
CATAACGCAA AAAACTGGGA CGACACGACC CTACAGGCAC ACGCAGACCT CATGATGGGC
GACAGATTAA TTGGCGAAGC ACCGGAATAT ACTCTGGAGC AATGGCTGCG CTGCGATCCG
CTGTGGCCAC ATGTTTTCGA CGCTCCAGCA TACGCACATC TACAATCGAC ACTGGATGCC
GTGCAAATAA TGCCTGATGA AGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG
TTAATGACAG ATGCGACGCT GACGCCGCTG TTCAACTATC ACTATCGCAT TAGTGCCCCT
CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG
CTTCCCGCGC CATCGCAATG A
 
Protein sequence
MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG 
RGKRGQLRFL VTPESLRNAM MEQALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT
PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSTDGL
RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR
PDYWLAHRLA SYCSHLAHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW
ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK
VISIIHQSGL LQTLEVGENL ITASHALLSG WTIPHWQVPD EVKLPKTLTL VYHLPIELHT
MAERLQATLA AEGCELTIIF HNAKNWDDTT LQAHADLMMG DRLIGEAPEY TLEQWLRCDP
LWPHVFDAPA YAHLQSTLDA VQIMPDEENR FNALKAVFSQ LMTDATLTPL FNYHYRISAP
PGVNGVRLTP RGWFEFTEAW LPAPSQ