Gene EcDH1_2206 details

Gene Information

Locus tag: EcDH1_2206
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2365679
End bp: 2366824
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: ACX39856
Protein GI: 260449434
COG category:
COG ID:
TIGRFAM ID:
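
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the usual GenBank-style convention, not stated on this page) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2365679
end_bp = 2366824

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the
# stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1146
print(gene_length % 3)  # 0
print(protein_length)   # 381
```

Both results match the listed Gene Length (1146 bp) and Protein Length (381 aa).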


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.313197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAGA CATTTGCCCG CAGCAGCCTG TGTGCGCTCA GCATGACAAT AATGACCGCT 
CACGCCGCCG AACCGCCTAC CAATTTAGAT AAACCGGAAG GGCGACTGGA TATTATCGCC
TGGCCGGGAT ACATCGAACG CGGACAAACT GATAAACAAT ACGACTGGGT AACGCAGTTC
GAAAAAGAGA CAGGCTGCGC GGTGAATGTG AAAACCGCCG CGACTTCCGA TGAAATGGTC
AGTCTGATGA CCAAAGGGGG TTACGATCTG GTTACGGCAT CCGGCGATGC CTCGCTGCGT
TTGATTATGG GTAAACGCGT GCAGCCGATT AATACCGCAT TGATTCCCAA CTGGAAAACG
CTCGATCCGC GCGTGGTTAA AGGCGACTGG TTTAATGTTG GCGGCAAAGT TTACGGCACA
CCTTACCAAT GGGGGCCGAA CCTGCTGATG TACAACACTA AAACCTTCCC GACGCCGCCG
GATAGCTGGC AAGTGGTTTT TGTTGAGCAA AATCTGCCGG ACGGCAAGAG CAATAAAGGC
CGCGTTCAGG CTTATGATGG CCCTATCTAT ATTGCGGACG CTGCGTTGTT CGTTAAAGCC
ACTCAGCCGC AGTTGGGCAT CAGCGATCCG TATCAACTCA CCGAAGAACA GTACCAGGCG
GTGCTGAAAG TGCTGCGCGC TCAACACAGT TTGATCCATC GCTACTGGCA TGACACTACC
GTGCAAATGA GCGATTTCAA AAACGAGGGT GTGGTTGCTT CCAGTGCCTG GCCCTATCAG
GCCAACGCCC TGAAAGCCGA AGGCCAGCCT GTTGCTACCG TTTTCCCGAA GGAGGGTGTT
ACCGGTTGGG CTGATACCAC CATGCTGCAT AGCGAAGCGA AACATCCGGT TTGCGCCTAC
AAATGGATGA ACTGGTCATT AACGCCAAAA GTGCAGGGCG ATGTGGCGGC CTGGTTTGGC
TCGTTACCGG TAGTGCCGGA AGGGTGTAAA GCCAGTCCGT TATTAGGCGA AAAAGGTTGT
GAAACCAACG GTTTTAACTA TTTCGACAAA ATCGCCTTCT GGAAAACGCC TATAGCAGAA
GGGGGCAAGT TTGTTCCCTA CAGTCGCTGG ACGCAGGATT ACATTGCCAT TATGGGCGGT
CGCTAA
 
Protein sequence
MSKTFARSSL CALSMTIMTA HAAEPPTNLD KPEGRLDIIA WPGYIERGQT DKQYDWVTQF 
EKETGCAVNV KTAATSDEMV SLMTKGGYDL VTASGDASLR LIMGKRVQPI NTALIPNWKT
LDPRVVKGDW FNVGGKVYGT PYQWGPNLLM YNTKTFPTPP DSWQVVFVEQ NLPDGKSNKG
RVQAYDGPIY IADAALFVKA TQPQLGISDP YQLTEEQYQA VLKVLRAQHS LIHRYWHDTT
VQMSDFKNEG VVASSAWPYQ ANALKAEGQP VATVFPKEGV TGWADTTMLH SEAKHPVCAY
KWMNWSLTPK VQGDVAAWFG SLPVVPEGCK ASPLLGEKGC ETNGFNYFDK IAFWKTPIAE
GGKFVPYSRW TQDYIAIMGG R
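
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts), and it builds the codon table by hand rather than relying on a library. The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
# Codon table in the conventional TCAG ordering. Assumption: table 11
# shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAGCAAGA CATTTGCCCG CAGCAGCCTG"
print(translate(first_blocks))  # MSKTFARSSL
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.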