Gene EcDH1_2160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2160 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2315997 
End bp2317547 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content50% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionACX39812 
Protein GI260449390 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0024579 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAT CGATATCGTT TCGTCCCACA TTGCTCGCGC TCGTCCTTGC CACAAATTTC 
CCGGTTGCGC ACGCCGCCGT ACCAAAAGAT ATGCTGGTGA TTGGTAAGGC CGCCGATCCA
CAAACCCTCG ACCCGGCGGT AACAATAGAT AATAACGACT GGACAGTGAC CTACCCGTCT
TATCAGCGGC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC
GATCTGGCAA GTAGCTGGAA AGCGTCTGAC GATCAAAAAG AGTGGACGTT CACCCTGAAA
GATAATGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTTTCTTTT
GAGCGGCTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT
GATGCTCCCG ACGAACATAC GGTGAAGTTT ACCCTTAGCC AACCATTCGC ACCGTTCCTC
TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTCTTAAA GGAACATGCA
GCGGATGATG CTCGCGGCTT CCTCGCGCAA AATACCGCCG GTTCCGGACC ATTTATGCTG
AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCAGGCAAT
AAACCGAACT TCAAACGGGT ATCGGTAAAA ATTATTGGTG AAAGTGCCTC CCGTCGCCTG
CAGCTCTCCC GTGGCGACAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCC
CTGAAGCAGG AAAATAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT TACCTATCTG
TATCTGAATA ACAGCAAAGC GCCTCTTAAT CAGGCGGATC TGCGTCGGGC CATTTCCTGG
TCTACCGATT ATCAGGGCAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAGATGCGC
GGCCCGATTC CGGAAGGCAT GTGGGGCTAC GATGCGACGG CAATGCAATA CAACCATGAC
GAAACGAAAG CCAAAGCCGA ATGGGATAAA GTGACGAGCA AACCCACCAG CCTGACGTTT
CTCTACTCCG ATAACGATCC GAACTGGGAG CCTATTGCTC TGGCGACACA ATCCAGTCTC
AACAAGCTGG GCATCATTGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA
GTGGGTAAAG GTGATTACGA CATTGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG
TATATGTTTA TGAATTACTG GTTTGAGTCA GACAAAAAAG GTCTGCCGGG TAACCGCTCG
TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTTGCGAC CACCGACCAG
ACGCAGCGTA CCCGGGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT
GTGTACCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGAGGTGAA AGGCTTTGTG
TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
 
Protein sequence
MKRSISFRPT LLALVLATNF PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS 
YQRLVQYKTD GDKGSTDVEG DLASSWKASD DQKEWTFTLK DNAKFADGTP VTAEAVKLSF
ERLLKIGQGP AEAFPKDLKI DAPDEHTVKF TLSQPFAPFL YTLANDGASI INPAVLKEHA
ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL
QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW
STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTSKPTSLTF
LYSDNDPNWE PIALATQSSL NKLGIIVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP
YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY
VYLFQKNYQL AMNKEVKGFV FNPMLEQVFN INTMSK