Gene EcDH1_2336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2336 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2506476 
End bp2507768 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content50% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionACX39979 
Protein GI260449557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.193691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAT CAAAAATCGT GCTGTTATCA GCACTGGTTT CATGCGCCCT GATTTCAGGC 
TGTAAAGAAG AAAATAAAAC GAATGTATCC ATCGAATTTA TGCATTCTTC GGTGGAGCAG
GAGCGCCAGG CCGTTATCAG TAAATTGATT GCCCGTTTTG AAAAAGAAAA CCCTGGCATC
ACCGTTAAGC AAGTGCCCGT GGAAGAAGAT GCCTATAACA CTAAAGTCAT TACTCTTTCA
CGTAGCGGTT CGCTGCCGGA AGTGATCGAA ACCAGCCATG ACTACGCCAA AGTGATGGAC
AAAGAGCAGC TTATCGATCG CAAAGCGGTT GCCACAGTCA TCAGCAACGT TGGTGAAGGC
GCGTTTTACG ATGGCGTACT GCGTATTGTG CGTACCGAAG ATGGTAGCGC ATGGACCGGT
GTTCCTGTCA GCGCCTGGAT TGGCGGTATC TGGTATCGCA AAGATGTGCT GGCAAAAGCG
GGGCTTGAGG AGCCGAAAAA CTGGCAACAG CTGCTGGACG TTGCACAGAA ACTGAATGAC
CCGGCGAATA AAAAATACGG CATTGCGCTG CCTACAGCAG AAAGCGTGTT GACGGAACAA
TCCTTCTCCC AGTTTGCGTT ATCCAACCAG GCTAACGTCT TTAACGCCGA AGGCAAAATC
ACCCTTGATA CACCAGAGAT GATGCAGGCA CTGACCTATT ACCGCGACCT TACTGCCAAC
ACTATGCCGG GTTCTAACGA CATCATGGAA GTGAAAGACG CCTTTATGAA CGGCACCGCG
CCGATGGCGA TTTACTCCAC CTATATCCTT CCGGCTGTGA TTAAAGAAGG CGACCCGAAA
AACGTCGGTT TCGTGGTGCC AACCGAGAAA AACTCTGCGG TCTACGGCAT GTTGACCTCG
CTGACCATTA CCGCCGGGCA AAAGACCGAA GAGACGGAAG CAGCAGAAAA ATTTGTCACC
TTTATGGAGC AGGCAGACAA CATTGCCGAC TGGGTGATGA TGTCGCCAGG TGCTGCGCTG
CCGGTGAATA AAGCGGTGGT GACTACCGCC ACCTGGAAAG ACAACGACGT TATTAAGGCG
CTGGGTGAAC TACCGAATCA GCTAATCGGT GAACTGCCAA ATATTCAGGT TTTTGGCGCA
GTAGGGGATA AAAACTTTAC CCGCATGGGT GATGTGACGG GTTCTGGCGT GGTGAGTTCA
ATGGTGCATA ACGTCACCGT GGGTAAAGCC GATCTCTCTA CTACGCTGCA AGCGAGCCAG
AAAAAACTGG ATGAACTGAT CGAACAGCAC TAA
 
Protein sequence
MIKSKIVLLS ALVSCALISG CKEENKTNVS IEFMHSSVEQ ERQAVISKLI ARFEKENPGI 
TVKQVPVEED AYNTKVITLS RSGSLPEVIE TSHDYAKVMD KEQLIDRKAV ATVISNVGEG
AFYDGVLRIV RTEDGSAWTG VPVSAWIGGI WYRKDVLAKA GLEEPKNWQQ LLDVAQKLND
PANKKYGIAL PTAESVLTEQ SFSQFALSNQ ANVFNAEGKI TLDTPEMMQA LTYYRDLTAN
TMPGSNDIME VKDAFMNGTA PMAIYSTYIL PAVIKEGDPK NVGFVVPTEK NSAVYGMLTS
LTITAGQKTE ETEAAEKFVT FMEQADNIAD WVMMSPGAAL PVNKAVVTTA TWKDNDVIKA
LGELPNQLIG ELPNIQVFGA VGDKNFTRMG DVTGSGVVSS MVHNVTVGKA DLSTTLQASQ
KKLDELIEQH