Gene EcDH1_2139 details


Gene Information

Locus tag: EcDH1_2139
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2286302
End bp: 2287624
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: HipA N-terminal domain protein
Protein accession: ACX39793
Protein GI: 260449371
COG category:
COG ID:
TIGRFAM ID:
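
The coordinate and length fields above are mutually consistent, which a minimal Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2286302
END_BP = 2287624


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1323 440
```

Both results match the Gene Length (1323 bp) and Protein Length (440 aa) fields listed in the record.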


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.00420525
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAAC TTGTCACTTG GATGAACAAC CAGCGGGTAG GCGAGTTAAC GAAGTTAGCC 
AACGGCGCGC ACACCTTTAA GTATGCACCG GAGTGGTTAG CAAGCCGTTA TGCCAGACCG
TTGTCACTTT CGCTGCCATT GCAGAGGGGG AATATCACCT CTGATGCCGT ATTTAACTTC
TTCGATAACC TGTTACCCGA TAGCCCGATT GTACGTGACC GGATCGTTAA ACGTTATCAT
GCCAAATCCA GACAACCGTT TGATTTATTG TCAGAAATAG GGCGAGACAG CGTTGGTGCC
GTGACGTTAA TACCCGAAGA CGAAACCGTA ACGCATCCGA TAATGGCATG GGAAAAGCTT
ACTGAAGCCA GACTTGAAGA AGTATTAACG GCTTATAAAG CAGATATCCC GCTAGGCATG
ATTAGAGAAG AAAATGACTT TCGCATCTCG GTTGCTGGCG CACAGGAGAA GACAGCACTG
CTCAGAATAG GCAATGACTG GTGCATTCCG AAAGGAATAA CGCCGACGAC GCACATCATT
AAATTACCGA TTGGCGAAAT CAGGCAGCCC AATGCGACGC TCGATCTCAG CCAAAGCGTT
GATAATGAGT ATTACTGTCT GCTGCTGGCG AAAGAACTTG GGTTGAATGT TCCGGACGCA
GAAATCATTA AAGCGGGAAA TGTGCGCGCG TTAGCGGTCG AACGTTTTGA CAGGCGTTGG
AATGCTGAGC GAACGGTTTT ACTTCGCTTG CCACAGGAGG ATATGTGTCA GACATTCGGT
TTACCTTCAT CGGTGAAATA TGAATCAGAT GGAGGCCCAG GCATCGCGCG GATCATGGCT
TTTTTGATGG GGTCCAGCGA GGCGCTGAAA GATCGCTATG ATTTTATGAA ATTCCAGGTC
TTCCAGTGGT TGATTGGCGC AACGGACGGT CATGCAAAAA ACTTCTCCGT ATTTATTCAG
GCTGGCGGCA GTTATCGACT CACGCCATTT TACGACATCA TTTCAGCATT TCCGGTCCTT
GGCGGTACGG GAATACACAT CAGCGATCTC AAACTGGCAA TGGGGCTTAA CGCATCCAAA
GGCAAAAAAA CGGCAATCGA TAAAATTTAT CCGCGACATT TTTTGGCGAC AGCAAAGGTG
CTGAGATTCC CGGAAGTGCA GATGCATGAA ATCCTGAGTG ACTTTGCCAG AATGATTCCA
GCAGCACTGG ATAACGTGAA GACTTCATTA CCGACAGATT TTCCGGAGAA CGTGGTGACG
GCAGTTGAAA GCAATGTGTT GAGGTTGCAT GGACGGTTAA GCCGAGAATA CGGTAGTAAG
TGA
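
The GC content field can be recomputed from a sequence laid out as above, in space-separated blocks of ten bases. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the total length. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: pass the whole gene sequence; only its first line is shown here.
first_line = "ATGCCTAAAC TTGTCACTTG GATGAACAAC CAGCGGGTAG GCGAGTTAAC GAAGTTAGCC"
print(round(gc_percent(first_line), 1))
```

The database's rounding rule is not stated, so a recomputed value may differ from the reported 48% in the last digit.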
 
Protein sequence
MPKLVTWMNN QRVGELTKLA NGAHTFKYAP EWLASRYARP LSLSLPLQRG NITSDAVFNF 
FDNLLPDSPI VRDRIVKRYH AKSRQPFDLL SEIGRDSVGA VTLIPEDETV THPIMAWEKL
TEARLEEVLT AYKADIPLGM IREENDFRIS VAGAQEKTAL LRIGNDWCIP KGITPTTHII
KLPIGEIRQP NATLDLSQSV DNEYYCLLLA KELGLNVPDA EIIKAGNVRA LAVERFDRRW
NAERTVLLRL PQEDMCQTFG LPSSVKYESD GGPGIARIMA FLMGSSEALK DRYDFMKFQV
FQWLIGATDG HAKNFSVFIQ AGGSYRLTPF YDIISAFPVL GGTGIHISDL KLAMGLNASK
GKKTAIDKIY PRHFLATAKV LRFPEVQMHE ILSDFARMIP AALDNVKTSL PTDFPENVVT
AVESNVLRLH GRLSREYGSK
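
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. Translation table 11 (bacterial, archaeal and plant plastid) assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so a standard codon table suffices for a CDS that begins with ATG, as this one does. The `translate` function below is an illustrative helper, not a library call. It assumes the input contains only A, C, G and T, and it stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGCCTAAAC TTGTCACTTG GATGAACAAC"))  # MPKLVTWMNN
```

The output matches the first ten residues of the protein sequence listed in this record.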