Gene EcDH1_1974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1974 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2129743 
End bp2131347 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content48% 
IMG OID 
Productconserved hypothetical protein 
Protein accessionACX39631 
Protein GI260449209 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.000437192 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCTATTGT GGGTGCCGGG CCTACGGGGA TCTACACCTT ATTCTCGCTT 
CTACAGCAAC AAACTCCACT TTCTATTTCT ATCTTCGAGC AGGCTGACGA GGCCGGTGTC
GGGATGCCAT ACAGTGATGA GGAAAACTCA AAAATGATGC TGGCAAATAT TGCCAGTATT
GAAATACCGC CGATTTATTG TACGTATCTC GAATGGCTAC AAAAGCAAGA AGACAGCCAT
CTCCAGCGTT ATGGCGTTAA AAAAGAAACC TTGCACGATC GTCAGTTTTT ACCGCGAATT
CTGCTGGGCG AATATTTCCG CGATCAATTT TTACGACTAG TAGACCAGGC ACGACAGCAA
AAATTTGCAG TGGCTGTTTA TGAATCATGC CAGGTTACCG ATCTGCAAAT TACAAATGCT
GGCGTCATGC TCGCTACAAA TCAGGATTTA CCCAGCGAGA CGTTTGATTT AGCGGTGATC
GCCACGGGTC ACGTCTGGCC TGATGAAGAA GAAGCAACCC GAACGTATTT TCCCAGCCCG
TGGTCAGGCC TGATGGAAGC AAAGGTCGAT GCGTGTAACG TGGGTATTAT GGGAACATCC
TTGAGCGGAC TGGATGCGGC AATGGCAGTG GCTATTCAGC ATGGTTCGTT CATTGAAGAT
GATAAACAAC ACGTCGTTTT TCACCGCGAT AACGCAAGTG AAAAGCTAAA TATCACGTTG
TTGTCGCGCA CGGGTATTTT ACCCGAAGCC GATTTCTATT GCCCTATTCC CTACGAGCCC
TTACACATCG TCACCGATCA GGCATTAAAT GCTGAGATTC AAAAAGGCGA AGAGGGCCTT
TTGGATCGGG TATTTAGATT GATAGTAGAG GAAATCAAGT TTGCTGATCC AGACTGGAGT
CAACGCATAG CCTTAGAGAG CCTGAATGTC GATTCCTTTG CTCAAGCCTG GTTTGCCGAG
CGCAAACAAC GCGACCCATT TGACTGGGCA GAAAAAAATC TCCAGGAAGT CGAACGCAAT
AAACGAGAAA AACATACTGT TCCCTGGCGT TATGTCATTC TGCGCCTGCA TGAAGCCGTA
CAGGAAATTG TTCCACATCT GAATGAACAC GACCATAAAC GGTTCAGTAA AGGCCTTGCC
CGGGTTTTCA TCGATAATTA TGCGGCAATC CCTTCAGAGT CTATTCGTCG CCTACTTGCC
TTACGTGAAG CGGGAATCAT TCATATTCTC GCCCTCGGTG AAGACTACAA AATGGAAATT
AATGAGTCGC GCACCGTCCT GAAAACGGAA GACAACAGCT ACTCGTTTGA CGTTTTTATT
GATGCCCGCG GACAACGTCC GCTTAAAGTG AAAGATATCC CTTTCCCTGG GCTACGCGAG
CAATTACAGA AAACAGGGGA TGAAATCCCT GATGTTGGCG AAGATTATAC GTTACAGCAA
CCCGAAGATA TTCGTGGGCG CGTAGCGTTC GGCGCGTTGC CCTGGTTGAT GCACGACCAG
CCTTTCGTTC AGGGACTTAC GGCATGTGCA GAAATTGGTG AGGCGATGGC TCGGGCGGTC
GTAAAGCCTG CATCCCGTGC TCGTCGGCGT CTTTCGTTTG ATTAA
 
Protein sequence
MKKIAIVGAG PTGIYTLFSL LQQQTPLSIS IFEQADEAGV GMPYSDEENS KMMLANIASI 
EIPPIYCTYL EWLQKQEDSH LQRYGVKKET LHDRQFLPRI LLGEYFRDQF LRLVDQARQQ
KFAVAVYESC QVTDLQITNA GVMLATNQDL PSETFDLAVI ATGHVWPDEE EATRTYFPSP
WSGLMEAKVD ACNVGIMGTS LSGLDAAMAV AIQHGSFIED DKQHVVFHRD NASEKLNITL
LSRTGILPEA DFYCPIPYEP LHIVTDQALN AEIQKGEEGL LDRVFRLIVE EIKFADPDWS
QRIALESLNV DSFAQAWFAE RKQRDPFDWA EKNLQEVERN KREKHTVPWR YVILRLHEAV
QEIVPHLNEH DHKRFSKGLA RVFIDNYAAI PSESIRRLLA LREAGIIHIL ALGEDYKMEI
NESRTVLKTE DNSYSFDVFI DARGQRPLKV KDIPFPGLRE QLQKTGDEIP DVGEDYTLQQ
PEDIRGRVAF GALPWLMHDQ PFVQGLTACA EIGEAMARAV VKPASRARRR LSFD