Gene EcDH1_3849 details


Gene Information

Locus tag: EcDH1_3849
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4141910
End bp: 4143556
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: chaperonin GroEL
Protein accession: ACX41451
Protein GI: 260451029
COG category:
COG ID:
TIGRFAM ID:
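
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the CDS ends in a stop codon that is not counted in the protein length. The following is a minimal sketch of that arithmetic; the function names are hypothetical and the coordinate convention is an assumption, since the record does not state it.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(gene_length(4141910, 4143556))  # 1647
print(protein_length(1647))           # 548
```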


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGCTA AAGACGTAAA ATTCGGTAAC GACGCTCGTG TGAAAATGCT GCGCGGCGTA 
AACGTACTGG CAGATGCAGT GAAAGTTACC CTCGGTCCAA AAGGCCGTAA CGTAGTTCTG
GATAAATCTT TCGGTGCACC GACCATCACC AAAGATGGTG TTTCCGTTGC TCGTGAAATC
GAACTGGAAG ACAAGTTCGA AAATATGGGT GCGCAGATGG TGAAAGAAGT TGCCTCTAAA
GCAAACGACG CTGCAGGCGA CGGTACCACC ACTGCAACCG TACTGGCTCA GGCTATCATC
ACTGAAGGTC TGAAAGCTGT TGCTGCGGGC ATGAACCCGA TGGACCTGAA ACGTGGTATC
GACAAAGCGG TTACCGTTGC AGTTGAAGAA CTGAAAGCGC TGTCCGTACC ATGCTCTGAC
TCTAAAGCGA TTGCTCAGGT TGGTACCATC TCCGCTAACT CCGACGAAAC CGTAGGTAAA
CTGATCGCTG AAGCGATGGA CAAAGTCGGT AAAGAAGGCG TTATCACCGT TGAAGACGGT
ACCGGTCTGC AGGACGAACT GGACGTGGTT GAAGGTATGC AGTTCGACCG TGGCTACCTG
TCTCCTTACT TCATCAACAA GCCGGAAACT GGCGCAGTAG AACTGGAAAG CCCGTTCATC
CTGCTGGCTG ACAAGAAAAT CTCCAACATC CGCGAAATGC TGCCGGTTCT GGAAGCTGTT
GCCAAAGCAG GCAAACCGCT GCTGATCATC GCTGAAGATG TAGAAGGCGA AGCGCTGGCA
ACTCTGGTTG TTAACACCAT GCGTGGCATC GTGAAAGTCG CTGCGGTTAA AGCACCGGGC
TTCGGCGATC GTCGTAAAGC TATGCTGCAG GATATCGCAA CCCTGACTGG CGGTACCGTG
ATCTCTGAAG AGATCGGTAT GGAGCTGGAA AAAGCAACCC TGGAAGACCT GGGTCAGGCT
AAACGTGTTG TGATCAACAA AGACACCACC ACTATCATCG ATGGCGTGGG TGAAGAAGCT
GCAATCCAGG GCCGTGTTGC TCAGATCCGT CAGCAGATTG AAGAAGCAAC TTCTGACTAC
GACCGTGAAA AACTGCAGGA ACGCGTAGCG AAACTGGCAG GCGGCGTTGC AGTTATCAAA
GTGGGTGCTG CTACCGAAGT TGAAATGAAA GAGAAAAAAG CACGCGTTGA AGATGCCCTG
CACGCGACCC GTGCTGCGGT AGAAGAAGGC GTGGTTGCTG GTGGTGGTGT TGCGCTGATC
CGCGTAGCGT CTAAACTGGC TGACCTGCGT GGTCAGAACG AAGACCAGAA CGTGGGTATC
AAAGTTGCAC TGCGTGCAAT GGAAGCTCCG CTGCGTCAGA TCGTATTGAA CTGCGGCGAA
GAACCGTCTG TTGTTGCTAA CACCGTTAAA GGCGGCGACG GCAACTACGG TTACAACGCA
GCAACCGAAG AATACGGCAA CATGATCGAC ATGGGTATCC TGGATCCAAC CAAAGTAACT
CGTTCTGCTC TGCAGTACGC AGCTTCTGTG GCTGGCCTGA TGATCACCAC CGAATGCATG
GTTACCGACC TGCCGAAAAA CGATGCAGCT GACTTAGGCG CTGCTGGCGG TATGGGCGGC
ATGGGTGGCA TGGGCGGCAT GATGTAA
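
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below joins the blocks and computes a GC percentage. It assumes GC content means the fraction of G and C over all bases; the record does not say how its 53% figure was derived. The helper names `join_blocks` and `gc_percent` are hypothetical.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases over the full sequence length."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCAGCTA AAGACGTAAA ATTCGGTAAC GACGCTCGTG TGAAAATGCT GCGCGGCGTA"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```

Passing the full sequence text instead of the first line gives the value to compare against the GC content field.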
 
Protein sequence
MAAKDVKFGN DARVKMLRGV NVLADAVKVT LGPKGRNVVL DKSFGAPTIT KDGVSVAREI 
ELEDKFENMG AQMVKEVASK ANDAAGDGTT TATVLAQAII TEGLKAVAAG MNPMDLKRGI
DKAVTVAVEE LKALSVPCSD SKAIAQVGTI SANSDETVGK LIAEAMDKVG KEGVITVEDG
TGLQDELDVV EGMQFDRGYL SPYFINKPET GAVELESPFI LLADKKISNI REMLPVLEAV
AKAGKPLLII AEDVEGEALA TLVVNTMRGI VKVAAVKAPG FGDRRKAMLQ DIATLTGGTV
ISEEIGMELE KATLEDLGQA KRVVINKDTT TIIDGVGEEA AIQGRVAQIR QQIEEATSDY
DREKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG VVAGGGVALI
RVASKLADLR GQNEDQNVGI KVALRAMEAP LRQIVLNCGE EPSVVANTVK GGDGNYGYNA
ATEEYGNMID MGILDPTKVT RSALQYAASV AGLMITTECM VTDLPKNDAA DLGAAGGMGG
MGGMGGMM
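
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch using only the standard library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons; the function name `translate` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored so block-formatted sequences can be passed directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in the record.
print(translate("ATGGCAGCTA AAGACGTAAA ATTCGGTAAC"))  # MAAKDVKFGN
print(translate("ATGGGTGGCA TGGGCGGCAT GATGTAA"))     # MGGMGGMM
```

Both outputs match the first and last residues of the protein sequence listed above.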