Gene EcDH1_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1142 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1224827 
End bp1226677 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content58% 
IMG OID 
ProductFe-S protein assembly chaperone HscA 
Protein accessionACX38816 
Protein GI260448394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTAT TACAAATTAG TGAACCTGGT TTGAGTGCTG CGCCGCATCA GCGTCGTCTG 
GCGGCCGGTA TTGACCTGGG CACAACCAAC TCGCTGGTGG CGACAGTGCG CAGCGGTCAG
GCCGAAACGT TAGCCGATCA TGAAGGCCGT CACCTGCTGC CATCTGTTGT TCACTATCAA
CAGCAAGGGC ATTCGGTGGG TTATGACGCG CGTACTAATG CAGCGCTCGA TACCGCCAAC
ACAATTAGTT CTGTTAAACG CCTGATGGGA CGCTCGCTGG CTGATATCCA GCAACGCTAT
CCGCATCTGC CTTATCAATT CCAGGCCAGC GAAAACGGCC TGCCGATGAT TGAAACGGCG
GCGGGGCTGC TGAACCCGGT GCGCGTTTCT GCGGACATCC TCAAAGCACT GGCGGCGCGG
GCAACTGAAG CCCTGGCAGG CGAGCTGGAT GGTGTAGTTA TCACCGTTCC GGCGTACTTT
GACGATGCCC AGCGTCAGGG CACCAAAGAC GCGGCGCGTC TGGCGGGCCT TCACGTCCTG
CGCTTACTTA ACGAACCGAC CGCTGCGGCT ATCGCCTACG GGCTGGATTC CGGTCAGGAA
GGCGTGATCG CCGTTTATGA CCTCGGTGGC GGGACGTTTG ATATTTCCAT TCTGCGCTTA
AGTCGCGGCG TGTTTGAAGT GCTGGCAACC GGCGGTGATT CCGCGCTCGG CGGCGATGAT
TTCGACCATC TGCTGGCGGA TTACATTCGC GAGCAGGCGG GCATTCCTGA TCGTAGCGAT
AACCGCGTTC AGCGTGAACT GCTGGATGCC GCCATTGCAG CCAAAATCGC GCTGAGCGAT
GCGGACTCCG TGACCGTTAA CGTTGCGGGC TGGCAGGGCG AAATCAGCCG TGAACAATTC
AATGAACTGA TCGCGCCACT GGTAAAACGA ACCTTACTGG CTTGTCGTCG CGCGCTGAAA
GACGCGGGTG TAGAAGCTGA TGAAGTGCTG GAAGTGGTGA TGGTGGGCGG TTCTACTCGC
GTGCCGCTGG TGCGTGAACG GGTAGGCGAA TTTTTCGGTC GTCCACCGCT GACTTCCATC
GACCCGGATA AAGTCGTCGC TATTGGCGCG GCGATTCAGG CGGATATTCT GGTGGGTAAC
AAGCCAGACA GCGAAATGCT GTTGCTTGAT GTGATCCCAC TGTCGCTGGG CCTCGAAACG
ATGGGCGGCC TGGTGGAGAA AGTGATTCCG CGTAATACCA CTATTCCGGT GGCCCGCGCT
CAGGATTTCA CCACCTTTAA AGATGGTCAG ACGGCGATGT CTATCCATGT AATGCAGGGT
GAGCGCGAAC TGGTGCAGGA CTGCCGCTCA CTGGCGCGTT TTGCGCTGCG TGGTATTCCG
GCGCTACCGG CTGGCGGTGC GCATATTCGC GTGACGTTCC AGGTCGATGC CGACGGTCTT
TTGAGCGTGA CGGCGATGGA GAAATCCACC GGCGTTGAGG CGTCTATTCA GGTCAAACCG
TCTTACGGTC TGACCGATAG CGAAATCGCT TCGATGATCA AAGACTCAAT GAGCTATGCC
GAGCAGGACG TAAAAGCCCG AATGCTGGCA GAACAAAAAG TAGAAGCGGC GCGTGTGCTG
GAAAGTCTGC ACGGCGCGCT GGCTGCTGAT GCCGCGCTGT TAAGCGCCGC AGAACGTCAG
GTCATTGACG ATGCTGCCGC TCACCTGAGT GAAGTGGCGC AGGGCGATGA TGTTGACGCC
ATCGAACAAG CGATTAAAAA CGTAGACAAA CAAACCCAGG ATTTCGCCGC TCGCCGCATG
GACCAGTCGG TTCGTCGTGC GCTGAAAGGC CATTCCGTGG ACGAGGTTTA A
 
Protein sequence
MALLQISEPG LSAAPHQRRL AAGIDLGTTN SLVATVRSGQ AETLADHEGR HLLPSVVHYQ 
QQGHSVGYDA RTNAALDTAN TISSVKRLMG RSLADIQQRY PHLPYQFQAS ENGLPMIETA
AGLLNPVRVS ADILKALAAR ATEALAGELD GVVITVPAYF DDAQRQGTKD AARLAGLHVL
RLLNEPTAAA IAYGLDSGQE GVIAVYDLGG GTFDISILRL SRGVFEVLAT GGDSALGGDD
FDHLLADYIR EQAGIPDRSD NRVQRELLDA AIAAKIALSD ADSVTVNVAG WQGEISREQF
NELIAPLVKR TLLACRRALK DAGVEADEVL EVVMVGGSTR VPLVRERVGE FFGRPPLTSI
DPDKVVAIGA AIQADILVGN KPDSEMLLLD VIPLSLGLET MGGLVEKVIP RNTTIPVARA
QDFTTFKDGQ TAMSIHVMQG ERELVQDCRS LARFALRGIP ALPAGGAHIR VTFQVDADGL
LSVTAMEKST GVEASIQVKP SYGLTDSEIA SMIKDSMSYA EQDVKARMLA EQKVEAARVL
ESLHGALAAD AALLSAAERQ VIDDAAAHLS EVAQGDDVDA IEQAIKNVDK QTQDFAARRM
DQSVRRALKG HSVDEV