Gene EcHS_A2677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2677 
SymbolhscA 
ID5591483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2694604 
End bp2696454 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content58% 
IMG OID640921795 
Productchaperone protein HscA 
Protein accessionYP_001459319 
Protein GI157162001 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR01991] Fe-S protein assembly chaperone HscA 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTAT TACAAATTAG TGAACCTGGT TTGAGTGCTG CGCCGCATCA GCGTCGTCTG 
GCGGCCGGTA TTGACCTGGG CACAACCAAC TCGCTGGTGG CGACAGTGCG CAGCGGTCAG
GCCGAAACGT TAGCCGATCA TGAAGGCCGT CACCTGCTGC CATCTGTTGT TCACTATCAA
CAGCAAGGGC ATTCGGTGGG TTATGACGCG CGTACTAATG CAGCGCTCGA TACCGCCAAC
ACAATTAGTT CTGTTAAACG CCTGATGGGA CGCTCGCTGG CTGATATCCA GCAACGCTAT
CCGCATCTGC CTTATCAATT CCAGGCCAGC GAAAACGGCC TGCCGATGAT TGAAACGGCG
GCGGGGCTGC TGAACCCGGT GCGCGTTTCT GCGGACATCC TCAAAGCACT GGCGGCGCGG
GCAACTGAAG CCCTGGCAGG CGAGCTGGAT GGTGTAGTTA TCACCGTTCC GGCGTACTTT
GACGATGCCC AGCGTCAGGG CACCAAAGAC GCGGCGCGTC TGGCGGGCCT TCACGTCCTG
CGCTTACTTA ACGAACCGAC CGCTGCGGCT ATCGCCTACG GGCTGGATTC CGGTCAGGAA
GGCGTGATCG CCGTTTATGA CCTCGGTGGC GGGACGTTTG ATATTTCCAT TCTGCGCTTA
AGTCGCGGCG TGTTTGAAGT GCTGGCAACC GGCGGTGATT CCGCGCTCGG CGGCGATGAT
TTCGACCATC TGCTGGCGGA TTACATTCGC GAGCAGGCGG GCATTCCTGA TCGTAGCGAT
AACCGCGTTC AGCGTGAACT GCTGGATGCC GCCATTGCAG CCAAAATCGC GCTGAGCGAT
GCGGACTCCG TGACCGTTAA CGTTGCGGGC TGGCAGGGCG AAATCAGCCG TGAACAATTC
AATGAACTGA TCGCGCCACT GGTAAAACGA ACCTTACTGG CTTGTCGTCG CGCGCTGAAA
GACGCGGGTG TAGAAGCTGA TGAAGTGCTG GAAGTGGTGA TGGTGGGCGG TTCTACTCGC
GTGCCGCTGG TGCGTGAACG GGTAGGCGAA TTTTTCGGTC GTCCACCGCT GACTTCCATC
GACCCGGATA AAGTCGTCGC TATTGGCGCG GCGATTCAGG CGGATATTCT GGTGGGTAAC
AAGCCAGACA GCGAAATGCT GTTGCTTGAT GTGATCCCAC TGTCGCTGGG CCTCGAAACG
ATGGGCGGCC TGGTGGAGAA AGTGATTCCG CGTAATACCA CTATTCCGGT GGCCCGCGCT
CAGGATTTCA CCACCTTTAA AGATGGTCAG ACGGCGATGT CTATCCATGT AATGCAGGGT
GAGCGCGAAC TGGTGCAGGA CTGCCGCTCA CTGGCGCGTT TTGCGCTGCG TGGTATTCCG
GCGCTACCGG CTGGCGGTGC GCATATTCGC GTGACGTTCC AGGTCGATGC CGACGGTCTT
TTGAGCGTGA CGGCGATGGA GAAATCCACC GGCGTTGAGG CGTCTATTCA GGTCAAACCG
TCTTACGGTC TGACCGATAG CGAAATCGCT TCGATGATCA AAGACTCAAT GAGCTATGCC
GAGCAGGACG TAAAAGCCCG AATGCTGGCA GAACAAAAAG TAGAAGCGGC GCGTGTGCTG
GAAAGTCTGC ACGGCGCGCT GGCTGCTGAT GCCGCGCTGT TAAGCGCCGC AGAACGTCAG
GTCATTGACG ATGCTGCCGC TCACCTGAGT GAAGTGGCGC AGGGCGATGA TGTTGACGCC
ATCGAACAAG CGATTAAAAA CGTAGACAAA CAAACCCAGG ATTTCGCCGC TCGCCGCATG
GACCAGTCGG TTCGTCGTGC GCTGAAAGGC CATTCCGTGG ACGAGGTTTA A
 
Protein sequence
MALLQISEPG LSAAPHQRRL AAGIDLGTTN SLVATVRSGQ AETLADHEGR HLLPSVVHYQ 
QQGHSVGYDA RTNAALDTAN TISSVKRLMG RSLADIQQRY PHLPYQFQAS ENGLPMIETA
AGLLNPVRVS ADILKALAAR ATEALAGELD GVVITVPAYF DDAQRQGTKD AARLAGLHVL
RLLNEPTAAA IAYGLDSGQE GVIAVYDLGG GTFDISILRL SRGVFEVLAT GGDSALGGDD
FDHLLADYIR EQAGIPDRSD NRVQRELLDA AIAAKIALSD ADSVTVNVAG WQGEISREQF
NELIAPLVKR TLLACRRALK DAGVEADEVL EVVMVGGSTR VPLVRERVGE FFGRPPLTSI
DPDKVVAIGA AIQADILVGN KPDSEMLLLD VIPLSLGLET MGGLVEKVIP RNTTIPVARA
QDFTTFKDGQ TAMSIHVMQG ERELVQDCRS LARFALRGIP ALPAGGAHIR VTFQVDADGL
LSVTAMEKST GVEASIQVKP SYGLTDSEIA SMIKDSMSYA EQDVKARMLA EQKVEAARVL
ESLHGALAAD AALLSAAERQ VIDDAAAHLS EVAQGDDVDA IEQAIKNVDK QTQDFAARRM
DQSVRRALKG HSVDEV