Gene ECD_02418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02418 
SymbolhscA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2522890 
End bp2524740 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content58% 
IMG OID 
Productchaperone protein HscA 
Protein accessionACT44238 
Protein GI253978568 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.89877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTAT TACAAATTAG TGAACCTGGT TTGAGTGCCG CGCCGCATCA GCGTCGTCTG 
GCGGCCGGTA TTGACCTGGG CACAACCAAC TCGCTGGTGG CGACAGTGCG CAGCGGTCAG
GCCGAAACGT TAGCCGATCA TGAAGGCCGT CACCTGCTGC CATCTGTTGT TCACTATCAA
CAGCAAGGGC ATTCGGTGGG TTATGACGCG CGTACTAATG CAGCGCTCGA TACCGCCAAC
ACAATTAGTT CTGTTAAACG CCTGATGGGA CGCTCGCTGG CTGATATCCA GCAACGCTAT
CCGCATCTGC CTTATCAATT CCAGGCCAGC GAAAACGGCC TGCCGATGAT TGAAACGGCG
GCGGGGCTGC TGAACCCGGT GCGCGTTTCT GCGGACATCC TCAAAGCACT GGCGGCGCGG
GCAACTGAAG CCCTGGCAGG CGAGCTGGAT GGTGTAGTTA TCACCGTTCC GGCGTACTTT
GACGATGCCC AGCGTCAGGG CACCAAAGAC GCGGCGCGTC TGGCGGGCCT TCACGTCCTG
CGCTTACTTA ACGAACCGAC CGCTGCGGCT ATCGCCTACG GGCTGGATTC CGGTCAGGAA
GGCGTGATCG CCGTTTATGA CCTCGGTGGC GGGACGTTTG ATATTTCCAT TCTGCGCTTA
AGTCGCGGCG TGTTTGAAGT GCTGGCAACC GGCGGTGATT CCGCGCTCGG CGGCGATGAT
TTCGACCATC TGCTGGCGGA TTACATTCGC GAGCAGGCGG GCATTCCTGA TCGTAGCGAT
AACCGCGTTC AGCGTGAACT GCTGGATGCC GCCATTGCAG CCAAAATCGC GCTGAGCGAT
GCGGACTCCG TGACCGTTAA CGTTGCGGGC TGGCAGGGCG AAATCAGCCG TGAACAATTC
AATGAACTGA TCGCGCCACT GGTAAAACGA ACCTTACTGG CTTGTCGTCG CGCGCTGAAA
GACGCGGGTG TAGAAGCTGA TGAAGTGCTG GAAGTGGTGA TGGTGGGCGG TTCTACTCGC
GTGCCGCTGG TGCGTGAACG GGTAGGCGAA TTTTTCGGTC GTCCACCGCT GACTTCCATC
GACCCGGATA AAGTCGTCGC TATTGGCGCG GCGATTCAGG CGGATATTCT GGTGGGTAAC
AAGCCAGACA GCGAAATGCT GCTGCTTGAT GTGATCCCAC TGTCGCTGGG CCTCGAAACG
ATGGGCGGTC TGGTGGAGAA AGTGATTCCG CGTAATACCA CTATTCCGGT GGCCCGCGCT
CAGGATTTCA CCACCTTTAA AGATGGTCAG ACGGCGATGT CTATCCATGT AATGCAGGGT
GAGCGCGAAC TGGTGCAGGA CTGCCGCTCA CTGGCGCGTT TTGCGCTGCG TGGTATTCCG
GCGCTACCGG CTGGCGGTGC GCATATTCGC GTGACGTTCC AGGTCGATGC CGACGGTCTT
TTGAGCGTGA CGGCGATGGA GAAATCCACC GGCGTTGAGG CGTCTATTCA GGTCAAACCG
TCTTACGGTC TGACTGACAG CGAAATCGCT TCGATGATCA AAGACTCAAT GAGCTATGCC
GAGCAGGACG TAAAAGCCCG AATGCTGGCA GAACAAAAAG TAGAAGCGGC GCGTGTGCTG
GAAAGTCTGC ACGGCGCGCT GGCTGCTGAT GCCGCGCTGT TAAGCGCCGC AGAACGTCAG
GTCATTGACG ATGCTGCCGC TCACCTGAGT GAAGTGGCGC AGGGCGATGA TGTTGACGCC
ATCGAAAAAG CGATTAAAAA CGTAGACAAA CAAACCCAGG ATTTCGCCGC TCGCCGCATG
GACCAGTCGG TTCGTCGTGC GCTGAAAGGC CATTCCGTGG ACGAGGTTTA A
 
Protein sequence
MALLQISEPG LSAAPHQRRL AAGIDLGTTN SLVATVRSGQ AETLADHEGR HLLPSVVHYQ 
QQGHSVGYDA RTNAALDTAN TISSVKRLMG RSLADIQQRY PHLPYQFQAS ENGLPMIETA
AGLLNPVRVS ADILKALAAR ATEALAGELD GVVITVPAYF DDAQRQGTKD AARLAGLHVL
RLLNEPTAAA IAYGLDSGQE GVIAVYDLGG GTFDISILRL SRGVFEVLAT GGDSALGGDD
FDHLLADYIR EQAGIPDRSD NRVQRELLDA AIAAKIALSD ADSVTVNVAG WQGEISREQF
NELIAPLVKR TLLACRRALK DAGVEADEVL EVVMVGGSTR VPLVRERVGE FFGRPPLTSI
DPDKVVAIGA AIQADILVGN KPDSEMLLLD VIPLSLGLET MGGLVEKVIP RNTTIPVARA
QDFTTFKDGQ TAMSIHVMQG ERELVQDCRS LARFALRGIP ALPAGGAHIR VTFQVDADGL
LSVTAMEKST GVEASIQVKP SYGLTDSEIA SMIKDSMSYA EQDVKARMLA EQKVEAARVL
ESLHGALAAD AALLSAAERQ VIDDAAAHLS EVAQGDDVDA IEKAIKNVDK QTQDFAARRM
DQSVRRALKG HSVDEV