Gene Information |
Locus tag | SbBS512_E2901 |
Symbol | hscA |
ID | 6269815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2706206 |
End bp | 2708056 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641726846 |
Product | chaperone protein HscA |
Protein accession | YP_001881317 |
Protein GI | 187730311 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR01991] Fe-S protein assembly chaperone HscA |
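The coordinate and length fields in the table above can be cross-checked with a short calculation. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. Both points are assumptions rather than something the record states, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(2706206, 2708056)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (1851 bp) and Protein Length (616 aa) fields listed in the record.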
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.87079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTAT TACAAATTAG TGAACCTGGT TTGAGTGCTG CGCCGCATCA GCGTCGTCTG GCGGCCGGTA TTGACCTGGG CACAACCAAC TCGCTGGTAG CGACAGTGCG CAGCGGTCAG GCCGAAACGT TAGCCGACCA TGAAGGCCGT CACCTGCTGC CATCTGTTGT TCACTATCAA CAGCAAGGGC ATTCGGTGGG TTATGACGCG CGTACTAACG CAGCGCTCGA TACCGCCAAC ACCATTAGTT CTGTTAAACG CCTGATGGGA CGCTCGCTGG CTGATATCCA GCAACGCTAT CCGCATTTGC CTTATCAATT CCAGGCCAGC GAAAACGGCC TGCCAATGAT TGAAACGGCG GCGGGGCTGC TGAACCCGGT GCGCGTTTCT GCGGACATCC TCAAAGCACT GGCGGCGCGG GCAACTGAAG CCCTGGCAGG CGAGCTGGAT GGTGTCGTTA TCACCGTTCC GGCGTACTTT GACGATGCCC AGCGTCAGGG CACCAAAGAC GCGGCGCGTC TGGCGGGGTT GCATGTTCTG CGCTTACTTA ACGAACCGAC CGCTGCGGCT ATCGCCTACG GGCTGGATTC CGGTCAGGAA GGCGTGATCG CCGTTTATGA CCTCGGTGGC GGGACGTTTG ATATTTCCAT TCTGCGCTTA AGTCGCGGCG TGTTTGAAGT GCTGGCAACC GGCGGTGATT CCGCGCTCGG CGGCGATGAT TTCGACCATC TGCTGGCGGA TTACATTCGC GAGCAGGCGG ACATTCCTGA TCGTAGCGAT AACCGCGTTC AGCGTGAACT GCTGGATGCC GCCATTGCAG CCAAAATCGC GCTGAGCGAT GCGGACTCCG TGACCGTTAA CGTTGCGGGC TGGCAGGGCG AAATCAGCCG TGAACAATTC AATGAACTGA TCGCGCCACT GGTAAAACGA ACCTTACTGG CTTGTCGTCG TGCGCTGAAA GACGCGGGCG TAGAAGCTGA TGAAGTGCTG GAAGTGGTGA TGGTTGGCGG TTCTACTCGC GTGCCGCTGG TGCGTGAACG GGTAGGCGAA TTTTTCGGTC GTCCACCGCT GACTTCCATC GACCCGGATA AAGTCGTCGC TATTGGCGCG GCGATTCAGG CGGATATTCT GGTGGGTAAC AAGCCGGACA GCGAAATGCT GCTGCTTGAT GTGATCCCAC TGTCGCTGGG CCTCGAAACG ATGGGCGGCC TGGTGGAGAA AGTGATTCCG CGTAATACCA CTATTCCGGT GGCCCGCGCT CAGGATTTCA CCACCTTTAA AGATGGTCAG ACGGCGATGT CTATCCATGT AATGCAGGGT GAGCGCGAAC TGGTGCAGGA CTGCCGCTCA CTGGCGCGTT TTGCGCTGCG TGGTATTCCG GCGCTACCGG CTGGCGGTGC GCATATTCGC GTGACGTTCC AGGTCGATGC CGACGGTCTT TTGAGCGTGA CGGCGATGGA GAAATCCACC GGCGTTGAGG CGTCTATTCA GGTCAAACCT TCTTACGGTC TGACCGATAG CGAAATCGCT TCGATGATCA AAGATTCAAT GAGCTATGCC GAGCAGGACG TAAAAGCCCG TATGCTGGCA GAACAAAAAG TAGAAGCGGC GCGTGTGCTG GAAAGTCTGC ACGGCGCGCT GGCTGCTGAT GCCGCGCTGT TAAGCGCCGC AGAGCGTCAG GTCATTGACA ATGCTGCCGC TCACCTGAGT GAAGTGGCTC AGGGCGATGA TGTTGACGCC ATCGAACAAG CGATTAAAAA CGTAGACAAA CAAACCCAGG ATTTCGCCGC TCGCCGCATG 
GACCAGTCGG TTCGTCGTGC GCTGAAAGGC CATTCCGTGG ACGAGGTTTA A
|
Protein sequence | MALLQISEPG LSAAPHQRRL AAGIDLGTTN SLVATVRSGQ AETLADHEGR HLLPSVVHYQ QQGHSVGYDA RTNAALDTAN TISSVKRLMG RSLADIQQRY PHLPYQFQAS ENGLPMIETA AGLLNPVRVS ADILKALAAR ATEALAGELD GVVITVPAYF DDAQRQGTKD AARLAGLHVL RLLNEPTAAA IAYGLDSGQE GVIAVYDLGG GTFDISILRL SRGVFEVLAT GGDSALGGDD FDHLLADYIR EQADIPDRSD NRVQRELLDA AIAAKIALSD ADSVTVNVAG WQGEISREQF NELIAPLVKR TLLACRRALK DAGVEADEVL EVVMVGGSTR VPLVRERVGE FFGRPPLTSI DPDKVVAIGA AIQADILVGN KPDSEMLLLD VIPLSLGLET MGGLVEKVIP RNTTIPVARA QDFTTFKDGQ TAMSIHVMQG ERELVQDCRS LARFALRGIP ALPAGGAHIR VTFQVDADGL LSVTAMEKST GVEASIQVKP SYGLTDSEIA SMIKDSMSYA EQDVKARMLA EQKVEAARVL ESLHGALAAD AALLSAAERQ VIDNAAAHLS EVAQGDDVDA IEQAIKNVDK QTQDFAARRM DQSVRRALKG HSVDEV
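The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), and it builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard amino-acid string is used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces between 10-base blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 21 bases of the gene sequence listed above.
head = "ATGGCCTTAT TACAAATTAG TGAACCTGGT"
tail = "CATTCCGTGG ACGAGGTTTA A"
print(translate(head))
print(translate(tail))
```

The two fragments translate to the first ten residues of the listed protein and to its last six residues followed by the stop symbol. For full-length work, an established library such as Biopython would be the more usual choice than a hand-built table.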