Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2201 |
Symbol | hscA |
ID | 7085554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2483028 |
End bp | 2484896 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699221 |
Product | chaperone protein HscA |
Protein accession | YP_002355837 |
Protein GI | 217970603 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR01991] Fe-S protein assembly chaperone HscA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.112267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTGT TGCAAATCAC CGAACCCGGC ATGTCCACCG AGCCGCACCA GCACCGGCTC GCCGTCGGCA TCGACCTCGG GACCACCAAT TCGCTCGTCG CCACCGTGCG CAACGGCATC GCGATCTGCC TGCCCGACGA GGCCGGCCGC ACCATGCTGC CCTCGGCCGT GCGCTATCAC GCCGATGGCG GCATCGAGGT CGGTCTGGGT GCGCTCAAGG CGCAGGCGGT CGATCCGCGC AACACCATCG TCTCGGTCAA GCGCTTCATG GGCCGCGGCC TCAAGGACGT CACCCACATC GAGGCGATGC CCTACGACTT CGAGGACAGC CCCGGGATGG TGCGGCTGCG CACGGTGCAG GGCGTCAAGA GCCCGGTGGA GGTCTCCGCC GAGATCCTGC GCCGCCTGCG CGAACGCGCC GAGGCCAGCC TCGGCGGCGC GCTGGTCGGC GCGGTGATCA CCGTGCCGGC CTACTTCGAC GACGCCCAGC GCCAGGCCAC CAAGGACGCG GCGCGCCTGG CCGGGCTCGA CGTGCTGCGC CTGCTCAACG AACCGACCGC CGCCGCGGTG GCCTACGGCC TCGACAACGC GGCCGAGGGC GTCTACGCGG TGTACGACCT GGGCGGGGGC ACCTTCGACC TCTCCATCCT CAAGCTCTCG CGCGGCGTGT TCGAGGTGCT GTCGACCAAC GGCGACGCCG CGCTCGGCGG CGACGACTTC GATCATCGCC TGTTCTGCTG GATCCTCGAC CGCGCCAAGA TCTCGCCGCC CTCGCTGGAA GACGCGCGCC GCCTGCAGAT GAAGGCGCGC GAGGCCAAGG AGCTGTTGAC GAGCTGTGAC GAGGCCCCGG TGCATGTGCG CCTGGCCTCG GGCGAAGAGG TCGACCTCGT GGTGACCCGC GAGGAGTTTG CCGAGATGAC CACGCATCTG GTCCAGAAGA CCCTGGCCCC GGTGCGCAAG GCCTTGCGCG ACGCCGGCCT CGCGCCCGAC GAGATCAAGG GTGTGGTGAT GGTGGGGGGC GCGACCCGCA TGCCCCACGT CCAGCGCGCG GTCGCGAACT ACTTCGGCCA GGAGCCCCTG ACCAACCTCG ACCCCGACAA GGTCGTCGCA CTGGGCGCGG CGATGCAGGC AAACGTGCTC GCAGGCAACC GCGCCGACCA GGACGACTGG CTGCTGCTCG ACGTCATCCC GCTCTCGCTC GGCCTCGAGA CCATGGGCGG CCTGGTCGAG AAGGTCGTGC CGCGCAACGC CACGCTGCCG ATCGCGCGCG CGCAGGAATT CACCACCTTC AAGGACGGCC AGACCGCGAT GGCCTTCCAT GTCGTGCAGG GCGAGCGCGA GCTCGTCTCC GACTGCCGCT CGCTCGCGCG CTTCGAGCTG CGCGGCATTC CCCCCATGGT GGCGGGCGCG GCGCGCATCC GCGTGACCTT CCAGGTCGAT GCCGACGGCC TGCTGGCGGT GTCGGCGCGC GAGATGTCCT CGGGCGTCGA GGCCAGCGTG CTGGTCAAGC CCTCGTACGG CTTGTCGGAC GACGAGATCG CCGAGATGTT GCGCTCGGGC GTGGATCACG CCGGCGACGA CATGGCCGCG CGCGCGCTGC GCGAGCAGCA GGTCGAGGCC GACCGTGTCG TCGAAGCCAC CGAGCAGGCG CTCGGCCACG ACGGCCACCT GCTCTCCGCC GTCGAGTCCG CCGAAATCCG CAGCGTGATC GCGCGCCTGC GCGAGCTGCG CGCCGGCACC GACAACCGTG CCATCAAGGC CGGCATCGAC GCGCTCGCGC GCGCCACCGA CACCTTCGCC GCGCGCCGCA TGGACAACAG CATTCGCAGC GCGCTGACCG GCCACAAGGT CGACGAACTC CCCATCTGA
|
Protein sequence | MALLQITEPG MSTEPHQHRL AVGIDLGTTN SLVATVRNGI AICLPDEAGR TMLPSAVRYH ADGGIEVGLG ALKAQAVDPR NTIVSVKRFM GRGLKDVTHI EAMPYDFEDS PGMVRLRTVQ GVKSPVEVSA EILRRLRERA EASLGGALVG AVITVPAYFD DAQRQATKDA ARLAGLDVLR LLNEPTAAAV AYGLDNAAEG VYAVYDLGGG TFDLSILKLS RGVFEVLSTN GDAALGGDDF DHRLFCWILD RAKISPPSLE DARRLQMKAR EAKELLTSCD EAPVHVRLAS GEEVDLVVTR EEFAEMTTHL VQKTLAPVRK ALRDAGLAPD EIKGVVMVGG ATRMPHVQRA VANYFGQEPL TNLDPDKVVA LGAAMQANVL AGNRADQDDW LLLDVIPLSL GLETMGGLVE KVVPRNATLP IARAQEFTTF KDGQTAMAFH VVQGERELVS DCRSLARFEL RGIPPMVAGA ARIRVTFQVD ADGLLAVSAR EMSSGVEASV LVKPSYGLSD DEIAEMLRSG VDHAGDDMAA RALREQQVEA DRVVEATEQA LGHDGHLLSA VESAEIRSVI ARLRELRAGT DNRAIKAGID ALARATDTFA ARRMDNSIRS ALTGHKVDEL PI
|
| |