Gene Information |
Locus tag | Athe_0592 |
Symbol | |
ID | 7406933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 665294 |
End bp | 666559 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714975 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_002572491 |
Protein GI | 222528609 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
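The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 665294
END_BP = 666559
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon.

    Assumes the CDS length includes the stop codon and is unspliced.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 666559 - 665294 + 1 gives 1266 bp, and 1266 / 3 - 1 gives 421 aa, matching the listed gene and protein lengths.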
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000556243 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAATCC AAGCACCAAA AGGAACAAAG GATGTGCTTC CAGAGGAAAG TTATATGTGG CAATATGTTG AGAATAAATT CAGAGAGGTT TGCAAGCTTT ATGGATATCA GGAAGTTAGA TTTCCCACAT TTGAGTACAC AGAGCTTTTC CAAAGAGGAG TGGGCGATAC TACTGATATT GTCCAAAAAG AGATGTATAC CTTTTTGGAC AAGGGTGGAA GGAGCATTAC TTTAAGGCCA GAAGGGACAG CATCAACAGC AAGACTATTT ATCGAGCATG GTTTTGCGTC ACGTCCGATG CCACAGAGAT TTTACTACAT TATTTCAGCT TTCAGGTATG AAAATACTCA AGGCGGAAGG TTTAGAGAGT TTCACCAGTT TGGAATTGAG AATTTTGGTT CTTCTTCGCC TGTAACAGAT GCAGAGGTAA TTTCACTTGC TTACAATTTT TTTACAAGCC TTGGGCTTGA CAATATTACT GTGAATATCA ACAGCATCGG ATGTCCTGTA TGCAGAAAAG AATATGTGAA AAATTTAAAA GAGTATTTTT CGGCAAATTC TCAAAAGCTC TGTCATACAT GCCACCAAAG GCTTGACAAA AATCCTATGA GAATTTTGGA TTGTAAAGAA GAGGGTTGCA AGCTGATTAC AAAAGACGCA CCAAAACCAA TAGATTATCT TTGTGATGAT TGTAAAAGCC ATTTTGAAAG TGTGAAAACT TACTTAGATT CAGCTATGGT TTCATACAAG GTTGACCCAT TTATTGTTCG CGGCTTGGAC TACTACACAA AAACAGTTTT TGAGATTGTT GCCACTGTCT CTGATAAAGA GCTTGCAATT TGCGGCGGTG GAAGGTACGA CAATTTAATA GAGCAGATAG GTGGACCATC TATTGCTGGA ATTGGTTTTG CAATTGGTGT TGAAAGACTT TTGATGCTGC TTGAGCAAAA TGGTCTTCTT CCTGCAAGAT CGCAGGTGCC GAGAGTGTTT GTGGCAACAA TAGGGGAAAA TGGTATCAAA AAAGCTTTTG AGATTGCAAG GATGCTTAGA TTTGAAGGAA TTTCAACCGT AGTTGAAGAG ATGGGAAGAA GCTTAAAATC TCAGATGAAA TATGCTGATA AGATTGGCTG CGAGTTTTCT ATAATTATTG GCGATGATGA GATTGAAAAG GGTGTTTGCA AAGTTAGAAA TATGAAGACA TCTTCAGAGG AAATAGTAGA AATAGAAAAT ATTTGCGAGT ATTTAAAAGA AAAACTTCAA AAATAA
Protein sequence | MKIQAPKGTK DVLPEESYMW QYVENKFREV CKLYGYQEVR FPTFEYTELF QRGVGDTTDI VQKEMYTFLD KGGRSITLRP EGTASTARLF IEHGFASRPM PQRFYYIISA FRYENTQGGR FREFHQFGIE NFGSSSPVTD AEVISLAYNF FTSLGLDNIT VNINSIGCPV CRKEYVKNLK EYFSANSQKL CHTCHQRLDK NPMRILDCKE EGCKLITKDA PKPIDYLCDD CKSHFESVKT YLDSAMVSYK VDPFIVRGLD YYTKTVFEIV ATVSDKELAI CGGGRYDNLI EQIGGPSIAG IGFAIGVERL LMLLEQNGLL PARSQVPRVF VATIGENGIK KAFEIARMLR FEGISTVVEE MGRSLKSQMK YADKIGCEFS IIIGDDEIEK GVCKVRNMKT SSEEIVEIEN ICEYLKEKLQ K