Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2232 |
Symbol | |
ID | 8535396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2401907 |
End bp | 2404864 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646384612 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_003264094 |
Protein GI | 261856811 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGGGGC GCCCAAGCGC CGGTTTGTGT CGAAGCGGGT CAATTGAATC CGCCGCTGCC TTGCCACAAA CCGCCGCAGG TCGATCGACT CTTGCTACAA TAAGGTCCAT TTTCTCCATT TTTTCATTCG GCGATCTTTC CGTGACTGAT TACAAATCAA CGCTTAACCT GCCCGATACC CCTTTCCCCA TGCGCGGCAA CCTGCCGCAG CGCGAGCCCG AGCGTTTGGC CAAGTGGCAG CAACTGGAAT TGTATCGCGC GATTCGCGCC GTCAGTGCCG GTCGGCCCAA ATTCGTGCTG CACGATGGAC CGCCCTACGC CAACGGCGAT ATTCATATCG GTCACGCCGT GAACAAGGTG CTCAAGGACA TGATCGTCAA GTCCAAGCAG CTTGCAGGGT TTGATGCGCC GTATGTGCCG GGTTGGGATT GTCATGGACT GCCGATTGAA TTGATGGTCG AGCGTGAACA CGGCAAGCCG GGCGCCAAGC TTTCCGCGCA GGAATTCCGT GCTGCCTGCC GCGTTTACGC GCAAAGCCAG ATCGACCGCC AGATGGCCGA TTTTATCCGG CTGGGCGTGA TCGGTGACTG GGCCAATCCG TACAAAACGA TGGATTTTGC CACCGAGGCG GATATCGTGC GTGCGTTGGC GGGCATCATC GAGAACGGCC ATCTGCATCG GGGCGAGAAG CCGGTGCACT GGTGTGTGGA TTGCGGCTCG GCGCTGGCGG AAGCGGAGGT CGAATATCAG GACAAGAAAT CCGACCAGAT CGACGTGGCC TTTGCGGCCG TGGATTCCGC CGCCGTGCAT CGTGCCTTCG GCATTGACAG CAGTGCACCG GTTTCGGTTG TCATCTGGAC GACTACGCCC TGGACGCTGC CTGCCAACCA GGCGGTGGCG ATCAATCCCG AACTGGATTA CGCCCTGATC GAATTGAGCG ACGGTCGCCG CATTATCCTG GCCGAAGACC TGCGTGTACC GGCACTGGCG CGAATGAAGC TCGAAGGGAC CGTACTGGGC ACCACACCGG GCGCGGAGCT GGAAGGCCTG CATTTACAGC ACCCGTTCCT GAATCGGCAA GTGCTGCTGA TTCTGGGCGA TCACGTGACG ACCGATGCGG GTACGGGCGC GGTGCATACG GCACCGGCGC ACGGTGAGGA CGACTTCAAG GTCGGCCAGC GTTATGGCCT GCCGGTGGAT AACCCGGTCG ATGGCAGCGG TCACTTCCTG CCGAACACCC CGTTGGTCGG TGGTCTGAAC CTCAAGGACG GTGGCGCAAA AATCCTCGAA ATCATTCAGG AATCCGGCGC ACTGCTGGCG CACAGTCGCT TCACGCACAG TTATCCGCAC TGCTGGCGCC ACAAGACGCC GCTGATTTTC CGCGCGACCG GCCAGTGGTT CATCTCGATG GAACAGAACG AACTGCGCGA GCAGGCCATG GCGGCGATCA AGAACGTCCG CTTCGTGCCC GAGTGGGGCG AAGCCCGGAT TGCGGGCATG ATCGAAAATC GTCCGGACTG GTGTATCTCA CGTCAACGTA CTTGGGGCGT GCCGATCGCG CTGTTCGTCG ACAAGCAAAC CGGCGAGCCG CATCCCGAAA CGCCACGCTT GATGCGCGCC GTGGCCGACC GGATCGAGCA GACTGGCATC GATGCTTGGT TTTCGCTTGA CCCGAAAGAG ATCCTTGGTG CCGATGCCGA CCGCTATGAA AAAGTCACCG ACACGCTGGA TGTGTGGTTC GATTCGGGCG TTACTCACGC GACCGTGCTC GATCGCCGCC CGGAGCTGAC CTGGCCGGCC GATCTGTATC TGGAAGGCTC GGATCAGCAT CGCGGCTGGT TCCAGTCGTC GTTGCTGACC GGTGTGGCGC TCAAGGGCGC GGCACCGTAT CGCGGTGTGC TCACCCATGG TTTTACGGTC GATGCGCAGG GTCGCAAGAT GTCCAAGTCG CTCGGCAACG TGGTCGCGCC GCAAAAGGTC ATCGACAAGA TGGGCGCGGA CGTGCTGCGC CTGTGGGTGG CCTCGACCGA TTTCTCTGGC GAGATGACCG TCTCCGATGA GATTCTGCAA CGGGCGGGCG ATGCCTATCG CCGCATCCGC AACACGGCGC GTTATTTATT GGCGAACATC AACGATTTCG ACCCGAACAC CGATTGCGTG GCGAACGAAG ACTTGCTGCC GCTGGATGCC TGGCTGCTCG ATCACGCCGC AGGGCTGCAT CAAGCCGTGC TGGCCGATTT CGAAACCTAC GACTTCCACA GCCTCGTGAC GCGGGTGCAT CATTTCTGCT CCATTGAGCT GGGCGCGTTC TACCTCGATA TCGTCAAGGA TCGCATCTAC ACCGGCCAGA AAAACGCGCC GATGCGCCGC TCGGCGCAAA CGGTGATGTG GCGCGTGATC GAAGCGCTCA CCCGCTGGAT GGCGCCCATC ACCTCATTCA CCGCCGATGA GCTGTGGGCG CATCTGCCCG CGTTGGCCGA GCCGCGCGCG CCTTCGGTGT TTCTGGCAAC GCATACGGAT GACCTGGCGG CCGTGCTCGA TGACACACAG CGCGCATTTT GGGCGAAGTT GATCGATCTG CGCGATGTGG TCAATCGCTA CGCCGAAGCA GCTCGAAACG AAAAGATCAT CAAAGCCAAC CTTTCTGCGA AGGTCACGCT ATTTGTCGAT GACGCACTGG CCGACTTCCT CAAACCCATC CGCGATGAGC TTCGATTTGT GCTGATTGTC TCCGAACTGG AGGTATTGCC GCTGGCGAAC GCACCGACTA CCGCCACGGT CGAGACCTTG CCGAGCGGCG AAAAAATGGC CGTGCAGATC GCCGCAAGCG AAGCGCCTAA GTGCGAACGC TGCTGGCATT TGCAGCCGGA TGTGGGCAGC CATGCCGCGC ATCCGACGCT CTGCGGACGC TGCATCGAAA ACGTCGACGG CGCGGGCGAA GCGCGGGTCT GGGCATGA
|
Protein sequence | MQGRPSAGLC RSGSIESAAA LPQTAAGRST LATIRSIFSI FSFGDLSVTD YKSTLNLPDT PFPMRGNLPQ REPERLAKWQ QLELYRAIRA VSAGRPKFVL HDGPPYANGD IHIGHAVNKV LKDMIVKSKQ LAGFDAPYVP GWDCHGLPIE LMVEREHGKP GAKLSAQEFR AACRVYAQSQ IDRQMADFIR LGVIGDWANP YKTMDFATEA DIVRALAGII ENGHLHRGEK PVHWCVDCGS ALAEAEVEYQ DKKSDQIDVA FAAVDSAAVH RAFGIDSSAP VSVVIWTTTP WTLPANQAVA INPELDYALI ELSDGRRIIL AEDLRVPALA RMKLEGTVLG TTPGAELEGL HLQHPFLNRQ VLLILGDHVT TDAGTGAVHT APAHGEDDFK VGQRYGLPVD NPVDGSGHFL PNTPLVGGLN LKDGGAKILE IIQESGALLA HSRFTHSYPH CWRHKTPLIF RATGQWFISM EQNELREQAM AAIKNVRFVP EWGEARIAGM IENRPDWCIS RQRTWGVPIA LFVDKQTGEP HPETPRLMRA VADRIEQTGI DAWFSLDPKE ILGADADRYE KVTDTLDVWF DSGVTHATVL DRRPELTWPA DLYLEGSDQH RGWFQSSLLT GVALKGAAPY RGVLTHGFTV DAQGRKMSKS LGNVVAPQKV IDKMGADVLR LWVASTDFSG EMTVSDEILQ RAGDAYRRIR NTARYLLANI NDFDPNTDCV ANEDLLPLDA WLLDHAAGLH QAVLADFETY DFHSLVTRVH HFCSIELGAF YLDIVKDRIY TGQKNAPMRR SAQTVMWRVI EALTRWMAPI TSFTADELWA HLPALAEPRA PSVFLATHTD DLAAVLDDTQ RAFWAKLIDL RDVVNRYAEA ARNEKIIKAN LSAKVTLFVD DALADFLKPI RDELRFVLIV SELEVLPLAN APTTATVETL PSGEKMAVQI AASEAPKCER CWHLQPDVGS HAAHPTLCGR CIENVDGAGE ARVWA
|
| |