Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1377 |
Symbol | |
ID | 8534533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1485655 |
End bp | 1488513 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646383768 |
Product | valyl-tRNA synthetase |
Protein accession | YP_003263258 |
Protein GI | 261855975 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAAA CCTACAATCC TGCCGAGATC GAAGCCCCTT GTTATGCGCG CTGGCAAGCG GGAGGCTACT TCTCTCCCGA TGCCAGTCTG CCCGCCGATG CACCGAATTA TTGCATCATG CTGCCGCCGC CGAATGTGAC CGGCCGCTTG CACATGGGTC ATGCCTTTCA GGATACCTTG ATGGACATGC TCACCCGTGT CCACAGGATG CAGGGCGAAC GTACGCTCTG GCAACCCGGC ACGGACCATG CGGGCATTGC CACGCAAATG GTGGTGGAGC GTCAGCTCGA GGCCGAGGGC AAAACGCGGC ATGATCTGGG CCGCGAGGCA TTCACCGAGC GCGTCTGGCA ATGGAAAAGC GAATCGGGCG GGTTTATCAC TGAACAGATG AAACGCCTGG GCGCATCCTG CGACTGGTCA CGCGAGCGCT TCACCATGGA TGATGGTTTG TCCGATGCGG TGCGCGAAGT TTTTGTGCGC TTGTTTGAAG ATGGGCTGAT TTATCGCGGT AAACGGCTGG TGAACTGGGA CCCGGTCTTG CATACCGCCG TATCCGATCT TGAAGTCATC AGCGAAGAAG AAACCGGCCA CCTGTGGCAT CTGCGTTATC CGCTCACCGA TGGCGGTGGT CATTTGATCG TCGCCACTAC GCGGCCAGAA ACCATGCTGG GCGATACCGC CGTGGCCGTT CATCCGGAAG ATGAGCGGTA TAAACATCTG ATTGGCAAAA CTATTACCTT GCCGCTGGTG GGGCGAGAGA TTCCGATCAT CGGTGATGAT TATGTCGATC CGGCCTTCGG CTCGGGCTGC GTGAAAATCA CGCCTGCGCA TGATTTCAAC GATTATGCCG TCGGACAACG ACACAACCTG CCCAAAATCA ACGTGTTGAC CATCGATGCG CGGATTCGCG AACTGCCGGA AATTATCGGC GGTGAAGAGG AGGGCGCCTT GCCTGCTCAC TACGCAGGTC TGGATCGCTA TGAAGCCCGT GATCGCATCA TCCATGATTT CAAAGAACTC GATTTATTGG AAAAGATCGA CGATCACAAG CTCATGGTGC CGCGCGGCGA CCGCAGTGGC GCGGTGATCG AGCCGATGCT GACCGACCAA TGGTTCGTCG ATTTGACCCG CGAAACTCAG GACGATGGCC GTCCCGGTGG GCTGGCCGCC ATTACGCGCC CAGCGCTTGA GGCCGTGCGC GGCGGCGATA TCAAGTTCGT GCCGGAAAAC TGGTCGAACA CCTATTATCA ATGGCTTGAG AATATTCAGG ACTGGTGCAT CAGCCGCCAG ATCTGGTGGG GGCACCGGAT TCCTGCGTGG TATGACGCGT CGGGCAGGGT GTATGTCGGG CGAGACGAAG CCGAAGTTCG GGCGAAATAC GATCTGGAAA ACACAGTGGT TTTAACGCAG GAAAATGACG TACTCGATAC CTGGTTCTCA TCTGCACTCT GGCCGTTTTC CACCTTGGGC TGGCCGCAGA ACACACAGGA ACTGGCGTAT TTTTACCCTA CTAGTGTGCT GGTCACCGGC TTTGACATCA TCTTTTTCTG GGTCGCGCGG ATGGTGATGA TGGGCAAGTA CTTCATGGGC GATGTGCCGT TTCGTGAGGT GTATGTGCAT GGCCTGATTC GAGACGCGCA AGGGCAGAAA ATGTCCAAAT CCAAGGGTAA CGTGCTCGAC CCGATTGACC TGATCGATGG CATTGATCTC GAATCGCTGG TTGCCAAGCG CACGGCTGGC CTGATGCAGC CCAAAATGGC AGCGAAGATC GAAAAAGACA CGCGCAAGGA GTTTGCCGAT GGCATTCCCG CTTTTGGTAC CGATGCCATG CGTTTTACCT TTGCTGCGCT GGCAACCACA GGGCGCGATA TTCGCTTCGA TTTGGGGCGC ATCGAAGGTT ATCGAAATTT CTGCAACAAA CTGTGGAATG CCAGCCGTTT TGTGATGATG CAATGCGAAG ATCAAGACAC GGGCCTCACC GATGCACCGG TGACCTTGAG CGATGCGGAC GAGTGGATTA TCGGCCGTCT GCAACAGGTC GAGGCGGAAG TTGCCAAGCA TTTTGCCGAC TATCGCTTCG ATCTGGCAGC CCAGACGCTG TATGAATTCA CCTGGAACGA ATACTGCGAC TGGTATCTTG AGTTCACCAA ACCAGCGCTC AAGGCAGACG ATGAAGCCGC GCAGCGCGGC ACCCGCCGCA CCTTGGTGCG TGTGCTCGAA GCGCTTTTAC GTTTGCTGCA TCCGATTATC CCGTTCATCA CCGAAACCAT CTGGCAGCGC TTGGCGCCGA TGGCATTGGT TGATGTGCAA TCAACCGATA GCATCCTTGG CCGCCCTTAT CCCGCATTTG ACGAAAGCAA GATCAATACG CAGGCTATCG AGTCGGTCGA ATGGCTGAAA CAGGTCATTT TGGGTGTGCG CCGTATTCGT GCCGAAATGG ACATTGCGCC CAGCAAGTCG CTCGACGTGC TGATAACGCA TGCCACCGTT GAAGAGATCG CACGATTCGA GCGGTTTAGT GCGCTGCTGA ATTCTGTCGG TCGGATTGGA AGTGTTACCG CATTGACCGC CCAAGAGGCC GTGCCCGAAG CGGCCATGGC ACTGGTAGGT GAGTTGCAAA TCCACATCCC GCTGGCTGGT TTGATCGACA AGCAGGCAGA ACTTGCGCGA CTCGATAGAG AAATCGAGCG GCTAACCAAG GAGCTGGAAA AAGCCAAAGC GAAACTCGCC AATCCGAAAT TCGCCGACAA AGCCCCGCCC GCCGTGGTGC AGCAAGAACG CGAGCGGGAA ACCAGCTTTC AGACGCAACT CCATGATTTG TCCGGTCAAC GCGCGCGTAT CAGCCAGATC AGCGGTTAA
|
Protein sequence | MEKTYNPAEI EAPCYARWQA GGYFSPDASL PADAPNYCIM LPPPNVTGRL HMGHAFQDTL MDMLTRVHRM QGERTLWQPG TDHAGIATQM VVERQLEAEG KTRHDLGREA FTERVWQWKS ESGGFITEQM KRLGASCDWS RERFTMDDGL SDAVREVFVR LFEDGLIYRG KRLVNWDPVL HTAVSDLEVI SEEETGHLWH LRYPLTDGGG HLIVATTRPE TMLGDTAVAV HPEDERYKHL IGKTITLPLV GREIPIIGDD YVDPAFGSGC VKITPAHDFN DYAVGQRHNL PKINVLTIDA RIRELPEIIG GEEEGALPAH YAGLDRYEAR DRIIHDFKEL DLLEKIDDHK LMVPRGDRSG AVIEPMLTDQ WFVDLTRETQ DDGRPGGLAA ITRPALEAVR GGDIKFVPEN WSNTYYQWLE NIQDWCISRQ IWWGHRIPAW YDASGRVYVG RDEAEVRAKY DLENTVVLTQ ENDVLDTWFS SALWPFSTLG WPQNTQELAY FYPTSVLVTG FDIIFFWVAR MVMMGKYFMG DVPFREVYVH GLIRDAQGQK MSKSKGNVLD PIDLIDGIDL ESLVAKRTAG LMQPKMAAKI EKDTRKEFAD GIPAFGTDAM RFTFAALATT GRDIRFDLGR IEGYRNFCNK LWNASRFVMM QCEDQDTGLT DAPVTLSDAD EWIIGRLQQV EAEVAKHFAD YRFDLAAQTL YEFTWNEYCD WYLEFTKPAL KADDEAAQRG TRRTLVRVLE ALLRLLHPII PFITETIWQR LAPMALVDVQ STDSILGRPY PAFDESKINT QAIESVEWLK QVILGVRRIR AEMDIAPSKS LDVLITHATV EEIARFERFS ALLNSVGRIG SVTALTAQEA VPEAAMALVG ELQIHIPLAG LIDKQAELAR LDREIERLTK ELEKAKAKLA NPKFADKAPP AVVQQERERE TSFQTQLHDL SGQRARISQI SG
|
| |