Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0742 |
Symbol | valS |
ID | 3786566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 863347 |
End bp | 866127 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810824 |
Product | valyl-tRNA synthetase |
Protein accession | YP_411441 |
Protein GI | 82701875 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.279651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTCG AAAAAAGCTT TGATCCCCGG GCTATCGAAA GCCGCTGGTA TTCGCGCTGG GAAGGCGAAG GCTATTTCAG GCCTGGACCA GCAGGGACTG CGGGTGATCA AGCGCCTGCA TACTGCATCA TGCTGCCGCC ACCCAACGTG ACCGGCACCC TGCACATGGG TCATGCCTTT CAACATACCT TGATGGATGC GCTGACGCGT TATCACCGTA TGCGGGGAGA TAACACGTTA TGGCAGCCGG GTACCGACCA TGCGGGAATT GCCACCCAGA TCGTGGTGGA ACGCCAACTG GATCAGCAGA GCATTGACCG GCGAGATCTG GGACGTGAAG CGTTTCTCGC CCGCGTGTGG GAATGGAAAG AGGAATCGGG CTCCACCATC AGCCGCCAGA TGCGCCGTAT GGGCGCCTCC TGCGACTGGT CGCGCGAACG CTTCACCATG GATGGGGGAC TCTCCCGCGC CGTGACTGAA GTTTTCGTGC GGCTTTATCG CGAAGGCCTC ATTTATCGCG GGGAACGCCT GGTGAACTGG GATCCGGTGC TGCAAACTGC CGTTTCCGAC CTCGAAGTGG TATCGGCAGA AGAAGAAGGA TCACTCTGGC ATATTCTTTA TCCCTTCGAA AACGATTTGG GGGGAAATCA GGATGGCGCA AAACCGGAAG GGCTGATTGT TGCCACTACC CGCCCGGAAA CCATGCTGGG CGATATGGCG GTAGCTGTGC ATCCTGACGA TGAGCGCTAC CGTCACCTGA TCGGCCGCCA CGTGCGCCTG CCATTATGCG AGCGGAGCAT ACCCATCATC GCGGATGCCT ATGTCGACCC GGCATTCGGT ACGGGGTGCG TCAAGATCAC TCCCGCGCAT GATTTCAACG ACTATCAGAT AGGACAACGG CACAAACTCG TTCCGCTCGG CATTCTTACC CTGGATGGAA AGATCAATGA CCTGGCGCCC GCCGAGTATC AGGGACTGGA TCGTTTTGCA GCCCGCAGGA AAATTGTCGC CGACCTGGAA GAACAGAATC TGCTGGTTGA AACAAAACCG CACAAGCTGA TGGTGCCGCG CGGAGATCGC ACTCAGGCTA TCGTTGAACC GATGCTGACT GATCAATGGT ATGTCACCAT GAACGGCCTG GCGAGGCGGG GCCTGGAGGC GGTGGCAAGT GGAGAAGTGA AATTCATTCC GGAAAACTGG GCGCACGTAT ATAACCAGTG GCTCGAAAAT ATCCAGGACT GGTGTATTTC CCGGCAGTTA TGGTGGGGCC ATCGGATTCC CGCCTGGTAT GACGAGGACA ATAACATCTT TGTCGCACAT AATCTGGAGG AAGCGCAGCG GCTGGCGGGG GGGCGCAAGC TGGTGCAGGA CGAGGATGTG CTGGATACTT GGTTTTCATC CGCGCTGTGG CCATTCTCCA CCTTGGGCTG GCCGGAAAAA ACGCCGGAGC TTGACACATT CCTGCCGACC TCGGTTCTGG TCACCGGCTT CGACATCATT TTTTTCTGGG TAGCGCGCAT GGTGATGATG TCCCTGCATT TCACCGGCAA AGTACCTTTC CGGGAGGTAT ATATCACCGG CCTCATCCGC GATGCGGAAG GGCATAAAAT GAGCAAATCC AGAGGCAATG TGCTGGACCC GCTGGATCTT ATCGACGGCA TTGCACTTCC TGATCTGATC ACCAAGCGCA CGAGCGGTCT GATGAACCCG CGGCAGGCCG AATCCATCGA AAAAATCACC CGTAAGCAGT TTCCGGAGGG AATCCCGGCG TTCGGTGCCG ATGCGCTGCG CTTTACTTTT GCAAGCCTCG CCTCTCATGG CCGCGACATC AAGTTCGATA TGCAGCGCTG CGAAGGTTAC CGCAATTTCT GCAATAAGCT CTGGAATGCG GCACGCTACG TGCTGATGAA TTGCGAGGGT AAGGATACCG GTTTAGTGGA ATCGGTGCCG CTGGAATATT CGGATGCCGA CTGCTGGATC ATCGGCCGGC TGCAACAGGC GGAAACCGCG GTCGCGCAGG CTTATCAGGA CTACCGCTTT GACATGGCGG CCCGCGAAAT CTATGAATTC GTCTGGGATG AGTATTGCGA CTGGTATCTG GAGTTCGCCA AGGTACAGCT GAATTCCGGC AACGAAGTGG TACAGCGCAC GACGCGCCGC ACCCTGGCCC GCGTCCTGGA AACGGCTTTA AGACTTGCCC ATCCCCTCAT CCCGTTCATT ACCGAAGAGT TGTGGCAAAG CGTGGCTCCA CTGGCAGCCA AGCAAGGGGT GAGCATCATG TTGCAGCCTT ACCCTCAAGC CGATCCCTCC AAGCTCGACG ACACCGCCAT CGGGAATATC GCTGCACTCA AGGAAATGAT CAATGCCTGC CGCACGCTGC GCGGAGAAAT GAACCTCTCC CCGGCATCGA GGGTACCCCT TCTGGCCGTA GGCGACGTAA AAACACTCGC CGGTTTTTCT CCTTATCTGA AGGCGCTGGC CAAATTGTCG GATATCGAAA TCGAACAGGA TTTACCTCCC GCCGAAGCAC CGGTCGCAAT CGTAGGTGAA TTCAGGCTGA TGCTGAAAAT CGAGATTGAT ATTGCTGCTG AGCGTGAGCG GCTGACCAAA GAGCTCGACC GTGTCCAGAC GGAAATGGAA AAGGCGCAAA CCAAGCTCGC CAACAGTAAT TTCGTGGATC GCGCGCCGGC AAAGGTAGTG GAACAGGAAA AAGAACGCCT TGCCGGTTTC AGCACGACAT TGGGAAAACT GAAAGAGCAA CTTCAGAAAC TGGGCTGCTG A
|
Protein sequence | MELEKSFDPR AIESRWYSRW EGEGYFRPGP AGTAGDQAPA YCIMLPPPNV TGTLHMGHAF QHTLMDALTR YHRMRGDNTL WQPGTDHAGI ATQIVVERQL DQQSIDRRDL GREAFLARVW EWKEESGSTI SRQMRRMGAS CDWSRERFTM DGGLSRAVTE VFVRLYREGL IYRGERLVNW DPVLQTAVSD LEVVSAEEEG SLWHILYPFE NDLGGNQDGA KPEGLIVATT RPETMLGDMA VAVHPDDERY RHLIGRHVRL PLCERSIPII ADAYVDPAFG TGCVKITPAH DFNDYQIGQR HKLVPLGILT LDGKINDLAP AEYQGLDRFA ARRKIVADLE EQNLLVETKP HKLMVPRGDR TQAIVEPMLT DQWYVTMNGL ARRGLEAVAS GEVKFIPENW AHVYNQWLEN IQDWCISRQL WWGHRIPAWY DEDNNIFVAH NLEEAQRLAG GRKLVQDEDV LDTWFSSALW PFSTLGWPEK TPELDTFLPT SVLVTGFDII FFWVARMVMM SLHFTGKVPF REVYITGLIR DAEGHKMSKS RGNVLDPLDL IDGIALPDLI TKRTSGLMNP RQAESIEKIT RKQFPEGIPA FGADALRFTF ASLASHGRDI KFDMQRCEGY RNFCNKLWNA ARYVLMNCEG KDTGLVESVP LEYSDADCWI IGRLQQAETA VAQAYQDYRF DMAAREIYEF VWDEYCDWYL EFAKVQLNSG NEVVQRTTRR TLARVLETAL RLAHPLIPFI TEELWQSVAP LAAKQGVSIM LQPYPQADPS KLDDTAIGNI AALKEMINAC RTLRGEMNLS PASRVPLLAV GDVKTLAGFS PYLKALAKLS DIEIEQDLPP AEAPVAIVGE FRLMLKIEID IAAERERLTK ELDRVQTEME KAQTKLANSN FVDRAPAKVV EQEKERLAGF STTLGKLKEQ LQKLGC
|
| |