Gene Nmul_A0742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0742 
SymbolvalS 
ID3786566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp863347 
End bp866127 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content56% 
IMG OID637810824 
Productvalyl-tRNA synthetase 
Protein accessionYP_411441 
Protein GI82701875 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.279651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTCG AAAAAAGCTT TGATCCCCGG GCTATCGAAA GCCGCTGGTA TTCGCGCTGG 
GAAGGCGAAG GCTATTTCAG GCCTGGACCA GCAGGGACTG CGGGTGATCA AGCGCCTGCA
TACTGCATCA TGCTGCCGCC ACCCAACGTG ACCGGCACCC TGCACATGGG TCATGCCTTT
CAACATACCT TGATGGATGC GCTGACGCGT TATCACCGTA TGCGGGGAGA TAACACGTTA
TGGCAGCCGG GTACCGACCA TGCGGGAATT GCCACCCAGA TCGTGGTGGA ACGCCAACTG
GATCAGCAGA GCATTGACCG GCGAGATCTG GGACGTGAAG CGTTTCTCGC CCGCGTGTGG
GAATGGAAAG AGGAATCGGG CTCCACCATC AGCCGCCAGA TGCGCCGTAT GGGCGCCTCC
TGCGACTGGT CGCGCGAACG CTTCACCATG GATGGGGGAC TCTCCCGCGC CGTGACTGAA
GTTTTCGTGC GGCTTTATCG CGAAGGCCTC ATTTATCGCG GGGAACGCCT GGTGAACTGG
GATCCGGTGC TGCAAACTGC CGTTTCCGAC CTCGAAGTGG TATCGGCAGA AGAAGAAGGA
TCACTCTGGC ATATTCTTTA TCCCTTCGAA AACGATTTGG GGGGAAATCA GGATGGCGCA
AAACCGGAAG GGCTGATTGT TGCCACTACC CGCCCGGAAA CCATGCTGGG CGATATGGCG
GTAGCTGTGC ATCCTGACGA TGAGCGCTAC CGTCACCTGA TCGGCCGCCA CGTGCGCCTG
CCATTATGCG AGCGGAGCAT ACCCATCATC GCGGATGCCT ATGTCGACCC GGCATTCGGT
ACGGGGTGCG TCAAGATCAC TCCCGCGCAT GATTTCAACG ACTATCAGAT AGGACAACGG
CACAAACTCG TTCCGCTCGG CATTCTTACC CTGGATGGAA AGATCAATGA CCTGGCGCCC
GCCGAGTATC AGGGACTGGA TCGTTTTGCA GCCCGCAGGA AAATTGTCGC CGACCTGGAA
GAACAGAATC TGCTGGTTGA AACAAAACCG CACAAGCTGA TGGTGCCGCG CGGAGATCGC
ACTCAGGCTA TCGTTGAACC GATGCTGACT GATCAATGGT ATGTCACCAT GAACGGCCTG
GCGAGGCGGG GCCTGGAGGC GGTGGCAAGT GGAGAAGTGA AATTCATTCC GGAAAACTGG
GCGCACGTAT ATAACCAGTG GCTCGAAAAT ATCCAGGACT GGTGTATTTC CCGGCAGTTA
TGGTGGGGCC ATCGGATTCC CGCCTGGTAT GACGAGGACA ATAACATCTT TGTCGCACAT
AATCTGGAGG AAGCGCAGCG GCTGGCGGGG GGGCGCAAGC TGGTGCAGGA CGAGGATGTG
CTGGATACTT GGTTTTCATC CGCGCTGTGG CCATTCTCCA CCTTGGGCTG GCCGGAAAAA
ACGCCGGAGC TTGACACATT CCTGCCGACC TCGGTTCTGG TCACCGGCTT CGACATCATT
TTTTTCTGGG TAGCGCGCAT GGTGATGATG TCCCTGCATT TCACCGGCAA AGTACCTTTC
CGGGAGGTAT ATATCACCGG CCTCATCCGC GATGCGGAAG GGCATAAAAT GAGCAAATCC
AGAGGCAATG TGCTGGACCC GCTGGATCTT ATCGACGGCA TTGCACTTCC TGATCTGATC
ACCAAGCGCA CGAGCGGTCT GATGAACCCG CGGCAGGCCG AATCCATCGA AAAAATCACC
CGTAAGCAGT TTCCGGAGGG AATCCCGGCG TTCGGTGCCG ATGCGCTGCG CTTTACTTTT
GCAAGCCTCG CCTCTCATGG CCGCGACATC AAGTTCGATA TGCAGCGCTG CGAAGGTTAC
CGCAATTTCT GCAATAAGCT CTGGAATGCG GCACGCTACG TGCTGATGAA TTGCGAGGGT
AAGGATACCG GTTTAGTGGA ATCGGTGCCG CTGGAATATT CGGATGCCGA CTGCTGGATC
ATCGGCCGGC TGCAACAGGC GGAAACCGCG GTCGCGCAGG CTTATCAGGA CTACCGCTTT
GACATGGCGG CCCGCGAAAT CTATGAATTC GTCTGGGATG AGTATTGCGA CTGGTATCTG
GAGTTCGCCA AGGTACAGCT GAATTCCGGC AACGAAGTGG TACAGCGCAC GACGCGCCGC
ACCCTGGCCC GCGTCCTGGA AACGGCTTTA AGACTTGCCC ATCCCCTCAT CCCGTTCATT
ACCGAAGAGT TGTGGCAAAG CGTGGCTCCA CTGGCAGCCA AGCAAGGGGT GAGCATCATG
TTGCAGCCTT ACCCTCAAGC CGATCCCTCC AAGCTCGACG ACACCGCCAT CGGGAATATC
GCTGCACTCA AGGAAATGAT CAATGCCTGC CGCACGCTGC GCGGAGAAAT GAACCTCTCC
CCGGCATCGA GGGTACCCCT TCTGGCCGTA GGCGACGTAA AAACACTCGC CGGTTTTTCT
CCTTATCTGA AGGCGCTGGC CAAATTGTCG GATATCGAAA TCGAACAGGA TTTACCTCCC
GCCGAAGCAC CGGTCGCAAT CGTAGGTGAA TTCAGGCTGA TGCTGAAAAT CGAGATTGAT
ATTGCTGCTG AGCGTGAGCG GCTGACCAAA GAGCTCGACC GTGTCCAGAC GGAAATGGAA
AAGGCGCAAA CCAAGCTCGC CAACAGTAAT TTCGTGGATC GCGCGCCGGC AAAGGTAGTG
GAACAGGAAA AAGAACGCCT TGCCGGTTTC AGCACGACAT TGGGAAAACT GAAAGAGCAA
CTTCAGAAAC TGGGCTGCTG A
 
Protein sequence
MELEKSFDPR AIESRWYSRW EGEGYFRPGP AGTAGDQAPA YCIMLPPPNV TGTLHMGHAF 
QHTLMDALTR YHRMRGDNTL WQPGTDHAGI ATQIVVERQL DQQSIDRRDL GREAFLARVW
EWKEESGSTI SRQMRRMGAS CDWSRERFTM DGGLSRAVTE VFVRLYREGL IYRGERLVNW
DPVLQTAVSD LEVVSAEEEG SLWHILYPFE NDLGGNQDGA KPEGLIVATT RPETMLGDMA
VAVHPDDERY RHLIGRHVRL PLCERSIPII ADAYVDPAFG TGCVKITPAH DFNDYQIGQR
HKLVPLGILT LDGKINDLAP AEYQGLDRFA ARRKIVADLE EQNLLVETKP HKLMVPRGDR
TQAIVEPMLT DQWYVTMNGL ARRGLEAVAS GEVKFIPENW AHVYNQWLEN IQDWCISRQL
WWGHRIPAWY DEDNNIFVAH NLEEAQRLAG GRKLVQDEDV LDTWFSSALW PFSTLGWPEK
TPELDTFLPT SVLVTGFDII FFWVARMVMM SLHFTGKVPF REVYITGLIR DAEGHKMSKS
RGNVLDPLDL IDGIALPDLI TKRTSGLMNP RQAESIEKIT RKQFPEGIPA FGADALRFTF
ASLASHGRDI KFDMQRCEGY RNFCNKLWNA ARYVLMNCEG KDTGLVESVP LEYSDADCWI
IGRLQQAETA VAQAYQDYRF DMAAREIYEF VWDEYCDWYL EFAKVQLNSG NEVVQRTTRR
TLARVLETAL RLAHPLIPFI TEELWQSVAP LAAKQGVSIM LQPYPQADPS KLDDTAIGNI
AALKEMINAC RTLRGEMNLS PASRVPLLAV GDVKTLAGFS PYLKALAKLS DIEIEQDLPP
AEAPVAIVGE FRLMLKIEID IAAERERLTK ELDRVQTEME KAQTKLANSN FVDRAPAKVV
EQEKERLAGF STTLGKLKEQ LQKLGC