Gene Nmar_1477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1477 
SymbolvalS 
ID5774489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1345206 
End bp1347521 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content38% 
IMG OID641317125 
Productvalyl-tRNA synthetase 
Protein accessionYP_001582811 
Protein GI161528985 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCAA AAATTACAGA GAAAGCTTGG AATCCAGAGT TAGAGGCACA GGTCTTAAAG 
AAATGGCAGG ATTCAGACCT TTACAAATTC ACTCCAACTG ATGATAATTA CACCATAGAT
ACGCCTCCAC CATACCCATC AGGCAGGCCA TGGCACATTG GAGCTGCTGC TCACTATTCT
CAGATAGACA TGATTGCAAG AACTGCTCGC ATGAATGGCA AGAACGTCTA CTTTCCAATT
GGAATTGACA GAAATGGTTT GCCTGTAGAG TTCTACACTG AGAAAAAACA CAACATCAGA
ATGAGGGAAA CTGACAGAGG AGAATTCTTG GATTTGTGCC GAGAAGCACT AGATGACTTG
GAAGAAGAGA TGGTGTTGAT TATGACAAAC CTTGGTTTGA GTGGTGACTT GAAGAATCAT
TACAGAACAG ATTCAGAAGA ATATCGTGCA TTAACACAAT CAACATTCAT CAAACTATGG
AAAGAAGGAA AGATTGTTCT AGCTACTCGA CCTAACAACT ATGATTGGGT ATCTGGCACT
ACAATTGCTG ATGCTGAAAT TGCATACCAA GATTTGATGA CAAAGCTAGT TCACATGAAA
TTCAAGATTA AAGATATGGA TAAGGAAATC ATCATTGCAA GTACAAGACC AGAACTACTA
TGTGCATGTC GTACCATTAT TGTAAACCCT GAAGATGAAA GATATGCTCC ATTTGTTGGA
AGTAAAGTAG TTGTACCAAT TACAGGGGCG GAAGTTGAAA TCAAGACTCA CCATTCAGCT
CAGCAAGACT TTGGTTCAGG AGCTGTAATG GTTTGTAGTT ATGGTGATCA AAATGACGTT
GCATTGTTTA GAGAGATGGG ACTAGAAGAG ATTGTTGCAA TTGGTTTGGA TGGAAGGATG
ACTGAAGCTG CAGGAGAATA TGCAGGACTC AAACCAAAAC AAGCTAGAGA AAAAATTATT
GCAGATTTAG AATCAAAAGG ATTTGTAGAA AAAATCGAAG AGATTAATCA TAGAACACCA
ATTTCAGAGC GAAGTAAGAT TCCAATTGAG ATTATACCCA TGGAGGAGTA CTATCTCAAA
CAGAAAGATT CCATTGAAAA AATGCGAAAA TTAGGCTCAG AAATCACGTT TTACCCCTCT
ATGCACAAGC AGATTTTGAT GAATTGGCTT GATTCCATAT CCATTGACTG GCCAATCTCA
AGACGAAGAT ACTATGGAAC AGAGATTCCA ATTTGGTATT GTTCTAGTTG TAATGAACCG
CACGTCCCTG AACCAGGAAA GTACTATCAA CCATGGAAGG AGAAGTGTCC AATTGAAAAG
TGTACAAAGT GCGGTAATGG TGAATTTACT GGGGAGGATA GAACATTTGA TACATGGATG
GATTCTAGTG TATCGCCATT GTTTGTTACA AAGTATGATA AGGATTCAGA ATTCTTTGAG
AAAACTTATC CAACATCACT TAGACCTCAA GCAAAAGATA TTGTAAGGAC TTGGTTGTAT
TACACACTTT TGCGTTGTGA TATGTTAACT GGTAAAAAAC CATGGACTGA AGCTTGGATT
ATGGGTTATG GTTTAGATGA GAAAGGAATG AAGATGAGTA AAAGTAAAGG AAATGCAATT
GATCCATTAC CAGTAATAGA GAAATTTGGA GCTGACACAT TTCGATTCTG GAGTGCAAGT
GAGATTAATC ATGGTTATGA CTTTAGATGC AATGAGCAAA AGATAGAGTC TACAAAGAAA
TTCTTGAGCA AACTATGGAA TGTCTCAAGA TTCTTGTCAA GCTTTCCTGT AATCAAAGAA
GGAAAACCAA CTACCGTTGA CAAGTGGATT TTGTCTGAAT TGAACAATTT GATTAAAGAT
TGCAAGAAAG GATATGATGA ATACAACTTC TTTGTTCCAG CAATTGCAAT CAGGGAATTT
ACTTGGAATT TGTTTGCAGC TCATTACATC GAAATGGTCA AAAAACGAGC CTACGGAGAA
GGATTTTCTG AAGAGGAACG AGATGCAGCC ATTTACACAC TACACAAAGT ACTCTCCACA
GTCTTGAAAT TATTGGCACC AATTACACCA TTTATCACAG AGCATTTGTG GCAGACACTA
TACTCTGAGA CTAGTATTCA TACTGAAAGA CAAGCTGAAT CTGAAAAGGT AGAAGATTAT
TCCAAAACCA CTGAAGAAGT TGTACAGTTT AACTCTAAAG TATGGAATGA GAAAAAGGCA
AAAGAATTGT CATTAAAGGA CTCTATTGCA GTTGAAGTTC CTGAAAGTTT GTCAGAGTTT
AAGAAAGATT TGCAATCAAT GCACAATCTT GAGTGA
 
Protein sequence
MDPKITEKAW NPELEAQVLK KWQDSDLYKF TPTDDNYTID TPPPYPSGRP WHIGAAAHYS 
QIDMIARTAR MNGKNVYFPI GIDRNGLPVE FYTEKKHNIR MRETDRGEFL DLCREALDDL
EEEMVLIMTN LGLSGDLKNH YRTDSEEYRA LTQSTFIKLW KEGKIVLATR PNNYDWVSGT
TIADAEIAYQ DLMTKLVHMK FKIKDMDKEI IIASTRPELL CACRTIIVNP EDERYAPFVG
SKVVVPITGA EVEIKTHHSA QQDFGSGAVM VCSYGDQNDV ALFREMGLEE IVAIGLDGRM
TEAAGEYAGL KPKQAREKII ADLESKGFVE KIEEINHRTP ISERSKIPIE IIPMEEYYLK
QKDSIEKMRK LGSEITFYPS MHKQILMNWL DSISIDWPIS RRRYYGTEIP IWYCSSCNEP
HVPEPGKYYQ PWKEKCPIEK CTKCGNGEFT GEDRTFDTWM DSSVSPLFVT KYDKDSEFFE
KTYPTSLRPQ AKDIVRTWLY YTLLRCDMLT GKKPWTEAWI MGYGLDEKGM KMSKSKGNAI
DPLPVIEKFG ADTFRFWSAS EINHGYDFRC NEQKIESTKK FLSKLWNVSR FLSSFPVIKE
GKPTTVDKWI LSELNNLIKD CKKGYDEYNF FVPAIAIREF TWNLFAAHYI EMVKKRAYGE
GFSEEERDAA IYTLHKVLST VLKLLAPITP FITEHLWQTL YSETSIHTER QAESEKVEDY
SKTTEEVVQF NSKVWNEKKA KELSLKDSIA VEVPESLSEF KKDLQSMHNL E