Gene Nmar_1490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1490 
Symbol 
ID5773245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1358772 
End bp1359884 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content38% 
IMG OID641317138 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_001582824 
Protein GI161528998 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.624592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTG ACGATTTTGT TGTGACTCCT TGGCACGTAG AAGGAGATAT CGATTATGAC 
AAGTTAATCA AGCAATTTGG CACTCAAAAG ATTTCTCCAG AACTGCTAGC ACGAATCCAA
AAAATTACTG GAGAGGATCA TTTCATGCTC AGACGTGGAA TCTTTTTCTC TCACAGAGAG
ATGAACAGAA TTTTAGATGA TTATGAGAAA GGCAACAAGT TCTTCCTATA CACAGGACGA
GGTCCATCAG GCCACACCCA CATTGGCCAC CTGGTTCCAT GGGTCTTTGC AAAATGGCTC
CAAGAAAAAT TTGATGTAAA CATGTATTTT CAATTAACAG ATGATGAGAA ATTTTTCTCA
AAACCAAATC TAACTTTGGA GGAGACAAAA AACTTTGCAT ATGAAAATGC TCTTGACTTT
ATTGCACTAG GTTTCAAACC AGAAAAAACA AAGATCATCA TCAACACAAG AAACATCCAA
ACGCTTTATC CAATTGCAGC TCAAGTTGCA AAGAAGATCA ATTTCTCAAA TACTAAAGCA
ACATTTGGAT TTACAAATGA AACCAACCTC GGAATGATAT TTTACACATC ACTCCAGTCT
GCTCCATGTT TCATAGAAGA CAAGCCAGTG CTGATTCCAC TAGGAGTTGA CCAAGACCCT
CACTTTAGAC TAACAAGAGA CATTGCACCA AAGATTGGAA AAGAAAAACC TGCATTAATC
CACAACATAA TGATTCCTGC ACTAGAAGGA CCTGGAGGAA AGATGTCAGC ATCTGATGAA
AACGGTACAG TCTACACGAC AGATGCGCCA AATGTTGTAA AGAAAAAGAT CAACAAGTAT
GCATTTTCTG GAGGACAGCC AGACTTGGAA CAACACAGAA AGCTTGGAGG AAATCCAGAC
ATTGATGTGT CATACCAGTA TCTCAGAATA TTCTTTGAGC CAGATGACAA CAAGCTAAAA
TCAATCTATG AAGATTACAA GTCTGGAAAA TTACTTTCTG GAGAACTAAA GGCAATTCTA
ATTGAAAAGA TGAACGAGTT CCTAGCAGTA CATCAAGAGA ATAGAGAAAA AGCTAAAGAC
AAGATAGACG AATTTCTTTT TGAAAACAAA TGA
 
Protein sequence
MSADDFVVTP WHVEGDIDYD KLIKQFGTQK ISPELLARIQ KITGEDHFML RRGIFFSHRE 
MNRILDDYEK GNKFFLYTGR GPSGHTHIGH LVPWVFAKWL QEKFDVNMYF QLTDDEKFFS
KPNLTLEETK NFAYENALDF IALGFKPEKT KIIINTRNIQ TLYPIAAQVA KKINFSNTKA
TFGFTNETNL GMIFYTSLQS APCFIEDKPV LIPLGVDQDP HFRLTRDIAP KIGKEKPALI
HNIMIPALEG PGGKMSASDE NGTVYTTDAP NVVKKKINKY AFSGGQPDLE QHRKLGGNPD
IDVSYQYLRI FFEPDDNKLK SIYEDYKSGK LLSGELKAIL IEKMNEFLAV HQENREKAKD
KIDEFLFENK