Gene Nmar_0236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0236 
Symbol 
ID5773321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp207468 
End bp208751 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content32% 
IMG OID641315857 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001581570 
Protein GI161527744 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTTC CAAGAGGTAT GAAAGACTTT GAGGCTAATG AAAATTCAAA CATTGAACAT 
GTAAGAAATC ATTTCAAAAA ATTATCTAAT TTGTATGGCT TTTCATTCAT GGAACCCTCT
GTTTTAGAAT CCTTATCTAC ACTTGAAACC AAATCTGGTC CTGCAATAAG AGACGAAATC
TATTATTTCA AAGATAAAGG TGATCGTGAG GTTGCTTTGC GTTTTGATTT TACAATGGGT
TTGACAAGAT ATGCAACTGC TCAAAAATCA ATGAAACTTC CTGCAAAAAT TTCTGCATTT
GGAGGAGTAT TTCGATATGA TGAACCTCAA AAAGGAAGAT ATCGATATTT TCATCAATGG
GATGTTGAAG TTTATGGCAA GAAAACTTTA GAATCTGAGG CTGAAATTAT TGAATTAACC
TCACGATTGT TTGATTCTTT GTTGCTTAAG GATATTGTAA TTGACATTAA TCACCGAAAT
CTTGTTGAAT CATACATTAA CAAAATATTT GACATAACAG ATTCTGACAA AGTAACAGAT
ATTCTAAGAG CAATAGACAA AATTGCTAAA AAATCAAAAG ATGAAATCTT GACTGAATTT
ACTAAAAAAG GATACGACAC TGCCAAACTT GAAAAAATTC TAGAATTCTC TCAAGTAAAG
GGTACAATTT CTGAAGTTGA AAAAATTTTT GATACTACAC AACTAGAATC ATGGGATGAA
TTAAAACAAC TTTTTGATTC ATTAGAAAAT AGAGGAGTTT CCAACGTTCG AATTAATTTT
GGTATAGTCC GTGGGTTGGA TTATTATTCT GGAATGGTCT TTGAAGTTTT TGATAAAAAT
TCAAAACTTG GTGCATTAGC TGGTGGGGGA AGATATGATA CATTAACAAA AGCATTTGGA
AGAGATGATA TTGGTGCTAC TGGTGTTGCA GGTGGAGTTG AGAGAATTAT TCTTACAATG
CAAGAACAAA AAATAATTCC TGAAATTTTA CAAAATAGAG TTGCAGTTGT TTACATCAAT
GAAGAGATGC AAAAAGTTGC TCACTCCATC ACATCATTAT TGCGGTTAAA CAACATTCCT
GCTGATATTG ATCTTGCAGG ACGTAATCTA AAAAAACAGA TGGATAATGC TGGAACTGCT
AGATACTCTA TAATTGTTGG ACCTCAAGAA CTTGAAAATG GAAATGTTGT CTTGCATGAC
ATGAAAGATG GAAAAGAAGG TACTATCTCT CTAGAAAAAT TAACCGAAGA TCCTAATTCT
GTTCTTAATT TAGAAAAGCT CTAG
 
Protein sequence
MELPRGMKDF EANENSNIEH VRNHFKKLSN LYGFSFMEPS VLESLSTLET KSGPAIRDEI 
YYFKDKGDRE VALRFDFTMG LTRYATAQKS MKLPAKISAF GGVFRYDEPQ KGRYRYFHQW
DVEVYGKKTL ESEAEIIELT SRLFDSLLLK DIVIDINHRN LVESYINKIF DITDSDKVTD
ILRAIDKIAK KSKDEILTEF TKKGYDTAKL EKILEFSQVK GTISEVEKIF DTTQLESWDE
LKQLFDSLEN RGVSNVRINF GIVRGLDYYS GMVFEVFDKN SKLGALAGGG RYDTLTKAFG
RDDIGATGVA GGVERIILTM QEQKIIPEIL QNRVAVVYIN EEMQKVAHSI TSLLRLNNIP
ADIDLAGRNL KKQMDNAGTA RYSIIVGPQE LENGNVVLHD MKDGKEGTIS LEKLTEDPNS
VLNLEKL