Gene Nmar_0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0229 
Symbol 
ID5774655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp201415 
End bp203196 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content35% 
IMG OID641315850 
Producttranslation initiation factor IF-2 
Protein accessionYP_001581563 
Protein GI161527737 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0532] Translation initiation factor 2 (IF-2; GTPase) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00491] translation initiation factor aIF-2/yIF-2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAAATTC GTCAGCCCAT AGTGGCAGTT CTTGGTCACG TAGACTCTGG CAAAACATCA 
CTTTTAGATA GAATTCGTGG AACAGGTGTT CAAGGTAGAG AAGCTGGAGG AATCACACAA
CATATCGGTG CTAGCTTTCT ACCTACTGAA ACAATCAAAG AAACCTGTGG TCCTTTATAC
AAGAAACTAG AACAATCAGA AAACAAAGTT CCCGGAATTT TAGTTATTGA CACACCAGGA
CACGAAGTCT TTACAAATCT TCGTTCTCGT GGAGGTTCTG CTGCTGATAT TGCAGTTTTA
GTAGTTGATG TTAATCGTGG ATTTCAACCA CAAACAAACG AAAGTTTGAA AATTTTACAA
AGTAGAAAAG TTCCTTTTGT TGTAGCTTTG AACAAATGTG ACCAAATTTC TGGATGGAGA
AAATCTGAAA CATCCTTTAT TTCACAAGCA ATCAAAGAAC AAGATGCATC TATTCAAGCT
GATCTTGATC AAAAAATCTA TGATGTAGTT GGCACACTTT CTGTATTAGG ATACCAATCT
GAAGCATTTT ATCGTGTAAA GGACTTTAAA TCTGAAATTG CAATCGTTCC AATCTCTGCA
CGTTCTGGTG TTGGAATTCC AGAATTACTT AGCGTTTTAG TGGGGTTAAC TCAACAATAT
CTCCAAAAAA GACTAAGTCA AGAAGAAAAA GATCCCCGTG GAATTGTTTT AGAAGTAAAA
GATGAAGTTG GATTAGGTCA AACTGCAAAT ATTATACTAA TTGATGGTTC AATCAAAAAA
GAAGACAGTA TAGTTGTTGC AAAACGTGAT GGTGTAATTG TCACAAAACC AAAAGCATTG
TTGTTGCCAA AAGCTCTTGA TGAAATGCGT GATCCTCGTG ATAAATTCAA ACCAACCCCT
CAGGTAGATG CCGCAGCCGG ATTAAAGATT GCATCCCCTG AACTAGAAGG AGTTCTTCCG
GGAAGTACTC TCTATGTTGC AAGAAATGAT GATGAGGTTA CAAAATACAC CAATCTCATA
GAATCTGAGA TGAAATCCAT GTTTGTAGAT ACTGAAACTA ATGGAATTAT TCTAAAATGT
GATACCATTG GTTCACTTGA AGCTATTGTT GAGATGCTAC GACGATCACA AGTACCAGTA
GCTAAAGCCG ATATTGGTCC TGTAAATAGA CGTGATGTAA TTGAAGCTAA AGCAATAAAA
GAAAAGGACA GACATTTGGG AATTGTTTTA GCATTTAATG TCAAGGTTTT ACCTGATGCA
AAAGAAGAAT CAGAAATTAG TCACATCAAA ATCTTTGAAG ACAAAGTAAT CTATAGTTTA
ATTGATAACT ATAATGCTTG GGTTGAAGAA GATACCGCAC ATCAAGAGGA TGCAATATTT
TCTGAATTAA CACCTGTTTC CAAGTTTACT TTTCTTAAAG GAATGGTGTT TAGAAATAAC
AATCCTGCAG TTTTTGGAAT TAGAATAGAT GTTGGAACAC TAAAACACAA AATTCCATTT
ATGAATTCTG ATGGACGAAG AATTGGAAAC ATCCACCAAC TTCAACATGA TAAAAAAACA
GTAACTTCAG CTAAAACAGG TGATGAAGTT GCATGTTCAG TTCAAGATGT AACCATTGGA
AGACAAATTT TTGAAGAAGA AGTGTTTTAC ACATTTCCTC CATCTCATGA AGCAAAACAA
TTACTAAACA AATTCATGCA TAAACTAAGT ACTGAAGAAC AAGAAGTACT AAATGAAATA
GTAGAAATTC AAAGAAAGAA AGAAGCAGCT TATGCTTACT AA
 
Protein sequence
MQIRQPIVAV LGHVDSGKTS LLDRIRGTGV QGREAGGITQ HIGASFLPTE TIKETCGPLY 
KKLEQSENKV PGILVIDTPG HEVFTNLRSR GGSAADIAVL VVDVNRGFQP QTNESLKILQ
SRKVPFVVAL NKCDQISGWR KSETSFISQA IKEQDASIQA DLDQKIYDVV GTLSVLGYQS
EAFYRVKDFK SEIAIVPISA RSGVGIPELL SVLVGLTQQY LQKRLSQEEK DPRGIVLEVK
DEVGLGQTAN IILIDGSIKK EDSIVVAKRD GVIVTKPKAL LLPKALDEMR DPRDKFKPTP
QVDAAAGLKI ASPELEGVLP GSTLYVARND DEVTKYTNLI ESEMKSMFVD TETNGIILKC
DTIGSLEAIV EMLRRSQVPV AKADIGPVNR RDVIEAKAIK EKDRHLGIVL AFNVKVLPDA
KEESEISHIK IFEDKVIYSL IDNYNAWVEE DTAHQEDAIF SELTPVSKFT FLKGMVFRNN
NPAVFGIRID VGTLKHKIPF MNSDGRRIGN IHQLQHDKKT VTSAKTGDEV ACSVQDVTIG
RQIFEEEVFY TFPPSHEAKQ LLNKFMHKLS TEEQEVLNEI VEIQRKKEAA YAY