Gene Nmar_0846 details


Gene Information

Locus tag: Nmar_0846
Symbol:
ID: 5774218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 745994
End bp: 747136
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 35%
IMG OID: 641316484
Product: phosphoribosylaminoimidazole carboxylase, ATPase subunit
Protein accession: YP_001582180
Protein GI: 161528354
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
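
The coordinates and lengths listed above agree with each other if the start and end positions are 1-based and inclusive and the gene length counts the stop codon. Both points are assumptions, since the record does not state its coordinate convention. A minimal Python sketch of that arithmetic, using hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(745994, 747136)
print(length)                  # 1143
print(protein_length(length))  # 380
```

Under these assumptions the computed values match the listed Gene Length (1143 bp) and Protein Length (380 aa).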


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0000154723
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACCAAAA TTTTAGGGAT CATAGGCGGA GGACAGCTTG GCATGATGAT AGCAGAGGCT 
GCCAAAAAGA TGCCTGAAAA TGTTTCAAAA ATCATAGTAT TGGATCCTAC AGAGAATTGT
CCTGCAGCTC AAGTAGGTGC AGAACAAATT GTTGCAGATT TCAAAGACAA AAATGCCATA
ATTGAACTCT CAGAAAAATC AGACATTATT ACATATGAAA TTGAATCAGG AGATAGCGAA
GTCTTAAAAT CCGTTGAAAA AAATGCAGAG ATTAACCCTT CTCCTGAAAC ATTACATACA
ATTCAAGACA AATTCTTACA AAAAACATTT CTTAAAGAAC ACGGTATCCC TGTTCCAGAG
TTTATAGAAA TTTCAAACAT TGATGATGTT AAAGAGGGAT TGAAAAAATT TGGATATCCT
GCATTGCTCA AAGCAAGACG TGATGCATAT GATGGACGAG GAAACTTTAA GGTAGACTCT
GAAGACATGG TCCAAACAGC ATATGATTAT TTCAAAGATC AAAAATTGAT GTTAGAAAAA
TTTGTGCCAT TCAAAATGGA AGTATCTGTA ATTGCAGCTA GAAACACTAA AGGCGAAATC
AAAACATTTC CTCTTGTAGA AAATATTCAT GAAGAAAATA TTTTGCGCGA AACAATTGCT
CCTGCAAGAG TTTCTGAAGA AATAACAAAG AATGCCGAAA AAATTGCCAG TCAAACTATG
GACGTACTAA AAGGTGCAGG AGTGTTTGGA ATTGAAATGT TTGTAACTCG AGATGATCAG
ATTGTAATTA ACGAAATTGC TCCTAGAGTT CACAATTCAG GACACCATAC TTTGGAATCT
AGTGAAACAT CACAGTTTGA ACAACATTTG CGTGCAATTT TAGGATTAGA TCTTGGAAGT
ACCAAACTTT TACGTCCTAC CATAATGTAC AATATACTTG GAACAAAAAC CTTTGAGGGA
GAATACAAGC CATTAGAAAT CCCCGAAGAA AATCTATTTC TCAAAATGTA TGGAAAGAAA
ATTTCTAAAC CCATGAGAAA ACTAGGTCAT TTTAATCTGA TATCTACAAA CAATGAATCA
GTTGAAGATC TATTGAAGAA ATTGGAATCC ATAAAACCTA GGGCAGCAGT TCAATCTATC
TGA
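
A GC content figure like the one in the Gene Information block can be recomputed from the gene sequence. The sketch below is one plausible way to do it, not necessarily how the database derived its value. It assumes the sequence is pasted in the page's layout of 10-base blocks separated by spaces, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, kept in the page's 10-base block layout.
first_line = "ATGACCAAAA TTTTAGGGAT CATAGGCGGA GGACAGCTTG GCATGATGAT AGCAGAGGCT"
print(len("".join(first_line.split())))  # 60
print(gc_percent("ATGC"))                # 50.0
```

Passing the full gene sequence to `gc_percent` gives the value to compare against the listed percentage, which is presumably rounded to the nearest whole number.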
 
Protein sequence
MTKILGIIGG GQLGMMIAEA AKKMPENVSK IIVLDPTENC PAAQVGAEQI VADFKDKNAI 
IELSEKSDII TYEIESGDSE VLKSVEKNAE INPSPETLHT IQDKFLQKTF LKEHGIPVPE
FIEISNIDDV KEGLKKFGYP ALLKARRDAY DGRGNFKVDS EDMVQTAYDY FKDQKLMLEK
FVPFKMEVSV IAARNTKGEI KTFPLVENIH EENILRETIA PARVSEEITK NAEKIASQTM
DVLKGAGVFG IEMFVTRDDQ IVINEIAPRV HNSGHHTLES SETSQFEQHL RAILGLDLGS
TKLLRPTIMY NILGTKTFEG EYKPLEIPEE NLFLKMYGKK ISKPMRKLGH FNLISTNNES
VEDLLKKLES IKPRAAVQSI
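
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration with hypothetical names, not the database's own pipeline. It relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code; the two differ in which codons may act as alternative starts, which is not an issue here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGACCAAAA TTTTAGGGAT CATAGGCGGA"))  # MTKILGIIGG
print(translate("GTTCAATCTA TCTGA"))                  # VQSI
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Translating the whole gene sequence and comparing the result with the listed protein works the same way.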