Gene Nmar_0002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0002 
Symbol 
ID5773897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1325 
End bp2770 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content36% 
IMG OID641315619 
ProductDNA polymerase II small subunit 
Protein accessionYP_001581340 
Protein GI161527514 
COG category[L] Replication, recombination and repair 
COG ID[COG1311] Archaeal DNA polymerase II, small subunit/DNA polymerase delta, subunit B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGG AACTGTCCTT TGCATTAAAC TATGCATTAA ACAAAGGATT CCAGATTCAT 
CCAAATGCTT TTAAAATTTT AGAAAATGTC GATGTGAAAA AACTAGAGAA AATAATAAAG
GAAATTGTCA GAGAAAAAAC CAAACAGAAA TTATTTCAAA TCAATCAAGA TGATTTAGAA
ACATATCTTG GAATCAAAGA TGATCCATCC TTAGAAAATG AGGTCAAGGT TTTGCTGGAT
CCTACAGACA AGATAACCAC GGGAGAAGGA GTAAAGGGAT ACAATGCATT GTTTTCTAGT
AGGTTTAACA AACTAAAACG AATAATTTCA GACAGACCAG AGGCTCGAAT GTTAAAATCT
CTGACTTTTG TTAAAAATGA AAGATCTGAA GATGACATGT ATGTCTGTGG TTTGGTAACT
TCAAGAAATA GTGAGAGAAA TGTAACAAAG TTGGTTTTAG AGGATCCTTC AGGCTCATTT
GAGGGGATTA TTTTTGATAC TGAACTTCAA AAGACGGCAG ATACTCTCCT AGTAGACCAA
TTTGTTATGG CAAGAGTTAG TGTAGGAAAG AACTCTGGAT TCATCATAAA GGATTTGATT
TTTCCGGATA TTCCTGATCA GGCCACAAAC AAGTCAGAAT CTGAAGCCTA TGCCGTATTT
TTGTCTGATT TACATATTGG AAGTAAATAT TTCATGGAAG AAGAGTTAAC AGAGTTTTTC
TCTTGGATAT CTAGTCCCGA TCCTACAGCA AAAAAAATCC GATTTATTTT GATTGGAGGA
GATATGGTTG ATGGGGTTGG AATTTACCCA AATCAGAACA AGGAATTAGT TTGTCAAACC
ATAGAGGAAC AGCTTAAAAA AGTTGAAAGT TTGATTGATC AGATTCCAAA AAATATCAAG
ATAATCATAA TGCCTGGAAA CCATGATCCG GGTAGAAGGG CACTCCCTCA ACCAGCAATT
CCAAAGAAGT ATAATTCTGG ATTATGGGAT AGGGAAAATG TCATAATGGT TGGAAACCCT
GCTGTAGTCT CATTAAATGG TGTGAAAGTG ATGATGTTTC ATGGGCAAAG TATTGATGAT
ATTGTAAAGA CTACCCCAGG GCTAAGTTAT GACAAGCCAA CAAATGTGAT GAAGCACCTA
CTTAGGGCCA GACATCTAAG TCCAATTTAT GGCAGCCAGA CGCCAATTGC TCCTGAGATG
CAGGATCTTA TGGTAATTGA AGATATTCCA GATATCTTTC ATGTAGGCCA CGTCCACAAG
GCTCAGTTGG ATATGTACAA GGGTATATTG TTGGTTAATT CAGGCTCTTG GCAGAAACAG
ACACCATTTC AAGCAAGTGT TGGGATGACT CCAAATCCAG GTATTGCTTT ACTGGTTAAT
TTAAAGACCT TTCAGGTTTT CCACCAAAAT TACAATTCTA ATCTAGACAA TATCTTGCAA
AGTTAA
 
Protein sequence
MKKELSFALN YALNKGFQIH PNAFKILENV DVKKLEKIIK EIVREKTKQK LFQINQDDLE 
TYLGIKDDPS LENEVKVLLD PTDKITTGEG VKGYNALFSS RFNKLKRIIS DRPEARMLKS
LTFVKNERSE DDMYVCGLVT SRNSERNVTK LVLEDPSGSF EGIIFDTELQ KTADTLLVDQ
FVMARVSVGK NSGFIIKDLI FPDIPDQATN KSESEAYAVF LSDLHIGSKY FMEEELTEFF
SWISSPDPTA KKIRFILIGG DMVDGVGIYP NQNKELVCQT IEEQLKKVES LIDQIPKNIK
IIIMPGNHDP GRRALPQPAI PKKYNSGLWD RENVIMVGNP AVVSLNGVKV MMFHGQSIDD
IVKTTPGLSY DKPTNVMKHL LRARHLSPIY GSQTPIAPEM QDLMVIEDIP DIFHVGHVHK
AQLDMYKGIL LVNSGSWQKQ TPFQASVGMT PNPGIALLVN LKTFQVFHQN YNSNLDNILQ
S