Gene Nmar_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1104 
Symbol 
ID5774147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1006999 
End bp1009818 
Gene Length2820 bp 
Protein Length939 aa 
Translation table11 
GC content35% 
IMG OID641316746 
Productexcinuclease ABC subunit A 
Protein accessionYP_001582438 
Protein GI161528612 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAA ATAAATTAAA AATTCGTGGT GCACGTCATC ATAATTTAAA AAACATTGAC 
ATTGACATTC CAAAAAACAA ACTAGTTGTA ATTAGTGGAC TGTCTGGATC TGGTAAATCA
ACTTTAGCAT TTGATACAAT ATATGCTGAA GGACAAAGAC GATATGTTGA ATCCCTTTCA
GCATATGCTC GTCAGTTCTT GGAGATGATG GACAAACCTG ATGTTGACTC TATAGATGGA
TTATCTCCTG CAATTTCAAT CCAACAAAAA ACAACTAGTA AAAACCCTCG TTCTACTGTT
GGAACAACCA CTGAAATTTA TGATTACATG AGATTACTTT ATGCTAGAAT TGGTATTCCA
TATTGTACTA ATTGTGGCAG AAAAATCTCT ACCCAATCAA TTGAAACCAT TTGTGATTCT
GTGCTAAGAG AATTTTCTGG CAAAAAGATT CTGATACTGT CTCCTATCAT CCAGCGAAAG
AAAGGAACCT ATGAGAAACT CTTTGAGCAG ATCAAAAAGG ACGGTTACTC TAGAATACGC
CTAAATGGAG AAATTTTGAG CCTAGATGAG GAAATTCCTC CACTTGACAG GCAAAAATGG
CACAATATTG AGATTGTAGT TGACAGAATA ACTACTGAAA AATCTGAACG CTCAAGATTG
TTTGAGGCTA TTCAGACTGC AATAAAGGCA TCAAAAGGAG ATGTAATGGT TGCATCAGAA
AAATCTGAAA AAGTCTTTTC TCAAAATAAT GCTTGTCCTT ATTGTGGATT AACAGTAGGT
GAATTGGAAC CAAGAAATTT TTCATTCAAC TCTCCATTTG GCTTGTGTAA AGATTGCAAT
GGCCTTGGTG TTAAAATGGA GTTTGATCCT GATTTAGTAA TTCCAGATAA AACAAAATCA
ATTTTGGATG GAGCAATTGT TCCTTGGAGT GGAAGATTTT CTTCCTTTAG AAGACAAGCA
TTAAGAGCAG TAGGCAAAAA ATTTGGCTTT GATTTGATGA CACCAATTAA CAAAATAAAA
CCAAAACATC TCAAAATTAT TTTGTATGGA ACAGATGATC TAATTGATTT TAATTATCGT
TCAAAATCTG GCGATTCTTC ATGGCAATAC ACTAATGCCT TTGAAGGTGT GTTGGATAAT
CTTCAACGTG TTTTTATGGA AACTGATTCT GAATCAAAAC GTGAATGGTT AAAACAATTC
ATGAGAGATA CTCCCTGTAA TGGATGTAAT GGAAAAAAAC TCAAACCTGA ATCACTTGCA
GTAAAAATCA ATGATAAGGG AATCATGGAT GTCTGTGATT TGTCTATTGA CCATTGTTAT
GATTTCTTTT CTACGCTAAA ACTAACTGAA AATGAACAAT ACATTGCAAG AGATGTTCTA
AAGGAAATTA AAGAACGACT AGAGTTTTTG ATGAATGTTG GTTTGAATTA TCTTACTCTA
AACAGATTAA GTTCTACATT ATCTGGTGGC GAATCTCAAA GAATTCGATT AGCTACACAA
ATTGGTTCCA ATCTTACTGG TGTTTTGTAT GTGTTAGATG AACCAACTAT CGGTCTTCAC
CAAAGAGATA ATGCCCGACT CATTAAAACG CTAAATAAGC TACGAAATTT GGGAAATACT
GTTATTGTAG TGGAGCATGA CGAAGAAGTT ATACGAAATT CAGACTGGAT GGTTGATTTG
GGACCTGGAG CCGGTGTAAA TGGTGGTAGT GTTGTTTTTG AAGGAACTGT CAACCAAATT
CTCAATGGTC ACAAATCTGT AACTGGTGAT TATCTTAAAG ATAATTCTTT AATCATGTTG
CAAGACAAAA TTCGAAATAA TTCTGGAACA CTTGTTGTAG AAAAGGCATC TGAAAACAAT
CTCAAAGATA TTGATGTTGA AATTCCATTA GGACTTTTTG TTTCTATTAC TGGTGTTTCG
GGCTCTGGAA AATCTACTCT GATTAATGAC ATTTTACTAA AAGCATTAGA AAATCATTTT
TACAAATCTA ATGTCAAACC TGGAACTCAC AAAAAGATTG TTGGTTTAGA GAATATAGAC
AAAGTCATTG CAATTGATCA ATCTCCAATT GGACGAACAC CTCGTTCAAA TCCAGCAACA
TACATTGGTG CATTTACTCC CATTAGAGAA TTGTATGCAA ATACTGAACT ATCAAAAGAA
CGTGGATATG CTCCAGGACA ATTTTCATTT AATGTTGCAG ATGGAAGATG TTTTGCATGT
GATGGTGATG GTGTAAAACA AATCGAGATG CAGTTTCTTT CTGATGTGTA TGTAAAATGT
GATGAATGTA AAGGAAAACG TTACAACTCT GAAACATTGT CTGTTCTTTA CAAAGGGAAA
AACATCTCAG ATGTTTTGGA TATGACTGTG TATGAAGCTC TAAACTTTTT TGAAAACATT
CCTGCAATTA AACGAAAATT ACAAACAATT TACGATGTTG GTTTGGGATA TATCAAATTA
GGCCAATCTT CTACAACTCT ATCTGGTGGT GAGGCTCAAA GAGTGAAACT AGCCTCAGAA
TTATCAAAAC GAGGAACTGG AAAGACCATG TACATCTTAG ATGAGCCTAC AACTGGTCTC
CACTTTGCTG ATGTACAAAA ATTACTGGAT GTCTTAAACA GACTGGCCAA TCTTGGAAAT
ACTGTTGTAG TAATTGAACA CAATATGGAC GTCATCAAAA ACTCTGATTG GATAATTGAT
TTGGGTCCTG AAGGTGGAGA TGAGGGTGGC AGAATTGTTG CCACAGGAAC CCCAAAAGAC
ATTGCAAAAG CTCCTGGAAG CTATACTGGA AAATATCTAA AGAAATTAGT TAAAAAATGA
 
Protein sequence
MTENKLKIRG ARHHNLKNID IDIPKNKLVV ISGLSGSGKS TLAFDTIYAE GQRRYVESLS 
AYARQFLEMM DKPDVDSIDG LSPAISIQQK TTSKNPRSTV GTTTEIYDYM RLLYARIGIP
YCTNCGRKIS TQSIETICDS VLREFSGKKI LILSPIIQRK KGTYEKLFEQ IKKDGYSRIR
LNGEILSLDE EIPPLDRQKW HNIEIVVDRI TTEKSERSRL FEAIQTAIKA SKGDVMVASE
KSEKVFSQNN ACPYCGLTVG ELEPRNFSFN SPFGLCKDCN GLGVKMEFDP DLVIPDKTKS
ILDGAIVPWS GRFSSFRRQA LRAVGKKFGF DLMTPINKIK PKHLKIILYG TDDLIDFNYR
SKSGDSSWQY TNAFEGVLDN LQRVFMETDS ESKREWLKQF MRDTPCNGCN GKKLKPESLA
VKINDKGIMD VCDLSIDHCY DFFSTLKLTE NEQYIARDVL KEIKERLEFL MNVGLNYLTL
NRLSSTLSGG ESQRIRLATQ IGSNLTGVLY VLDEPTIGLH QRDNARLIKT LNKLRNLGNT
VIVVEHDEEV IRNSDWMVDL GPGAGVNGGS VVFEGTVNQI LNGHKSVTGD YLKDNSLIML
QDKIRNNSGT LVVEKASENN LKDIDVEIPL GLFVSITGVS GSGKSTLIND ILLKALENHF
YKSNVKPGTH KKIVGLENID KVIAIDQSPI GRTPRSNPAT YIGAFTPIRE LYANTELSKE
RGYAPGQFSF NVADGRCFAC DGDGVKQIEM QFLSDVYVKC DECKGKRYNS ETLSVLYKGK
NISDVLDMTV YEALNFFENI PAIKRKLQTI YDVGLGYIKL GQSSTTLSGG EAQRVKLASE
LSKRGTGKTM YILDEPTTGL HFADVQKLLD VLNRLANLGN TVVVIEHNMD VIKNSDWIID
LGPEGGDEGG RIVATGTPKD IAKAPGSYTG KYLKKLVKK