Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1104 |
Symbol | |
ID | 5774147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1006999 |
End bp | 1009818 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316746 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001582438 |
Protein GI | 161528612 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAA ATAAATTAAA AATTCGTGGT GCACGTCATC ATAATTTAAA AAACATTGAC ATTGACATTC CAAAAAACAA ACTAGTTGTA ATTAGTGGAC TGTCTGGATC TGGTAAATCA ACTTTAGCAT TTGATACAAT ATATGCTGAA GGACAAAGAC GATATGTTGA ATCCCTTTCA GCATATGCTC GTCAGTTCTT GGAGATGATG GACAAACCTG ATGTTGACTC TATAGATGGA TTATCTCCTG CAATTTCAAT CCAACAAAAA ACAACTAGTA AAAACCCTCG TTCTACTGTT GGAACAACCA CTGAAATTTA TGATTACATG AGATTACTTT ATGCTAGAAT TGGTATTCCA TATTGTACTA ATTGTGGCAG AAAAATCTCT ACCCAATCAA TTGAAACCAT TTGTGATTCT GTGCTAAGAG AATTTTCTGG CAAAAAGATT CTGATACTGT CTCCTATCAT CCAGCGAAAG AAAGGAACCT ATGAGAAACT CTTTGAGCAG ATCAAAAAGG ACGGTTACTC TAGAATACGC CTAAATGGAG AAATTTTGAG CCTAGATGAG GAAATTCCTC CACTTGACAG GCAAAAATGG CACAATATTG AGATTGTAGT TGACAGAATA ACTACTGAAA AATCTGAACG CTCAAGATTG TTTGAGGCTA TTCAGACTGC AATAAAGGCA TCAAAAGGAG ATGTAATGGT TGCATCAGAA AAATCTGAAA AAGTCTTTTC TCAAAATAAT GCTTGTCCTT ATTGTGGATT AACAGTAGGT GAATTGGAAC CAAGAAATTT TTCATTCAAC TCTCCATTTG GCTTGTGTAA AGATTGCAAT GGCCTTGGTG TTAAAATGGA GTTTGATCCT GATTTAGTAA TTCCAGATAA AACAAAATCA ATTTTGGATG GAGCAATTGT TCCTTGGAGT GGAAGATTTT CTTCCTTTAG AAGACAAGCA TTAAGAGCAG TAGGCAAAAA ATTTGGCTTT GATTTGATGA CACCAATTAA CAAAATAAAA CCAAAACATC TCAAAATTAT TTTGTATGGA ACAGATGATC TAATTGATTT TAATTATCGT TCAAAATCTG GCGATTCTTC ATGGCAATAC ACTAATGCCT TTGAAGGTGT GTTGGATAAT CTTCAACGTG TTTTTATGGA AACTGATTCT GAATCAAAAC GTGAATGGTT AAAACAATTC ATGAGAGATA CTCCCTGTAA TGGATGTAAT GGAAAAAAAC TCAAACCTGA ATCACTTGCA GTAAAAATCA ATGATAAGGG AATCATGGAT GTCTGTGATT TGTCTATTGA CCATTGTTAT GATTTCTTTT CTACGCTAAA ACTAACTGAA AATGAACAAT ACATTGCAAG AGATGTTCTA AAGGAAATTA AAGAACGACT AGAGTTTTTG ATGAATGTTG GTTTGAATTA TCTTACTCTA AACAGATTAA GTTCTACATT ATCTGGTGGC GAATCTCAAA GAATTCGATT AGCTACACAA ATTGGTTCCA ATCTTACTGG TGTTTTGTAT GTGTTAGATG AACCAACTAT CGGTCTTCAC CAAAGAGATA ATGCCCGACT CATTAAAACG CTAAATAAGC TACGAAATTT GGGAAATACT GTTATTGTAG TGGAGCATGA CGAAGAAGTT ATACGAAATT CAGACTGGAT GGTTGATTTG GGACCTGGAG CCGGTGTAAA TGGTGGTAGT GTTGTTTTTG AAGGAACTGT CAACCAAATT CTCAATGGTC ACAAATCTGT AACTGGTGAT TATCTTAAAG ATAATTCTTT AATCATGTTG CAAGACAAAA TTCGAAATAA TTCTGGAACA CTTGTTGTAG AAAAGGCATC TGAAAACAAT CTCAAAGATA TTGATGTTGA AATTCCATTA GGACTTTTTG TTTCTATTAC TGGTGTTTCG GGCTCTGGAA AATCTACTCT GATTAATGAC ATTTTACTAA AAGCATTAGA AAATCATTTT TACAAATCTA ATGTCAAACC TGGAACTCAC AAAAAGATTG TTGGTTTAGA GAATATAGAC AAAGTCATTG CAATTGATCA ATCTCCAATT GGACGAACAC CTCGTTCAAA TCCAGCAACA TACATTGGTG CATTTACTCC CATTAGAGAA TTGTATGCAA ATACTGAACT ATCAAAAGAA CGTGGATATG CTCCAGGACA ATTTTCATTT AATGTTGCAG ATGGAAGATG TTTTGCATGT GATGGTGATG GTGTAAAACA AATCGAGATG CAGTTTCTTT CTGATGTGTA TGTAAAATGT GATGAATGTA AAGGAAAACG TTACAACTCT GAAACATTGT CTGTTCTTTA CAAAGGGAAA AACATCTCAG ATGTTTTGGA TATGACTGTG TATGAAGCTC TAAACTTTTT TGAAAACATT CCTGCAATTA AACGAAAATT ACAAACAATT TACGATGTTG GTTTGGGATA TATCAAATTA GGCCAATCTT CTACAACTCT ATCTGGTGGT GAGGCTCAAA GAGTGAAACT AGCCTCAGAA TTATCAAAAC GAGGAACTGG AAAGACCATG TACATCTTAG ATGAGCCTAC AACTGGTCTC CACTTTGCTG ATGTACAAAA ATTACTGGAT GTCTTAAACA GACTGGCCAA TCTTGGAAAT ACTGTTGTAG TAATTGAACA CAATATGGAC GTCATCAAAA ACTCTGATTG GATAATTGAT TTGGGTCCTG AAGGTGGAGA TGAGGGTGGC AGAATTGTTG CCACAGGAAC CCCAAAAGAC ATTGCAAAAG CTCCTGGAAG CTATACTGGA AAATATCTAA AGAAATTAGT TAAAAAATGA
|
Protein sequence | MTENKLKIRG ARHHNLKNID IDIPKNKLVV ISGLSGSGKS TLAFDTIYAE GQRRYVESLS AYARQFLEMM DKPDVDSIDG LSPAISIQQK TTSKNPRSTV GTTTEIYDYM RLLYARIGIP YCTNCGRKIS TQSIETICDS VLREFSGKKI LILSPIIQRK KGTYEKLFEQ IKKDGYSRIR LNGEILSLDE EIPPLDRQKW HNIEIVVDRI TTEKSERSRL FEAIQTAIKA SKGDVMVASE KSEKVFSQNN ACPYCGLTVG ELEPRNFSFN SPFGLCKDCN GLGVKMEFDP DLVIPDKTKS ILDGAIVPWS GRFSSFRRQA LRAVGKKFGF DLMTPINKIK PKHLKIILYG TDDLIDFNYR SKSGDSSWQY TNAFEGVLDN LQRVFMETDS ESKREWLKQF MRDTPCNGCN GKKLKPESLA VKINDKGIMD VCDLSIDHCY DFFSTLKLTE NEQYIARDVL KEIKERLEFL MNVGLNYLTL NRLSSTLSGG ESQRIRLATQ IGSNLTGVLY VLDEPTIGLH QRDNARLIKT LNKLRNLGNT VIVVEHDEEV IRNSDWMVDL GPGAGVNGGS VVFEGTVNQI LNGHKSVTGD YLKDNSLIML QDKIRNNSGT LVVEKASENN LKDIDVEIPL GLFVSITGVS GSGKSTLIND ILLKALENHF YKSNVKPGTH KKIVGLENID KVIAIDQSPI GRTPRSNPAT YIGAFTPIRE LYANTELSKE RGYAPGQFSF NVADGRCFAC DGDGVKQIEM QFLSDVYVKC DECKGKRYNS ETLSVLYKGK NISDVLDMTV YEALNFFENI PAIKRKLQTI YDVGLGYIKL GQSSTTLSGG EAQRVKLASE LSKRGTGKTM YILDEPTTGL HFADVQKLLD VLNRLANLGN TVVVIEHNMD VIKNSDWIID LGPEGGDEGG RIVATGTPKD IAKAPGSYTG KYLKKLVKK
|
| |