Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1324 |
Symbol | |
ID | 5774146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1213048 |
End bp | 1214319 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316969 |
Product | integrase family protein |
Protein accession | YP_001582658 |
Protein GI | 161528832 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000000102488 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAGG TCAATAATCA GATCGAGCAA GAATTGTCAT TTGAGAACAT TTCACTAATC AAAAAATATG ACAGGGAGAT GGTATCACAA TCAATTGCCA TTGCAACACG TCAAAAGCAT CTAAGAACAT TGCTAACACT ATCAAAATTG CTAAAGAAAA ACTGGAAGGA TGTGACCAGG GATGATATTG ATGACTTGGT ATTTCTGATA ATGGACCAGT TTGCAGATGA AAGTGGTCAG GAAACACATT ACTCTTATGA TCACAAAAAG ATTCTCAAGA TTTTCTTTAG ATGGTACAAG CTTGGCTCTA GAGAATTTGT TCAAGTTGGA GATCCACCTG AGACAAAAAA TGTCAAGATG AAAAAAGTCA AAGACAAGAT TGCACGTGAA GACCTCCTAA ATGAAGAAGA CAGAATAAAG ATACTGTATG CATGTGGCGA GAATGCAAGA GACAGAGCTC TAATTGATTG TCATATGGAA GCTGGAACCA GACCAGGTGA GATTCTAAAT TTGAAGTTAA AACATGTAAA GTTTGACAAG CATGGTTGTG TACTTCAAGT GGACGGAAAG ACAGGAGCTA GAACAATTAG AATCGTAAGG GCTACTCCAA ACTTGGCTGC ATGGATTGCA GTACATCCAT ACAAAGATGA ACCTGAAATG CCATTATGGC CAAATATTAG CCATCATAAG AAAGGCAGTC CAATTACATA TGCTGCAGCA AGACAGATCT TACATAGAAG ATGCAAGATT GCAAATATCT CAAAACGTGT TTATCTGAAT TTATTTAGAC ATAGTGAGGC CACAACTACA GCAAACTTCA TGACTGAAGC TCAGATGAGA AAAAGACATG GATGGTCGTC TGACTCTAAA ATGCCTGCAA GATATGTCCA CTTGGTAAAT TCTGATGTGG AAGATGCAAT CTTCAAGCAC TATGGAATCA AAAAAGAAGA TGAAAAGATG CCAGAAATGC CTGTAAAGTG TCATTTTTGT GAAATGTACA ATCCATCAGA CAGCGTAACA TGTACAAAAT GTGGAAAACC ATTGAATCTT GAGAGTGCAA TAAAAAGAGA AGAGCAAGAA AATGCTGAAA AGAAAAAACT TGAAGAAAAG ATCAAGATGC TAGAGCAAAG ACAGATTGAA TCAGAAAAGA ATCAGAAAGG ATATTCAGAT TTAAAATCAA TTGTAGATGA ATATTTGAAA GAATACTTTG AGGACGTATT TGACAAGATA GAGTTTGTAA AGAATCAAAA ACAAAATAGT ATTACAAACT GA
|
Protein sequence | MKQVNNQIEQ ELSFENISLI KKYDREMVSQ SIAIATRQKH LRTLLTLSKL LKKNWKDVTR DDIDDLVFLI MDQFADESGQ ETHYSYDHKK ILKIFFRWYK LGSREFVQVG DPPETKNVKM KKVKDKIARE DLLNEEDRIK ILYACGENAR DRALIDCHME AGTRPGEILN LKLKHVKFDK HGCVLQVDGK TGARTIRIVR ATPNLAAWIA VHPYKDEPEM PLWPNISHHK KGSPITYAAA RQILHRRCKI ANISKRVYLN LFRHSEATTT ANFMTEAQMR KRHGWSSDSK MPARYVHLVN SDVEDAIFKH YGIKKEDEKM PEMPVKCHFC EMYNPSDSVT CTKCGKPLNL ESAIKREEQE NAEKKKLEEK IKMLEQRQIE SEKNQKGYSD LKSIVDEYLK EYFEDVFDKI EFVKNQKQNS ITN
|
| |