Gene Information |
Locus tag | Nmar_1792 |
Symbol | |
ID | 5773381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1635808 |
End bp | 1637520 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641317448 |
Product | thermosome |
Protein accession | YP_001583126 |
Protein GI | 161529300 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
|
|
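The length fields in this record can be cross-checked against its coordinates. The sketch below is only an illustration and assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single 3-nt stop codon; both assumptions are consistent with the listed values but are not stated in the record itself. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1635808
end_bp = 1637520
stop_codon_nt = 3  # assumption: one terminal stop codon

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Drop the stop codon, then divide by 3 nucleotides per codon.
protein_length_aa = (gene_length_bp - stop_codon_nt) // 3

print(gene_length_bp)      # 1713
print(protein_length_aa)   # 570
```

Both results agree with the Gene Length (1713 bp) and Protein Length (570 aa) fields.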
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATGC AAGCTACTTC TAAAGGAAAT ATGCCCGTCG TATTGCTCAA AGAAGGTGGT TCAGAAACTA AAGGTAGAGA TGCACAAAAG AACAACATTG CTGCATCAAA GATAGTTGCT GAGATTGTTC ATACCAGTTT AGGTCCTAGA GGCATGGATA AGATGCTAGT TGATTCCCTA GGAGATGTTA CTATTACAAA TGATGGAGCA ACAATTCTCA AAGAAATTGA TGTTCAACAC CCAGCAGCAA AAATGCTCGT TGAGATTTCT AAAACAACTG ACAATGAAGT TGGAGATGGA ACAACTTCTG CCGTGGTTTT GGCAGGAGCC TTACTTGAGA ATGCCGAGTC ATTAATTGAT CAAGATGTTC ACCCAACAAT TATTGTTGAT GGTTATAGAA AGGCTGGAAG AAAAGCCAAG CAATTTCTTG AAAGTATATC TGATAAAATT TCTCCAAATG ATAAGAATAT TCTCAATAAA ATTGCAAAAA CTTCTATGCA AACAAAACTT GTTAGAAAAG ATTCTGATCA ACTTGCAGAC ATTATTGTAA AATCGGTTCT TGCTGTTGCA GAAAAAGACT CTGAAAGTTA TGATGTTGAT ATTGATGACA TTAAAGTTGA AAAGAAAGCA GGTGGTTCAA TAAAGGACTC TATGATTGTT CAGGGTATAG TTTTGGACAA AGAAATTGTT CATGGAGGAA TGCCTAGAAA GATCAATGAA GCAAAAATTG CTTTGATTAA CACTGCCCTA GAAATCAGTA AAACTGAAAC CGATGCTAAG ATTAACATTT CAAATCCTCA ACAATTGAAA TCATTTCTAG ATGAAGAAAA TAGAATGCTA AAAACAATGG TTGACAAAGT TATCGGTTCT GGTGCAAATG TAGTTTTGTG TCAAAAAGGA ATTGATGACA TGGCACAACA TTACTTGGCA AAAGCCGGAA TTATAGCTGT TAGAAGAATT AAGGAAAGTG ACTTGACAAA ACTAGCAAAA GCAACAGGCG CAAGAATAGT CACAAACCTT GATGATCTCT TTGAAAAAGA TCTTGGTAGT GCTGACCTTG TTGAGGAAAG AAAGATTGAG GAAGACAAAT GGGTATTTGT TGAAGGATGT AAACATCCAA AATCTGTAAC ATTACTTCTA CGTGGTGGCT CACAAAGAGT TGTTGATGAA GTAGAACGTT CTGTTCATGA CGCTTTAATG GTTGTAAAAG ATGTAATTGA AAAACCAGAA ATTGTAGCAG GTGGAGGCGC CCCTGAAACA TATGCCGCTA CAAAACTTAG AAACTGGGCT AAATCTCTTG AAGGTAGAGA ACAATTAGCT GCAGAAAAAT TTGCTGATGC TTTAGAAGCA ATTCCATTAA CACTTTCAGA GAATGCTGGA ATGGATCCAA TTGATACTCT TACCCTGTTG CGTTCAAAAC AACAAAAAGG TGAGAAATGG ACAGGTATTG ATGTTATGAA AGGAAAAATT GCTAACATGA AATCTAGTGA TATTATTGAA CCTCTTGCAG TAAAACTTCA GATTGTTTCA GCATCTGCTG AAGCTGCATG TATGATTCTT AGAATTGATG ATGTAATCGC TACTCAGAAA TCTGCTGGTG GTCCACCTGG TGGTGGAGAA GGCGGAATGC CACCTGGAAT GGGCGGAATG CCACCTGGAA TGCCTGGAAT GGGCGGTATG GGTGGAATGC CTGACATGGG CGGTATGATG TAA
|
Protein sequence | MAMQATSKGN MPVVLLKEGG SETKGRDAQK NNIAASKIVA EIVHTSLGPR GMDKMLVDSL GDVTITNDGA TILKEIDVQH PAAKMLVEIS KTTDNEVGDG TTSAVVLAGA LLENAESLID QDVHPTIIVD GYRKAGRKAK QFLESISDKI SPNDKNILNK IAKTSMQTKL VRKDSDQLAD IIVKSVLAVA EKDSESYDVD IDDIKVEKKA GGSIKDSMIV QGIVLDKEIV HGGMPRKINE AKIALINTAL EISKTETDAK INISNPQQLK SFLDEENRML KTMVDKVIGS GANVVLCQKG IDDMAQHYLA KAGIIAVRRI KESDLTKLAK ATGARIVTNL DDLFEKDLGS ADLVEERKIE EDKWVFVEGC KHPKSVTLLL RGGSQRVVDE VERSVHDALM VVKDVIEKPE IVAGGGAPET YAATKLRNWA KSLEGREQLA AEKFADALEA IPLTLSENAG MDPIDTLTLL RSKQQKGEKW TGIDVMKGKI ANMKSSDIIE PLAVKLQIVS ASAEAACMIL RIDDVIATQK SAGGPPGGGE GGMPPGMGGM PPGMPGMGGM GGMPDMGGMM
|
| |
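As a quick consistency check between the two sequences, the first 30 nt and the last 33 nt of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not a general translator: the `CODONS` dictionary and the `translate` helper are hypothetical and cover only the codons in these two excerpts. It assumes the sequence is shown on the coding strand (it starts with ATG and ends with TAA even though the gene lies on the minus strand) and relies on translation table 11 using the same amino acid assignments as the standard code for these codons.

```python
# First 30 nt and last 33 nt of the gene sequence above, spaces removed.
head_nt = "ATGGCTATGCAAGCTACTTCTAAAGGAAAT"
tail_nt = "GGTGGAATGCCTGACATGGGCGGTATGATGTAA"

# Partial codon table: only the codons that occur in the two excerpts.
# "*" marks the stop codon.
CODONS = {
    "ATG": "M", "GCT": "A", "CAA": "Q", "ACT": "T", "TCT": "S",
    "AAA": "K", "GGA": "G", "AAT": "N", "GGT": "G", "GGC": "G",
    "CCT": "P", "GAC": "D", "TAA": "*",
}

def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string codon by codon."""
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, len(nt), 3))

print(translate(head_nt))  # MAMQATSKGN
print(translate(tail_nt))  # GGMPDMGGMM*
```

The outputs match the first ten residues (MAMQATSKGN) and the last ten residues (GGMPDMGGMM) of the protein sequence, followed by the stop codon.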