Gene Information |
Locus tag | Nmar_1660 |
Symbol | |
ID | 5773051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1518432 |
End bp | 1519400 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641317314 |
Product | BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_001582994 |
Protein GI | 161529168 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAAA AACGCTCATC AAAAAATAAA AAATTTGTAT TAATTGTAAT TCCCATAATC ATAGTTGGAA TTCTTCTAGC TCTGCCTATA CAAAATAACC AGAATTCCCC AACCTCAAAA CTTCAGGGTC CATGGAATGA CATTCATGGT GTTGGAATGT TTCTTTCTGG CAATGATGAT ACCCTTTACC TTGCTACTCA TCAGGGATTG TTTGAGAAAA AGGATTCAGG GTGGCAGCAT GTTGGTAATG ACAATGCAGA TTTGATGGGA TTTTCTATGA ATCACGACAC GGAGAAAATG TATTCTAGTG GACATCCAAA AACAGGAGGG AATTTAGGAT TTAGAATGAG TGACGATAAA GGAAATTCAT GGACAACTAT TTCCAAAGTA AAAAATACCC CAGTTGATTT TCATGCAATG ACTGCAAGCC AAGCACAAAA TGGTTTGATT TATGGCTCTC CGGGAGGGGG AAGTGAACTC TTTGTAACAT CAGATGACGG AACATCTTGG AATTCCCTTG ATATTCCAAA TAAAATAATC TCACTTGCAG CAGACCCATT AGATCCTAAT CGCGTATATG CAGGAACAAT GTCCGGACTG TATGTCAGTA ACAATCAGGG AAAACAATGG ACTGCAGTTG ATTCAGACAT TGAGAAAGGA GTGATTACAG GAATAGGATT TTCATCGGAT GGCAAAACCA TGTATGTATT TTCCACATTA GATGGAAATG GAATGATTGT AAAATCAATT GACGGGGGGA AAACAATGGT TAAGACACAA AGCCAAATTG CTGATGCCAA GGGTGTTTGG AATTTTGCCC CGGGTCGTGA TGGGGAAATT TATGCAATAG CAGCACAACA AGTAGCAAGC GGATTGGCAA TGAGTGTTTA CAAAACAGAT GATGGCGGGG CTACGTGGGT TTTAGAGGGC ACAAATAATT CAGAACTAGC ACTGACTGAC GAATCTTAA
|
Protein sequence | MRKKRSSKNK KFVLIVIPII IVGILLALPI QNNQNSPTSK LQGPWNDIHG VGMFLSGNDD TLYLATHQGL FEKKDSGWQH VGNDNADLMG FSMNHDTEKM YSSGHPKTGG NLGFRMSDDK GNSWTTISKV KNTPVDFHAM TASQAQNGLI YGSPGGGSEL FVTSDDGTSW NSLDIPNKII SLAADPLDPN RVYAGTMSGL YVSNNQGKQW TAVDSDIEKG VITGIGFSSD GKTMYVFSTL DGNGMIVKSI DGGKTMVKTQ SQIADAKGVW NFAPGRDGEI YAIAAQQVAS GLAMSVYKTD DGGATWVLEG TNNSELALTD ES
|
| |
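The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1518432
END_BP = 1519400
GENE_LENGTH_BP = 969
PROTEIN_LENGTH_AA = 322


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: the span covers 969 bp, which is 323 codons, leaving 322 residues once the stop codon is excluded.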