Gene Information |
Locus tag | Nmar_1133 |
Symbol | |
ID | 5774197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1036866 |
End bp | 1037891 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641316776 |
Product | kelch repeat-containing protein |
Protein accession | YP_001582467 |
Protein GI | 161528641 |
COG category | [S] Function unknown |
COG ID | [COG3055] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
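The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the last codon is a stop
    codon, so it does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Nmar_1133.
print(cds_lengths(1036866, 1037891))  # (1026, 341)
```

The result agrees with the Gene Length (1026 bp) and Protein Length (341 aa) fields of this record.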
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0981995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TATCATTTTT CTTAATTCTG TTAATTTTTC CTGTTTCAGA TATTTTTGCA GAAGAAGATT CAGAAGGGTG GAAAAGATTG GCAGACATGC CAGAAGTAAG GTCAGAGATG GAATCAGCTG CAATTGATGA AAAGATCTAT GTGGTGGGAG GCATAGCCAA TACAAATCAA GTATCAAATT CTGTTTTTGT TTTTGATACC AAAGATGAAT CATGGAGTAC TGGAACCCCA ATGCCAATAG AATTACATCA TGCTGGAACT GCAGCCCATG ATGGGAAGCT GTATGTTGTT GGAGGATACA TGAAAGGGTG GAGTCCATCA AACGCATTAC TAATTTATGA TTCTGTCAAA GATTCTTGGA GTCAGGGCAA GGATATGCCA ACTGCTCGCG GTGCACTGAC TGCAGAATTT GTAGACGGCA AGCTGTATGC AGTTGGAGGA TTCAATGAGA ATTCCCGTAC TGAAAATGAA GTGTATGATC CTGCAGACGA CTCTTGGGAG AAAATGGCCC CAATGCCTAC AGCCAGGGAA CACCTAGCAT CAGCAGTTCT AGACGGACAG TTGTTTGTCA TTGGTGGCAG GGCAGGACAG GTAAATTCTG ATGCAAACGA AATGTACGAC TATACCTCAG ATACTTGGAA AATATTAGAA CCACTTCCAA CTGCAAGAAG TGGATTGGCT GCATCTGTTA TTAGCGGAGC AGTTTTTGTT TTTGGGGGAG AAAGCTCACT AAGGACATTT GAAGAAAATG AAGCATACAT TCCTGAAGAA GGATGGTTTG CACAGCAACC AATGCCAATA CCAAGACATG GCTTAGCATC ATCAACTGTA GGGGACAACA TCTATCTTAT TGGCGGGGGA GTAGTTCCAG GTTTTAGCTT TAGTGGAATT ACTGAAAAAT ATCACAACAC AGTCGTTCCA GAATTCGGCG TCTTGTCAAT TGTGATTTTG GGAATCTCAA CTGTCATGAT AATTTTGTTT ACAAAGCCTA AATTTCAACA CATCATTCAG CAATGA
|
Protein sequence | MKKISFFLIL LIFPVSDIFA EEDSEGWKRL ADMPEVRSEM ESAAIDEKIY VVGGIANTNQ VSNSVFVFDT KDESWSTGTP MPIELHHAGT AAHDGKLYVV GGYMKGWSPS NALLIYDSVK DSWSQGKDMP TARGALTAEF VDGKLYAVGG FNENSRTENE VYDPADDSWE KMAPMPTARE HLASAVLDGQ LFVIGGRAGQ VNSDANEMYD YTSDTWKILE PLPTARSGLA ASVISGAVFV FGGESSLRTF EENEAYIPEE GWFAQQPMPI PRHGLASSTV GDNIYLIGGG VVPGFSFSGI TEKYHNTVVP EFGVLSIVIL GISTVMIILF TKPKFQHIIQ Q
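As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates a DNA string and computes its GC percentage. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), that the gene is on the + strand so no reverse complement is needed, and that the spaces in the sequence are display grouping only. The names `translate`, `gc_percent`, and `clean` are hypothetical helpers for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base display blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAAAAAAA TATCATTTTT CTTAATTCTG"
print(translate(prefix))  # MKKISFFLIL
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the Protein sequence and GC content fields.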