Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0467 |
Symbol | |
ID | 5773367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 420220 |
End bp | 421641 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316099 |
Product | hypothetical protein |
Protein accession | YP_001581801 |
Protein GI | 161527975 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0035824 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGCCTT CAGCTTTTGC ACAATTTCAA TCTGGAGGTG TAGATATGCC TGGAACATGG TATGTTGGTG AAGGCCTCAA ACACGGCGAT TATTTTTCTT ACAATCTTTG TCATGTTGAC TATAAGGAAT GTGCTGAATT TGGTCTAGAT ATGTGGATTA AAGGAGATAT TCAATCTGGC AGTGAAACAA AGTGGTTAGC TGAAGTTGTT GTATATGATG GCAACAAAGT TGTTGTCGGT GAAATGGAAT TAGGAAAGAT TGCTCCTGAA CCAACTGGTG GAAGTGAAGA ACTAGGTGTG TATAGAGGTG CATTCAAATC TTCAGTTGCA TGGTTATCTG CATTTGCAAC ATCTGATTCT GGAACTAGTG GAAAAGGACC AAAGGCATTT AGTGCAACTT CATGGGGAAA GATTGGAAAC ATTGGTGGAG AACAAGTACT TCCCATGAAG ATTGAAACAA TTACTATTTC ATCTGGAACT TGGGAAACTG TACAAATGGG ATGGCGTACT GGTGGACAAA CAAGCAAAGT TTGGATTGTA GATGAATTTC CATTTCCTGT AAAAGCCCAT ACGTTAACTC ATGTTTCAGA GGGAATTCCT CCAGCTGAAT ACAAATTTGA ATTACTAGAT TACAAAGAAA ACGTTTCAAC AAGCCCTTTT TCAGGAATTG TATCTACTGT TGACACTTTT TCAGAACAAG GATGTGATAC TGATTTTGAA CGAGATGTCA CTATAAAAAA ACCTACAAAC AACTTTGATT ATCAAATTCA TGTATTCTAT GGACCTGAAG AACCAGTACA AGGTTGTGAG ATGCAATGGC TAATAAAATT TATCAGCAAA TTTGACGATA CTGAATTTTT GAACCAAGTC CAATTTGATT TTCTAGTAGT AGATGATAAC TTGACTCCAT TACGTTCAAT GGCTCAAGAT GAAGGAAGAC AGTATCTCTA CTCTCCATCT GGACAATACA TTCTTGATAT GGTAGTCCAA GAACCACCTG GCAAAGTAAA CTATGTTATT TGGGTTTATG GATTAGCTCC TGAAGGAATA GTACCTGGTT CTGCTGCAGA TTATTTACAA ATTCCTGTAA CTGTTTTTGC CAGTGAAGGA AGTACTCCGG TAGTTTCTCC ACCTTTTGAA ACAACATCTC AGGAAATACC TGAATGGATT AAGAATAATG CAGGCTGGTG GGCAGAAGGT GCAATTGATG ATGGTTCATT TGTTCAAGGA ATTCAATTCT TAATCAAAGA AGGAATTATG CAAATTCCTC CAACTACACA AGGAACTAGT TCTTCTAATG AAATTCCCTC TTGGATAAAG CAAAATGCTG CATGGTGGGC AGAAGGTGCA ATTGATGATG GTTCATTTGT TCAAGGAATC CAATTCTTAA TCAAAGAAGG AATAATGAGT ATCTCTTCTT AA
|
Protein sequence | MMPSAFAQFQ SGGVDMPGTW YVGEGLKHGD YFSYNLCHVD YKECAEFGLD MWIKGDIQSG SETKWLAEVV VYDGNKVVVG EMELGKIAPE PTGGSEELGV YRGAFKSSVA WLSAFATSDS GTSGKGPKAF SATSWGKIGN IGGEQVLPMK IETITISSGT WETVQMGWRT GGQTSKVWIV DEFPFPVKAH TLTHVSEGIP PAEYKFELLD YKENVSTSPF SGIVSTVDTF SEQGCDTDFE RDVTIKKPTN NFDYQIHVFY GPEEPVQGCE MQWLIKFISK FDDTEFLNQV QFDFLVVDDN LTPLRSMAQD EGRQYLYSPS GQYILDMVVQ EPPGKVNYVI WVYGLAPEGI VPGSAADYLQ IPVTVFASEG STPVVSPPFE TTSQEIPEWI KNNAGWWAEG AIDDGSFVQG IQFLIKEGIM QIPPTTQGTS SSNEIPSWIK QNAAWWAEGA IDDGSFVQGI QFLIKEGIMS ISS
|
| |