Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1129 |
Symbol | |
ID | 5773436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1031846 |
End bp | 1033210 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641316772 |
Product | hypothetical protein |
Protein accession | YP_001582463 |
Protein GI | 161528637 |
COG category | [C] Energy production and conversion |
COG ID | [COG3794] Plastocyanin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.32866 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCGT TATTTGTAGA GGAGAAATCC TCTGCAAATT TTACAAATGT AAAGTTGCTT GAAAAAAATT TAAAGTTTTT ATGTTGTAAT AATTTACAAC ACCTTGTGAA AACGCTAGCA ATCTTTTCAG TTTTAATGTT GTTTAGTATC ATTTCTTTAT CTCCTGCTTT TGCAGATCAT TCAGAAGTTA CAGTTGTACC TGCAGATGGT TCTGGCTCAC CTGGTTGTGA AAAAACTGCA GATGGATGTT ACATTCCAAG TACTGCAACA GTTGATGTTG GCGGTGTTGT AATCATGTCA AACACTGACA CTGCAGGACA CACATACACT TCAGGAAATC CTGAAGATGG ACCTGATGGT ATCTTTGATA GTAGTTTGTT GATGACTGGA AATTCATTTG AATGGAGTCC AGATGAAGTT GGTGAATATG ATTATTATTG TATGGTTCAT CCTTGGATGT TGGGAACAAT AATTGTTCAA GAAGTATCTG CAGAGGAGGA TGATGTGATG GAGCAACCCA TGAAGGCTGA AATGTTTGGT TGGGACAGAT TTGAATCAAT GCAAGATCCT GGCGTTGGCC ATGAAGAGCA TCAACTAGCA ATTTTGTTGG CTCCAAGCGA GAATACCTAT GCTGGAACAT TAAGATATGA TGCATCTGAA CCTATTCAAC TTGTAAGTTT GAGGGGTCCA CTTGGATCTG ATGAAACTGC TGGAAAAATT TGGACTCCTG ACGGTAAGAC TAAATTTGAG TTGACCCTAG TTGATCAAGA ATCTTCTTCT GGCGAATGGG ATTTTTCTGG AAATGCTCTA GCCGTTCATA CTTTTAATAC AAATCAGTTC GTAGTTGACG TTCAAATTGA TTATGAAGAA ATCCCTCCAC AAAAATCTAT GATGGAAGAA GACATGATGA AAGATGACTC TATGATGGAA CAAGAAACTA TGATGGCAGA TGATTCTGTG ATGGAAACAA CATCTGATGA AGCAGAGTCT AGTGGTGGTG GGTGTTTGAT TGCAACAGCA GCATATGGAA CAGAACTGGC ACCACAAGTT CAATTCTTAA GAGAAATTCG AGATAACACT GTAATGAGTA CGGCATCTGG TGCATCCTTT ATGACTGGTT TTAACCAATT GTATTATTCA TTCTCACCAA CAATTGCTGA TCTGGAACGA GAAAACCCAA TGTTTAAAGA ATCTGTACGA GCATTCATCA CACCAATGAT TTCAACATTG TCTATTATGA CATTGGCTGA AGATGGTTCA GAAGCAGAAG TTTTAGGATT GGGAATATCT GTTATTGCAC TTAACTTGGC AATGTATATT GCAGCACCTG CTGTTGTTGT ATGGCAAATC AAAAAGAGAA TTTAG
|
Protein sequence | MSSLFVEEKS SANFTNVKLL EKNLKFLCCN NLQHLVKTLA IFSVLMLFSI ISLSPAFADH SEVTVVPADG SGSPGCEKTA DGCYIPSTAT VDVGGVVIMS NTDTAGHTYT SGNPEDGPDG IFDSSLLMTG NSFEWSPDEV GEYDYYCMVH PWMLGTIIVQ EVSAEEDDVM EQPMKAEMFG WDRFESMQDP GVGHEEHQLA ILLAPSENTY AGTLRYDASE PIQLVSLRGP LGSDETAGKI WTPDGKTKFE LTLVDQESSS GEWDFSGNAL AVHTFNTNQF VVDVQIDYEE IPPQKSMMEE DMMKDDSMME QETMMADDSV METTSDEAES SGGGCLIATA AYGTELAPQV QFLREIRDNT VMSTASGASF MTGFNQLYYS FSPTIADLER ENPMFKESVR AFITPMISTL SIMTLAEDGS EAEVLGLGIS VIALNLAMYI AAPAVVVWQI KKRI
|
| |