Gene Information |
Locus tag | Nmar_1161 |
Symbol | |
ID | 5772936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1062348 |
End bp | 1063718 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316804 |
Product | hypothetical protein |
Protein accession | YP_001582495 |
Protein GI | 161528669 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAATCA AAGATATCAA GAAAATTATG TATAGCCATG ATTACCATGA ATTCCATGAA TATTCATATA CTAGTACACG CTCCACACCA TGCTTGCATC TGACGTATCT TGTATTGGCA TTTGCTCTGA TAATCCCGTC TTTTGCATAT GCCCAAGAAT CTGGTGATTT TCCAAACTGG TATACAGAAT TTGTAATCAT GTGGACGGAA AGACAAATTG CAGATGATGA ATTTATCTTG TCAACAGAAT ATCTAGCAAA CCAAAATTTG ATTTCAGTGC CATTGTTTGA TCCTACAAGT CCGTCCGCCG GAATTCCTGA TTGGATAAAA AATACTGCAG ACTGGTGGAC AGACGGCATT GTGTCTGATT CCGAGTTTCT TTCAAGCTTG GAATTTTTGG TTGATACAAA TGTAATTAAA AAAACATCGT ACCAATCTGA TTCTCTGTTT GATAATTTTA CAATTGATGT TGAGAAATTA GAAAACGGTG AGAAAGTTAC ATGGACATCA AAGGATAGTT TGTCCCATAC AATAACAAGC GGTACTCCTG CAAAACACAC ACCGTTGTTT TCTTCTGGAA TTATTCACAA GGGAGAATCC GTATCCTATG TTCTTGAGAA TGGTGTCTAT TCTTTTTTCT GCATGATACA TCCCTGGGAA ACAGAAACTC TTACTGTACC TGTTGCATTG GAGAATTATC AAACGGCAGA ATCTGATTTT CCTGATTCCA AAACAGAAGA AACAACTTTA GAACAAAAAC AAGAAGAACA CAAAACGCTG CAAGAAAAAC AAGCAAAGAA AATTCTTGCA AATTTTGGCA GCACGTCATT GAACATTGTA TCTGTTTCCA ACTCTGACGG GCAATTACAC TTGGAATACA TGGAGGAATT TTTCACCAGA CACATAGATC AATTCAATCC TCAAATTGTA GAGAACATAA AATTAATGGA AAACGATCCT GAAAAATATC TTGATTTGGC TTTGGCTGAT GCAGGACTGT CTCGGGAATT TTATTCTGGG CAGATTTCAA TATGGGGTGA AATTTTCAAG GTAAAATCTG AGAACCTTGA TCAACAACAT GTTCGTTCTT TGGAGGAAAT GCAGTCTTTG GATTTGGAGG ATTCCAAAAA AGAATACTAT GTTGCAGAAA TAAACAAGGC TAAGCAGACA TTTGATGACA TGTTAGAGTC CTCCGGCCTA ACCATGATTC AGCCACTTGA GGAAAAAATT ACAAAATCTG AACATTCTGA CTCTTCTAAA GGTAAGATCC TTGAATCAAA TTTGTCAGAA AATGACCATA TTGGCAGAAT AATTGCAACA ATTTCTGATT TTATTACGTC CTTGTTATAT GGAATCAAAT CTTTGTTCTA A
Protein sequence | MKIKDIKKIM YSHDYHEFHE YSYTSTRSTP CLHLTYLVLA FALIIPSFAY AQESGDFPNW YTEFVIMWTE RQIADDEFIL STEYLANQNL ISVPLFDPTS PSAGIPDWIK NTADWWTDGI VSDSEFLSSL EFLVDTNVIK KTSYQSDSLF DNFTIDVEKL ENGEKVTWTS KDSLSHTITS GTPAKHTPLF SSGIIHKGES VSYVLENGVY SFFCMIHPWE TETLTVPVAL ENYQTAESDF PDSKTEETTL EQKQEEHKTL QEKQAKKILA NFGSTSLNIV SVSNSDGQLH LEYMEEFFTR HIDQFNPQIV ENIKLMENDP EKYLDLALAD AGLSREFYSG QISIWGEIFK VKSENLDQQH VRSLEEMQSL DLEDSKKEYY VAEINKAKQT FDDMLESSGL TMIQPLEEKI TKSEHSDSSK GKILESNLSE NDHIGRIIAT ISDFITSLLY GIKSLF
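
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state this, but the values are consistent with it) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split a sequence into 10-character blocks."""
    return "".join(seq.split())


if __name__ == "__main__":
    length = gene_length(1062348, 1063718)
    print(length, protein_length(length))
```

With the coordinates listed above, this reproduces the Gene Length (1371 bp) and Protein Length (456 aa) fields.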