Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1597 |
Symbol | |
ID | 5774237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1459613 |
End bp | 1460944 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641317250 |
Product | UbiD family decarboxylase |
Protein accession | YP_001582931 |
Protein GI | 161529105 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATT TAAGAAATTA TATTTCAAAA ATAAAGAAAA ACAAAGAACT CAAAACTGTA AAAACTAAAG TTTCAACAAA ATATGAGATT GCAGGAATTA CTGCAAAAGT TGACGGTTCA CATGCTGTAC TATTTGAAAA TATCAAGGAA AGTGATTTTC ATCTAGTTGC AAATTTGGTT GGTACTCGAA AAAGATTTGC TCTAGCTGTT GGAGGTACTG AATATAATAT ACATGAAAAA GTCATCTCTG CTATCAAAAA GGCAAAAGCC CCAAAAATTA TCTCCTCTGG CAAATTTCAA GAAAATAGCT CAAAAAACCT CCTTTCCATG CCTATTGTTA CTCATTTTGA AAAAGAATCT GGCCCATTTA TCACCTCATC AATTGCATAT GCCAAAAACC CTGAAACTGG AAAACAAAAT TCATCTTTTC ATCGAATGAT GCCAATTGAT AAAACTCATT TTTCAATACG AATGGTTGAA GGACGTCATT TACATCGATG TTTTATTGAT GCTAAGGAAC ATGGCGAGGA TCTCAAGATT GCAATAACTG TTGGTGTTCA TCCTGCAATT TCTATTGCCG GAGCATATCA AGCAGAGTGG GGAAAAGACG AGATTGATAT TGCAAATTCT CTGTTGGGTG GAAAACTGAC TTTAACAAAA CTCCCATTTA CTGGACTAAA TATTCCATCA GGTTCAGAAA TCGTTATGGA AGGAAGAGTT CTTCAGGACA AAACACATCC TGAATGGATG GTCGAAATGC TTCAAACATA TGACCATGAA AGATCTCAAC CAGTTTTTGA ACTTGAAAAT ATGTATTTTA GAAATAATCC AATATTTCAT GATGTTTTGT CTGGTTATTC AGAACATAGA TTATTGATGG GTATGCCAAT TGAATCAAAA TTAAATGGTG ATTTGAAAAA AGCATTCAAG CAAACACAAC AAGTATCCAT GACAAATGGT GGATGTAATT GGCTACATGC AGTTGTACAA ATAAAAAAGA AACACGATTC CGATGCAAAA AAAATTATCA AAAAAACATT CGAATCTCAT CGTTCGCTAA AACAAGTAAC AGTAGTTGAT GAGGATATTG ATCCTAACAG TGCTGAGGCA GTAGAATATG CTATGGCCAC AAGATTCCAG GCAGACAAAG ATCTTATAAT TCTAAAAAAC GTGCGTGGCT CTAGCCTTGA TCCATCAAGT AATCAAAAGA AGTTACAGAC TGCAAAAATG GGTATTGATG CAACTAGATC ACTTTCAAAA CGTCCTGAAG GATTTGAATT GGCAAAAATC CCAAAAATCG ATAAAATTAA ACTCGAAAAA TATTTCAAAT AA
|
Protein sequence | MSDLRNYISK IKKNKELKTV KTKVSTKYEI AGITAKVDGS HAVLFENIKE SDFHLVANLV GTRKRFALAV GGTEYNIHEK VISAIKKAKA PKIISSGKFQ ENSSKNLLSM PIVTHFEKES GPFITSSIAY AKNPETGKQN SSFHRMMPID KTHFSIRMVE GRHLHRCFID AKEHGEDLKI AITVGVHPAI SIAGAYQAEW GKDEIDIANS LLGGKLTLTK LPFTGLNIPS GSEIVMEGRV LQDKTHPEWM VEMLQTYDHE RSQPVFELEN MYFRNNPIFH DVLSGYSEHR LLMGMPIESK LNGDLKKAFK QTQQVSMTNG GCNWLHAVVQ IKKKHDSDAK KIIKKTFESH RSLKQVTVVD EDIDPNSAEA VEYAMATRFQ ADKDLIILKN VRGSSLDPSS NQKKLQTAKM GIDATRSLSK RPEGFELAKI PKIDKIKLEK YFK
|
| |