Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1358 |
Symbol | |
ID | 5773805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1244511 |
End bp | 1245950 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641317003 |
Product | beta-lactamase domain-containing protein |
Protein accession | YP_001582692 |
Protein GI | 161528866 |
COG category | [C] Energy production and conversion |
COG ID | [COG0426] Uncharacterized flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.345311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGATA CTGCTGTGGC AAAAAACAAT AAAAGATATG ACAAATTCCA AAAATACATG AAAAATAATT TTTTGGTATT TAGCTTTTCA CTATTTAGTT TGGTTGCTGT ATTGGCGGTT TCTCTTGTGT CGTATTCGTA TGCAGATGAT GTCAGGATAC ATGAAAAACA TATTCCATAC AACAATGTAT GTGCACCAGG ATTTGCATCT CTTGGGGACC TCTGTGTACT AAATGACAGA TGCGGTCCAA GTGTCTATGC TGGAAAGGTT TGTGTGATGG ATGGTGTCAA ACAACCATAC CTTAGACCCA CTCAGCAAGG AAATGCTGGA ATTGCAGCTT CTGATGTAAT ATGTGCAGAA GGATTGAATC TGATCTTCAA GTCTCGTGAC GCCTCCCCTG CATGTGTCAC TCCTGATTCT GCAGATAAAC TTGAACAACG CGGATGGCAA ACACAGTATC CTGTAATTGC ATGTACACTA GAATATGCCC CCGTATGTGG AATTGACGGA AAGACATATG GCAACAAATG TGCCATAGAC TCAAGTCACG TTGCAGTCAA ACATGCAGGA GAATGTTCTG ATGTAATTCC AACGGAATTT GAATTAGATG AAACTGTGTT ACAACACACT CTGGTTCCTA TTCAAACTGA TCCTGATAAG GGATATGCTG TTCTAGAAAT TGCTGATGGT GTATACTGGC TAGTTGGAAG TGGCTACCAG ACAATGTTTC TAACTACCGG ACAAGGAGTT GTTGCAGTTG ATGCACCGCA ACCTATAGGC GAAAAGTATC TTAGTGCAAT AGATGAGGTA ACTGATGAAC CCATTACTCA TATGATCTAC TCGCATCATC ATCAAGACCA CACTGGTGCA GCTGGAGAAA TATTTTCCCA AGATATTACA TACATATCAC ACAAAGATGC TGCAGATGAA TTAAAATCTG AAAATAATCC TGACAGGCCT ATTCCAACTC AAATCCTAGA AGGAGACTTT AACACTCTTG AAATTGGCAA CAAAACTATC GAGTTTTACA ATCTTGGTGA CTTTCATTCC AAAGGTAATT TGTTAATTCT ACTTCCTAAC TATAAGGTTG CAATGCTAGT TGACCTTTTG CGTCCTGCAG AGTCCCCATA TCGTGCATTT GGAGTTACTC CTGATATTGA TCTTTACTTG GATACACATG ACACATTACA AAATTATGAT TTTGAGGTTT TGATTTCAGG TCACACAAAT CTTCTTGCAA CAAAAGACCA CATCAAAACA AACAAACAGT TTACACAAAG CGTCATGGAT AACGCACAAC TGGGACTAGA TTCTGTTGAT TCTGAAGCAT TGGATGTTTG TACAACACTT ACCATTGAAC AATGGGAAGG CAAACTTGGA AATCTTGATG CATTCATGGA TGATCATTGC AATGCCATGA TTGAATACCT GACGCAATAA
|
Protein sequence | MQDTAVAKNN KRYDKFQKYM KNNFLVFSFS LFSLVAVLAV SLVSYSYADD VRIHEKHIPY NNVCAPGFAS LGDLCVLNDR CGPSVYAGKV CVMDGVKQPY LRPTQQGNAG IAASDVICAE GLNLIFKSRD ASPACVTPDS ADKLEQRGWQ TQYPVIACTL EYAPVCGIDG KTYGNKCAID SSHVAVKHAG ECSDVIPTEF ELDETVLQHT LVPIQTDPDK GYAVLEIADG VYWLVGSGYQ TMFLTTGQGV VAVDAPQPIG EKYLSAIDEV TDEPITHMIY SHHHQDHTGA AGEIFSQDIT YISHKDAADE LKSENNPDRP IPTQILEGDF NTLEIGNKTI EFYNLGDFHS KGNLLILLPN YKVAMLVDLL RPAESPYRAF GVTPDIDLYL DTHDTLQNYD FEVLISGHTN LLATKDHIKT NKQFTQSVMD NAQLGLDSVD SEALDVCTTL TIEQWEGKLG NLDAFMDDHC NAMIEYLTQ
|
| |