Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1285 |
Symbol | |
ID | 5774054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1176884 |
End bp | 1177909 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316929 |
Product | aliphatic sulfonate ABC transporter periplasmic ligand-binding protein |
Protein accession | YP_001582619 |
Protein GI | 161528793 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0376066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTC GTTCGGTAAT TGCTGCAGGA ATTGGGGCAA TTATCGTATT TTCTGCACTT GGAATTGCTC TTAGCTCTAG TGATACTACC TATGAAAATA AAATTCGGAT TGCTTACTTT CCAAACATTG GCCATGCCAT TCCAATTGTA GGGATGGAAA AAGGATTCTT TGCAGAGCAT CTTGGTGATG ATGTAAAGAT TGAAACCAAA GTTTTTGATA GCGGACCTCA AGCAATAGAA TCTCTATTTG CAAACTCTAT TGACATTGCA TATGTCGGTC CTGGACCTGC AATTAATGGA TTTTTGAATT CTAATAATCA AAATGTAAAA ATTCTTGCTG GCGCTGCAAG CGGTGGTGCA AGTTTCATTG TACATCCTGA TTCTGAAATA AACACTGCAG ATGACTTTGC AGGAAAAAAG ATTGCTGCCC CTCAAATTGG AAACACACAA GATGTTTCAC TGCGTCATTT TTTGGCTGAA AACCAACTAA AGCCAGCTGA GAAAGGTGGA AACGTTGTTG TATATAATAT TCCAAACCCT GACATCTATA CTTTGTTTGT AAAAGGTGAC ATTGATGGTG CATGGGTTGC AGAACCTTGG GCAACAATTT TAGAAACCGA ACTTGATGGA AAAAGATTAT TCCATGAAGA AGAACTTTGG CCTGACAAAG AGTTTGCATC TGTTCTCTTA ATTGGAAATG TAGATTACAT TGATAAAAAC AGTGTAGTAT GGGCTGACTA TATTCGTGCA CATCATGAAA CGCAAATTTG GATTGAATCA AATCCTATAG AAACTAGAAA TGTTTTCAAT GACTTTCTTG ATTCTTACTT GGGACAATCA CTTTCTGATG ATGTTGTAGA TGTTGCACTA TCCAACATTA TGATAACTGC AGATCCAAAA CCAAACTCTG TGGTCTCATT TGCTGAAAAA GCAGATACTT TGGGATATCT TGGAAGAAAT GGATATGATT TGTCTGGAAT TTTTTACAGC TTTGATACAA ATTCTCTAGA GGAGGCCAGC ACGTAA
|
Protein sequence | MKIRSVIAAG IGAIIVFSAL GIALSSSDTT YENKIRIAYF PNIGHAIPIV GMEKGFFAEH LGDDVKIETK VFDSGPQAIE SLFANSIDIA YVGPGPAING FLNSNNQNVK ILAGAASGGA SFIVHPDSEI NTADDFAGKK IAAPQIGNTQ DVSLRHFLAE NQLKPAEKGG NVVVYNIPNP DIYTLFVKGD IDGAWVAEPW ATILETELDG KRLFHEEELW PDKEFASVLL IGNVDYIDKN SVVWADYIRA HHETQIWIES NPIETRNVFN DFLDSYLGQS LSDDVVDVAL SNIMITADPK PNSVVSFAEK ADTLGYLGRN GYDLSGIFYS FDTNSLEEAS T
|
| |