Gene Nmar_1285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1285 
Symbol 
ID5774054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1176884 
End bp1177909 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content37% 
IMG OID641316929 
Productaliphatic sulfonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001582619 
Protein GI161528793 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0376066 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC GTTCGGTAAT TGCTGCAGGA ATTGGGGCAA TTATCGTATT TTCTGCACTT 
GGAATTGCTC TTAGCTCTAG TGATACTACC TATGAAAATA AAATTCGGAT TGCTTACTTT
CCAAACATTG GCCATGCCAT TCCAATTGTA GGGATGGAAA AAGGATTCTT TGCAGAGCAT
CTTGGTGATG ATGTAAAGAT TGAAACCAAA GTTTTTGATA GCGGACCTCA AGCAATAGAA
TCTCTATTTG CAAACTCTAT TGACATTGCA TATGTCGGTC CTGGACCTGC AATTAATGGA
TTTTTGAATT CTAATAATCA AAATGTAAAA ATTCTTGCTG GCGCTGCAAG CGGTGGTGCA
AGTTTCATTG TACATCCTGA TTCTGAAATA AACACTGCAG ATGACTTTGC AGGAAAAAAG
ATTGCTGCCC CTCAAATTGG AAACACACAA GATGTTTCAC TGCGTCATTT TTTGGCTGAA
AACCAACTAA AGCCAGCTGA GAAAGGTGGA AACGTTGTTG TATATAATAT TCCAAACCCT
GACATCTATA CTTTGTTTGT AAAAGGTGAC ATTGATGGTG CATGGGTTGC AGAACCTTGG
GCAACAATTT TAGAAACCGA ACTTGATGGA AAAAGATTAT TCCATGAAGA AGAACTTTGG
CCTGACAAAG AGTTTGCATC TGTTCTCTTA ATTGGAAATG TAGATTACAT TGATAAAAAC
AGTGTAGTAT GGGCTGACTA TATTCGTGCA CATCATGAAA CGCAAATTTG GATTGAATCA
AATCCTATAG AAACTAGAAA TGTTTTCAAT GACTTTCTTG ATTCTTACTT GGGACAATCA
CTTTCTGATG ATGTTGTAGA TGTTGCACTA TCCAACATTA TGATAACTGC AGATCCAAAA
CCAAACTCTG TGGTCTCATT TGCTGAAAAA GCAGATACTT TGGGATATCT TGGAAGAAAT
GGATATGATT TGTCTGGAAT TTTTTACAGC TTTGATACAA ATTCTCTAGA GGAGGCCAGC
ACGTAA
 
Protein sequence
MKIRSVIAAG IGAIIVFSAL GIALSSSDTT YENKIRIAYF PNIGHAIPIV GMEKGFFAEH 
LGDDVKIETK VFDSGPQAIE SLFANSIDIA YVGPGPAING FLNSNNQNVK ILAGAASGGA
SFIVHPDSEI NTADDFAGKK IAAPQIGNTQ DVSLRHFLAE NQLKPAEKGG NVVVYNIPNP
DIYTLFVKGD IDGAWVAEPW ATILETELDG KRLFHEEELW PDKEFASVLL IGNVDYIDKN
SVVWADYIRA HHETQIWIES NPIETRNVFN DFLDSYLGQS LSDDVVDVAL SNIMITADPK
PNSVVSFAEK ADTLGYLGRN GYDLSGIFYS FDTNSLEEAS T