Gene Nmar_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0107 
Symbol 
ID5774470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp97656 
End bp98852 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content33% 
IMG OID641315727 
Productpeptidase M50 
Protein accessionYP_001581445 
Protein GI161527619 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000498394 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGAACTTG ATTTTATCAC TCAAAATTCG ATTATCTATG TGTTGATGGC ATGGGTAGTA 
ATTGTTATTG TTGCAAAAGG ACTAAAATTA GAGAAACATG GTTTTGAAAT AAAAGCATAC
AGTCTAACTT ACAAAAACAA ACAAGTTAAC TCAGTACTGT TAAAACTTCT TAGTCGGACA
AGAAGAGGAA TTAGAGTTTT TGCAGATGTA AGTGTAATTT CAGGTTTTAT AATGATGGGT
TTTGCATTTT GGTTTTTGCT AAATAATGTT GCAAACTTTT TTGTTGCACA AACAGAATTT
TCAGAATTAA CTGTTCTTAT TCCAGGAGTT ACATTAACTT CAGCAGCATC AATTACATAT
TTTTTACTAT CAATTCCAGT TGTACTAGTA ATTCATGAGG GAGCACACGG TATTGTTGCA
GCATTAGAAA AGATAAAGAT CAAAACAGGA GGATTTGCAA TATTCATTGC AATGTTTGCA
GGCTTTGTAG AACCTGATGA GGAAGAATTC AACAAAGCAA AAAAGATCTC AAAACTCAGA
GTTATTGGAG CAGGAGCTAC ATCAAATGTA ATTTTTGCAT TTGCTTTAGG AGTAATTTTA
CTTACAAATC CATTTTTTGC AATGGTATTA CCTGAACCAC TATTAAGTAC ATTTTACGAA
TTACCAGAAG GAGTCCTAAT TCTTTCAATT ATTGAAAATT CAGGAGCAGA GCAAGCAGGA
TTACTTGCAA ATGACATCAT AACATCAATC AATGACAAGT CCATTCTAAG TCCGGCGGAT
TTCCCCAGTT TAAATCCAGG AGAGACAGCA AGTGTCTCTG TACTTAGAGA TGGACAACCA
TTGGACTTTA GTCTTGAAGT AATGCCAGCA CCAGATGATC CAGAAAGAGG ATTGATTGGA
ATTATGAGAG ATAATTCATT TGCATACAAG CCTATATTGA ATTTTATTGA ATGGAATGAT
CCAAACGTCT CAATGTTCCT CTTATGGTTA TGGATGATTT CATTTTTCAT TGGAATAATC
AATATGCTTC CATTACCAAT TTTAGATGGA GGTAAATTCA TTCATACGAT TATTGATCAA
AGAATTTCAG AAAAAGCAGT AAATGGAGTA ATGTGGGGAA TCTATGCGTT TACTTTTGCT
TTGTTTGGCC TAAACATTGC CCTCTCATAT GTAAAATCTG GTTGGTTTAC AATATAA
 
Protein sequence
MELDFITQNS IIYVLMAWVV IVIVAKGLKL EKHGFEIKAY SLTYKNKQVN SVLLKLLSRT 
RRGIRVFADV SVISGFIMMG FAFWFLLNNV ANFFVAQTEF SELTVLIPGV TLTSAASITY
FLLSIPVVLV IHEGAHGIVA ALEKIKIKTG GFAIFIAMFA GFVEPDEEEF NKAKKISKLR
VIGAGATSNV IFAFALGVIL LTNPFFAMVL PEPLLSTFYE LPEGVLILSI IENSGAEQAG
LLANDIITSI NDKSILSPAD FPSLNPGETA SVSVLRDGQP LDFSLEVMPA PDDPERGLIG
IMRDNSFAYK PILNFIEWND PNVSMFLLWL WMISFFIGII NMLPLPILDG GKFIHTIIDQ
RISEKAVNGV MWGIYAFTFA LFGLNIALSY VKSGWFTI