Gene Nmar_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1007 
Symbol 
ID5773093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp881616 
End bp882884 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content35% 
IMG OID641316646 
Producthypothetical protein 
Protein accessionYP_001582341 
Protein GI161528515 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCTTAG AATATCAACT AGCTGCACTT GCAGGTCTAA TTGGTCTTTC TGGTTTTTTC 
AGTGGTCTTG AAGTTGCACT TGTGGGTACA AGTCAAGCCA CAATTGAGAG ACTCGTCAAA
GATAATGTAA AGGGCGCAAA ATCTCTTCAG AAATTAAAGG CCAATCCTGG ATGGATGATG
TCTAGTGTCA ATCTTGGAAA TAATCTTGTC AATATAGGCT CTGCATCCCT TGCTACTATA
GTTGCAATTG AAATTTTTGG AGATAACGGA GTGGGAATTG CAGTTGGTAT TATGACTTTT
CTTGTAATAA TATTTGGTGA AGTAACTCCA AAAACTTATT GTAATGCAAA TGCCACAAAA
GTTGCTTTAC GATGCAGTAG AATTTTGTTA ACATTCAGTT ATGTTTTCTA TCCTGCAGTT
TGGATTCTTG AAAAGATAAC TCGTGGAATT ATCAAAATAA CTGGAAGTGA TTATCAACCT
CCTGCCCTAA CTGAGGATGA AATTAAAGGA ATTATTGCTC AGGGTCATAG AGATGAAGCT
TTAGAAAAAT CTGAACGAGA TTTACTTTAC GGTGCTCTCA AATTTGATGA TACTGTGATA
AGATCTGTAA TGATGCCAAG AACTAGAATG TTTAGTTTGC ATGGAGATAT GGAACTAATT
ACAGCTGCAG ATAAGATTCA CAAGAGTGGT CATTCTAGAA TCCCCATATA TGGAAAGGAT
CATGATGACA TACTTGGTAT TCTTCATGTA AGAGATATTC TCAAACATCT AAAAGATAAA
GAACTGCAAA AAATGAAACT ACGAGAATTT GTAAGAGAAC CAATCTATGT GTCTCAGGAA
AAACGAATGA GCGAACTTCT CAAACAAATG CAGGCAAAAA ATACCCATAT GGCCATAGTT
GTTGATGAAT TTGGTGGCGT TGAAGGCCTT GTTACTCTAG AAGATCTTAT TGAAGAGATA
GTTGGCGAAA TTCATGATGA GACTGATCTA AAGAGTCCTC ATTATCAAAA AATCAATAAT
GATGTAATTC TTGCAAATGG AGAAATTGAA ATAGACGAGA TTAATGAAAT CTTCAAATCC
AATCTTCCTA GAGGTGATGA TTATTCTACA TTAAATGGCT TGTTGCATGA GAAACTTCAT
GATATTCCTC AAGTTGGAAA TGTCATAAAC ATTGATGCAT TAGAAATCAA GGTTGAAAAG
GTTTCAAAAA ACAAACCTGT TTCCTTACGA ATTACTAAGA AAAAACCTCT TGAGGAGAAT
CTAGATTGA
 
Protein sequence
MSLEYQLAAL AGLIGLSGFF SGLEVALVGT SQATIERLVK DNVKGAKSLQ KLKANPGWMM 
SSVNLGNNLV NIGSASLATI VAIEIFGDNG VGIAVGIMTF LVIIFGEVTP KTYCNANATK
VALRCSRILL TFSYVFYPAV WILEKITRGI IKITGSDYQP PALTEDEIKG IIAQGHRDEA
LEKSERDLLY GALKFDDTVI RSVMMPRTRM FSLHGDMELI TAADKIHKSG HSRIPIYGKD
HDDILGILHV RDILKHLKDK ELQKMKLREF VREPIYVSQE KRMSELLKQM QAKNTHMAIV
VDEFGGVEGL VTLEDLIEEI VGEIHDETDL KSPHYQKINN DVILANGEIE IDEINEIFKS
NLPRGDDYST LNGLLHEKLH DIPQVGNVIN IDALEIKVEK VSKNKPVSLR ITKKKPLEEN
LD