Gene Nmar_0467 details


Gene Information

Locus tag: Nmar_0467
Symbol:
ID: 5773367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 420220
End bp: 421641
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 37%
IMG OID: 641316099
Product: hypothetical protein
Protein accession: YP_001581801
Protein GI: 161527975
COG category:
COG ID:
TIGRFAM ID:
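
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption, though it is consistent with the stated gene length) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 420220
end_bp = 421641
gene_length_bp = 1422
protein_length_aa = 473

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be a stop
# codon, which does not add an amino acid to the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 1422 474 473
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.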


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0035824
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGCCTT CAGCTTTTGC ACAATTTCAA TCTGGAGGTG TAGATATGCC TGGAACATGG 
TATGTTGGTG AAGGCCTCAA ACACGGCGAT TATTTTTCTT ACAATCTTTG TCATGTTGAC
TATAAGGAAT GTGCTGAATT TGGTCTAGAT ATGTGGATTA AAGGAGATAT TCAATCTGGC
AGTGAAACAA AGTGGTTAGC TGAAGTTGTT GTATATGATG GCAACAAAGT TGTTGTCGGT
GAAATGGAAT TAGGAAAGAT TGCTCCTGAA CCAACTGGTG GAAGTGAAGA ACTAGGTGTG
TATAGAGGTG CATTCAAATC TTCAGTTGCA TGGTTATCTG CATTTGCAAC ATCTGATTCT
GGAACTAGTG GAAAAGGACC AAAGGCATTT AGTGCAACTT CATGGGGAAA GATTGGAAAC
ATTGGTGGAG AACAAGTACT TCCCATGAAG ATTGAAACAA TTACTATTTC ATCTGGAACT
TGGGAAACTG TACAAATGGG ATGGCGTACT GGTGGACAAA CAAGCAAAGT TTGGATTGTA
GATGAATTTC CATTTCCTGT AAAAGCCCAT ACGTTAACTC ATGTTTCAGA GGGAATTCCT
CCAGCTGAAT ACAAATTTGA ATTACTAGAT TACAAAGAAA ACGTTTCAAC AAGCCCTTTT
TCAGGAATTG TATCTACTGT TGACACTTTT TCAGAACAAG GATGTGATAC TGATTTTGAA
CGAGATGTCA CTATAAAAAA ACCTACAAAC AACTTTGATT ATCAAATTCA TGTATTCTAT
GGACCTGAAG AACCAGTACA AGGTTGTGAG ATGCAATGGC TAATAAAATT TATCAGCAAA
TTTGACGATA CTGAATTTTT GAACCAAGTC CAATTTGATT TTCTAGTAGT AGATGATAAC
TTGACTCCAT TACGTTCAAT GGCTCAAGAT GAAGGAAGAC AGTATCTCTA CTCTCCATCT
GGACAATACA TTCTTGATAT GGTAGTCCAA GAACCACCTG GCAAAGTAAA CTATGTTATT
TGGGTTTATG GATTAGCTCC TGAAGGAATA GTACCTGGTT CTGCTGCAGA TTATTTACAA
ATTCCTGTAA CTGTTTTTGC CAGTGAAGGA AGTACTCCGG TAGTTTCTCC ACCTTTTGAA
ACAACATCTC AGGAAATACC TGAATGGATT AAGAATAATG CAGGCTGGTG GGCAGAAGGT
GCAATTGATG ATGGTTCATT TGTTCAAGGA ATTCAATTCT TAATCAAAGA AGGAATTATG
CAAATTCCTC CAACTACACA AGGAACTAGT TCTTCTAATG AAATTCCCTC TTGGATAAAG
CAAAATGCTG CATGGTGGGC AGAAGGTGCA ATTGATGATG GTTCATTTGT TCAAGGAATC
CAATTCTTAA TCAAAGAAGG AATAATGAGT ATCTCTTCTT AA
 
Protein sequence
MMPSAFAQFQ SGGVDMPGTW YVGEGLKHGD YFSYNLCHVD YKECAEFGLD MWIKGDIQSG 
SETKWLAEVV VYDGNKVVVG EMELGKIAPE PTGGSEELGV YRGAFKSSVA WLSAFATSDS
GTSGKGPKAF SATSWGKIGN IGGEQVLPMK IETITISSGT WETVQMGWRT GGQTSKVWIV
DEFPFPVKAH TLTHVSEGIP PAEYKFELLD YKENVSTSPF SGIVSTVDTF SEQGCDTDFE
RDVTIKKPTN NFDYQIHVFY GPEEPVQGCE MQWLIKFISK FDDTEFLNQV QFDFLVVDDN
LTPLRSMAQD EGRQYLYSPS GQYILDMVVQ EPPGKVNYVI WVYGLAPEGI VPGSAADYLQ
IPVTVFASEG STPVVSPPFE TTSQEIPEWI KNNAGWWAEG AIDDGSFVQG IQFLIKEGIM
QIPPTTQGTS SSNEIPSWIK QNAAWWAEGA IDDGSFVQGI QFLIKEGIMS ISS
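
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how the blocked text could be cleaned, its GC fraction computed, and the DNA translated. It assumes the standard codon assignments, which translation table 11 shares apart from its additional start codons (this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; drop a single trailing stop codon if present."""
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


# First line of the gene sequence above (60 bp, i.e. 20 codons).
first_line = "ATGATGCCTT CAGCTTTTGC ACAATTTCAA TCTGGAGGTG TAGATATGCC TGGAACATGG"
dna = clean(first_line)
print(translate(dna))      # MMPSAFAQFQSGGVDMPGTW
print(gc_fraction(dna))    # GC fraction of this 60 bp fragment only
```

The translated fragment agrees with the first 20 residues of the protein sequence listed above. Running the same functions on the full gene sequence would be one way to check the reported protein length and GC content.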