Gene Nmar_0975 details

Gene Information

Locus tag: Nmar_0975
Symbol:
ID: 5774771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 851328
End bp: 852485
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 36%
IMG OID: 641316614
Product: hypothetical protein
Protein accession: YP_001582309
Protein GI: 161528483
COG category: [S] Function unknown
COG ID: [COG1602] Uncharacterized conserved protein
TIGRFAM ID:
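
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record does not state either convention, although both are consistent with its numbers. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(851328, 852485)
print(length)                     # 1158
print(protein_length_aa(length))  # 385
```

Both results agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields.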


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.109067
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTCTG ACTCTCAGGA TATTCGCCGT TCAATTTTAA CAAAATGGCA TGAGACATTA 
TCAAAATATG GAAATTTGTT TTCATCTGAT TCAATAAGTG GTACTAGTCC TCCATCTGTA
TTTGTTGGGT CGTACAATTA TCCTAAGGTC TTTGTTGGTC CAATGGTTCC ACCAATTCAT
GGAGATACAA GTTTACTTGA CAGTCCTGAA AAATGGAAGG GAAAGTCTTT AGAAGAAATT
GTAAACTTTA GATTGAATTT AGTTCGTGGC ACACAAAAAC TATCTATCGA TAAAACTGAT
GGACGATACA TTGAAAATCT CCAAGAAGTA ACAATGTCTT CAAAACCAAC TGATTCTGAT
TTAATATTTC AAAAATCTGT ATCTTCAAAC ATTTCCCTTG ATGGAGAAAG TGCTCCATTT
GGTCCTGTTG GGGAAATCAA ATCTGCAAAA TTCTCTGGAA CCTCTTCTGT AAAGTCTATT
GAAAAGACAT ACTATGATAA AGATTTGAAG GCACAGGATG CTGTCATGAA CTTATACAAT
TCTGGAATTG ATATTTCAAA AATTCAAAAA TGCTTTAGCA TTGGAATGCT TGGCCAAAAA
AGAAAACTCG TTCCAACAAA ATGGAGTATT ACTGCAACTG ATGACATTAT ATCACAATCT
CTTGCTGACG AAGTATTAGA TTATGCCCTA ATTGACTCTT GTAAGGTCTT CTCATATTCT
CATTTGGGAA ATCATTTCTC TGTGGTTTTG TTCCCTCATA GATGGATATA CGAAATGGTT
GAGGCATGGT ATTCTAATGG AATTCTAGGG TTTGGCTCTG ATTTTGAGGA TGCCCGGGGT
ATTGACCATC CTCCTGCCAT AGCTGGTGCG TATTTTGCTG CCAAATTAGG TGTTTTAGAG
TATCTCAGTG CAAAAAAGAT TCAATCTGGA GCCGTAATTT TAAGAGAAAT CCGACCTGAA
TATGCAATAC CTGTAGGCGT CTGGCAGGTT CGTGAAGGAA TTAGAGAAGC AATGAAACAA
ACCCCAGTAA TTGCAAATAA TTTTGATCAT GCATTGAATT TGGCATCCGA GAAACTAAGC
ATTAGCAAGT CTGAATGGCT TGCACATGGA AATATCTCCA AACTAATGAG ACAAAAAACT
TTGTCAGACT TTTTCTGA
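
The GC content field in the gene information can in principle be recomputed from a gene sequence laid out in blocks of ten bases like the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene length. The record does not say how its 36% figure was derived or rounded. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Only the first 60 bases of the gene sequence are used here for brevity;
# paste the full sequence to compare against the record's GC content field.
first_line = (
    "ATGTCTTCTG ACTCTCAGGA TATTCGCCGT TCAATTTTAA CAAAATGGCA TGAGACATTA"
)
print(f"{gc_content(first_line):.1f}% GC over the first 60 bases")
```

Stripping whitespace before counting lets the blocked layout be pasted in unchanged.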
 
Protein sequence
MSSDSQDIRR SILTKWHETL SKYGNLFSSD SISGTSPPSV FVGSYNYPKV FVGPMVPPIH 
GDTSLLDSPE KWKGKSLEEI VNFRLNLVRG TQKLSIDKTD GRYIENLQEV TMSSKPTDSD
LIFQKSVSSN ISLDGESAPF GPVGEIKSAK FSGTSSVKSI EKTYYDKDLK AQDAVMNLYN
SGIDISKIQK CFSIGMLGQK RKLVPTKWSI TATDDIISQS LADEVLDYAL IDSCKVFSYS
HLGNHFSVVL FPHRWIYEMV EAWYSNGILG FGSDFEDARG IDHPPAIAGA YFAAKLGVLE
YLSAKKIQSG AVILREIRPE YAIPVGVWQV REGIREAMKQ TPVIANNFDH ALNLASEKLS
ISKSEWLAHG NISKLMRQKT LSDFF