Gene Nmar_0554 details


Gene Information

Locus tag: Nmar_0554
Symbol:
ID: 5772944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 493936
End bp: 495054
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 35%
IMG OID: 641316187
Product: radical SAM domain-containing protein
Protein accession: YP_001581888
Protein GI: 161528062
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
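
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(493936, 495054)
print(length_bp)                  # 1119
print(protein_length(length_bp))  # 372
```

Both values agree with the Gene Length and Protein Length entries in the record.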


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.420102
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGTCAGA CAACAGAACA ATTAGAAAAA AGTGACATTA AAGATATTTT AGAAAATTCT 
CTTAATGGAA AAAGACCTGG TCCTGAAGAC TGTATGAGAT TGTTGGAGTC TGATGATGTT
CATCTAATGG GACTTGTATC TGGCCATTTG ACAAGAAAGC AATTTGGAAA GAAAGCATCT
TTTGTCAATA ATATTATTTT GAATTACACC AATGTCTGTA TTACTGACTG TAAGTTTTGT
GCATTTTACA GATCACCTGG TGCTGATGAT TCTTACACTT TAACTTTGGA ACAAATTGAA
TCACGTGTAA AAACCGCATG GGACATGTTT AAGATCCGAC AGGTCTTGAT TCAAGGTGGT
CATAACCCAA ATCTGAAAAT TGAATACTAT GAAGATGCAT TTAGAATGAT TAGGGAGAAA
TTCCCTAAAG TTGGTGTACA TGGATTGTCA ACATCAGAAA TTGACATGAT TGCAAGAGTT
GAAAAATCCT CAACAAAAGA AATTTTATCA CGACTCAAAG ACGCAGGTTT ACAATCAATG
CCTGGTGCAG GAGCTGAAAT CTTGACTGAC TCTGTTAAAG AAATCATTAG TCCAAAGAAA
ATCTCTAGTG ATGCTTGGAT TAGAATCATG AATGAAGCTC ATTCACTTGG AATTCCATCT
TCTGCAACAA TGATGTACGG ACATGTGGAA AACAAAAATG ACATTGTTGA ACACTTTTTC
AAACTTGTAA AATTACAAGA AAAAACCAAA GGATTCATGG CATTTATCCC TTGGAACTTT
GAGCCAAACA ATACTTTGAT GCATGAAGAG GGATTAGTTG AATATGGTAC TGGTGGAATT
CAACTCTTGA AAATGATTGC AATCTCTAGA TTAATCTTTG ATGGACTTAT ACCTCACATA
CAATCCTCAT GGCTGACAAA TGGTATCGGT ATGGCACAAC TAGCTTTACA GTATGGCGCT
GATGACTTTG GTGGTACTCT AATTGGAGAA GAAGTAGTTT CATGTACTGG CGCACGCTCA
ACTGAACTTA CTGATAAAAT AATCATGGAT GCAATTCATC AAATTGGTTA TTCAGTTGAA
GAGAGAGATA ATTTCTATAA TCCTATTTCT GTATCATAG
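
A GC content figure such as the one in the Gene Information section is the percentage of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence formatted like the one above, in space-separated blocks of ten bases. The function name `gc_content` is a hypothetical helper, and how the record itself rounds the value is not stated in the source.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in dna)
    return 100.0 * gc_count / len(dna)


# Example on the first two blocks of the gene sequence (8 of 20 bases are G or C).
print(gc_content("TTGAGTCAGA CAACAGAACA"))
```

Passing the full gene sequence in the same way gives the value for the whole gene.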
 
Protein sequence
MSQTTEQLEK SDIKDILENS LNGKRPGPED CMRLLESDDV HLMGLVSGHL TRKQFGKKAS 
FVNNIILNYT NVCITDCKFC AFYRSPGADD SYTLTLEQIE SRVKTAWDMF KIRQVLIQGG
HNPNLKIEYY EDAFRMIREK FPKVGVHGLS TSEIDMIARV EKSSTKEILS RLKDAGLQSM
PGAGAEILTD SVKEIISPKK ISSDAWIRIM NEAHSLGIPS SATMMYGHVE NKNDIVEHFF
KLVKLQEKTK GFMAFIPWNF EPNNTLMHEE GLVEYGTGGI QLLKMIAISR LIFDGLIPHI
QSSWLTNGIG MAQLALQYGA DDFGGTLIGE EVVSCTGARS TELTDKIIMD AIHQIGYSVE
ERDNFYNPIS VS
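
The protein sequence corresponds to the gene sequence translated with translation table 11 (the NCBI bacterial, archaeal and plant plastid code). That table uses the standard codon assignments but accepts alternative start codons, which is why the leading TTG of this gene is read as methionine rather than leucine. The sketch below is a minimal illustration under those assumptions; `translate_cds` and the constants are hypothetical names, and a library such as Biopython would normally be used instead.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order, shared by translation table 11.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons permitted by table 11; any of them initiates with methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]  # the start codon is always read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


first_line = "TTGAGTCAGA CAACAGAACA ATTAGAAAAA AGTGACATTA AAGATATTTT AGAAAATTCT"
print(translate_cds(first_line))  # MSQTTEQLEKSDIKDILENS
```

The output matches the first twenty residues of the protein sequence listed above.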