Gene Nmar_0742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0742 
Symbol 
ID5773024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp671572 
End bp672669 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content34% 
IMG OID641316379 
Productradical SAM domain-containing protein 
Protein accessionYP_001582076 
Protein GI161528250 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.547138 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGAGC AATTAGTAGG TGACTCTAAG GCATTAGATC GAGCAATTGC AGGAGAAGAT 
TTGTCCTACA ATGACGGTAT GGATTTGATG AATTATGATA ATTTACATCT TCTTGGTGCA
GTTGCTGATG CTCGAAGAAA AGAGTTGGTT GGTGATACTG TAACTTTTGC AGCATCTTAT
TACATGAATT ACACAAATGT CTGTGCTGCA AGTTGTCAAA TGTGTGCCTT TTACCGAAAA
GATGGTGCTG AAGATGCATA TACGCTAACT CCTCAAGAAA TTGAACAAAG AGTTGGAATT
GCAAAACAAA TGGGAGCAAC TGAAGTACAC ATTGTTGGAG GATTTCATCC AAAACTTCCA
TTAGAATACT ATGAAGACAT GATGAGAATA ATCAAAAAAC ATCATCCTCA ACTAAACATT
AAAGCATTAA CTGCTGCAGA AATTTTCTAT TTATCAAAAC TGACAAAAAA TTCTACTAAA
GAAATTTTAT CTCGTTTAAA AGATGCTGGA CTTGATTCAA TGCCTGGTGG TGGTGCTGAA
CTATTTCATC CAGAAATTAG AAAACAAATT GTTCGAGGAA AATGTACTGG ACAAGAATGG
TTAGATGTAA TTGAAGAAGC ACATACCATG GGAATTCAAA GTAATGTAAC TATGCTTTAT
GGACATATAG AAAAACCTGA GCACATTGTT GATCATCTCA TTAAAATTCG TGACTTGCAA
AAAAAGACAA ATGGATTTAT CACTCTTATT CCTCTCAAAT TTAGCCTTGA TAATACTGAA
TTAGAACAAG AACATTTAGT AAATAATGAA TGTTCTTCTG TGTATGACTT GAGAGTGATT
GCATTGTCTC GTTTAATGCT TGCAAATTAT TTGAACAACA TTTCTGTTTA TTGGGTAGCA
TATGGTAAGA AACTTGCTCA AGTAGCATTA TCTAATGGTG GTAGTGATCT TGTTGGTACT
GCCTTTTCTG AAGAAATTTA TCGCGCTGCA GGAAAAGCAA CTAACTCTTC AGTAGATGAA
CTCGCAACTA TGGTAAAAGA GATAGGACGT GTTCCTGCAC AACGAAATAC TCATTTTGGA
ATTTTAAAGA ATTTTTAA
 
Protein sequence
MLEQLVGDSK ALDRAIAGED LSYNDGMDLM NYDNLHLLGA VADARRKELV GDTVTFAASY 
YMNYTNVCAA SCQMCAFYRK DGAEDAYTLT PQEIEQRVGI AKQMGATEVH IVGGFHPKLP
LEYYEDMMRI IKKHHPQLNI KALTAAEIFY LSKLTKNSTK EILSRLKDAG LDSMPGGGAE
LFHPEIRKQI VRGKCTGQEW LDVIEEAHTM GIQSNVTMLY GHIEKPEHIV DHLIKIRDLQ
KKTNGFITLI PLKFSLDNTE LEQEHLVNNE CSSVYDLRVI ALSRLMLANY LNNISVYWVA
YGKKLAQVAL SNGGSDLVGT AFSEEIYRAA GKATNSSVDE LATMVKEIGR VPAQRNTHFG
ILKNF