Gene Nmar_0497 details

Gene Information

Locus tag: Nmar_0497
Symbol:
ID: 5774750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 447804
End bp: 449048
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 35%
IMG OID: 641316129
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_001581831
Protein GI: 161528005
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
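
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the database.

```python
# Values copied from the gene record.
start_bp = 447804
end_bp = 449048

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1245 bp
print(protein_length, "aa")   # 414 aa
```

Both results agree with the listed gene length (1245 bp) and protein length (414 aa). The calculation does not depend on the strand, which the record leaves blank.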


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAAAGTA CTCAATCTTC ATTTGAAAAC TTAAGAAAAG ATTTTCCAAT CCTTGAACGA 
ACTGTTAGAG ATAACAAACA TCTAGTTTAT CTTGATAATG CATCTACTAC ACAAAAACCA
AATCAAGTAA TTGATGCTAT TACTGATTAC TATCAAAATC ATAATGCAAA TATTCACAGA
GCAGTTTATG CTCTAGCTGA AGAAGCAACA GAAGCTTATG AGAAAACAAG AGACAAAATT
GCAAATTTTG TTAACATTAA AGACAGACAA GAAATTATCT TTGTTAGAGG AACCACTGAA
GCAATTAATC TAGTTGCATA TGCATGGGGT AGACCTCACA TTAGTGAAGG TGATATTATT
GTCACTACTG AATATGAACA CCACAGTAAC ATTGTTCCAT GGCAACTTCT TACACAAGAA
AAACGTGCCA AACTAGAATA CATTGGAATG GATGACAATG GTGAATTGAT TCTAGATGAT
CTTGATAAAC ATCTTGCAAC AGGTAAAGTG AAACTTGTTA CATTTAGTCT AATGTCAAAT
GTACTTGGTA CAATTACTGA TGCCAAAAAA ATTATTGAAA AATGTAAAGC TGCAGGCGTT
CCTACTTTAG TTGATGGTGC TCAGGCAGTT CCTCACATGA AAGTTGATCT TGAAGATTTA
GACTGTGATT TCTTTGCATT TTCAGGCCAC AAAATGTTAG GTCCAACTGG AATTGGTGTT
CTATGGGTCA GAAAATCAGT TCTCAATACA ATGAGCCCCT TCCATGGTGG TGGTGACATG
ATTCGTGAAG TTCACAAGTA TGAGACTACT TGGAATGATT TGCCTTACAA ATTTGAAGCA
GGAACTCCTA ACATTGCAGA TGTTGTTGGA TTTGGAGCTG CAATTGATTA TTTGACTAAA
ATTGGAATGG ACAATATTCG TGAACATGAA ATTGAACTAA CAAAATACGC TATGGAAAAG
CTTTCTAATA TTAAAGGACT ACAAATCTAT GGAACTAAAG ATGTCTCAAA ACGTGGTGGC
GTCATCTCAT TTAATTTTGC AGATGTGCAT CCTCATGATG TAGCACAAAT TATTGATGAA
GAGGGAATTG CATTGCGTTC TGGTCATCAC TGTGCACAGG TATTGATGGA GAGACTAAAT
GTAGCTGCTA CTTCAAGAGC AAGTTTCTAC ATTTACAATA CTAAAGAAGA CATTGATGTA
CTAGTTAATT CATTAAATAC TGTGGCAAAG GTGTTCAAGT TATGA
 
Protein sequence
MQSTQSSFEN LRKDFPILER TVRDNKHLVY LDNASTTQKP NQVIDAITDY YQNHNANIHR 
AVYALAEEAT EAYEKTRDKI ANFVNIKDRQ EIIFVRGTTE AINLVAYAWG RPHISEGDII
VTTEYEHHSN IVPWQLLTQE KRAKLEYIGM DDNGELILDD LDKHLATGKV KLVTFSLMSN
VLGTITDAKK IIEKCKAAGV PTLVDGAQAV PHMKVDLEDL DCDFFAFSGH KMLGPTGIGV
LWVRKSVLNT MSPFHGGGDM IREVHKYETT WNDLPYKFEA GTPNIADVVG FGAAIDYLTK
IGMDNIREHE IELTKYAMEK LSNIKGLQIY GTKDVSKRGG VISFNFADVH PHDVAQIIDE
EGIALRSGHH CAQVLMERLN VAATSRASFY IYNTKEDIDV LVNSLNTVAK VFKL
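
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own method. The codon table is built from the standard NCBI amino-acid ordering, the set of alternative start codons is an assumption based on the NCBI definition of table 11, and the names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (first, second, third base each cycle T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons allowed by NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("TTGCAAAGTA CTCAATCTTC ATTTGAAAAC"))  # MQSTQSSFEN
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the complete protein if the assumptions above hold.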