Gene Nmar_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0299 
Symbol 
ID5773456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp265235 
End bp266902 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content35% 
IMG OID641315925 
Productribulose-phosphate 3-epimerase 
Protein accessionYP_001581633 
Protein GI161527807 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0036] Pentose-5-phosphate-3-epimerase
[COG3959] Transketolase, N-terminal subunit 
TIGRFAM ID[TIGR01163] ribulose-phosphate 3-epimerase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0242597 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCTCA ATTATTATCA AATTAAGAAA CATGTTCTGA GAGCTCGAAA ATTAGTTATC 
AAAGCCACAA ATACTGCTGG TTCAGGTCAT CCAGGCGGTT CATTTTCTAT GGCTGAAATT
TTAGGTTGTT TATTCAACAA ATATCTAAAA TTTGATCCTA AAAATCCACA ATGGGAAGAT
AGAGATCGTT TAGTGTTATC AAAAGGTCAT GCTGCCCCGG GTTTATTTTC AAATATGGCA
GTTGCAGGAT ATTTTCCAGA ATCAGAACTT GAAACTTTGA GAAAATTTGG AAGTAAACTA
CAAGGTCATC CTGATTTGAA ATGTCCTGGT GTTGAATTTT GTGGTGGCTC GTTAGGTACT
GGTTTATCCT ATTCTGTTGG AATTGCACTT GCAGGAAAAA TTGATTCTAA AGACTATCAT
GTCTATACAA TTATTGGAGA TGGAGAATCC GATGAAGGTC AGGTTTGGGA AGCTGCGATG
ACTGCAGCAA AATACAAAGT TGATAATCTT ACTGTTTTTC TTGACAGGAA TTTTATCCAA
CAAGATTCTT ACACTGAAAA AATTATGCCA CTTGACAAAA AGTTAGAAAC TGATGATCTT
TCTGAAATGT GGAAAGATGC TTCTAGATGG AAAACAGGTG ACAAATGGAG ATCTTTTGGT
TGGAATGTAA TTGAAATTGA TGGTCATCGA GTTGAACAGA TTGATGCAGC TATTACAAAA
GCAAATACGA TAAAAGGTGT CCCAACAATA ATTATCTCAA GAACAATTAA AGGAAAAGCT
GTTGAACATA TGGAAGATAA TCCTGCATGG CATGGAAAGG CACCTGATTC TGATGTTGTT
CCAATTATCA ACATGGAATT AGATTCTCAG TTTATGATTG CACCATCAAT TATTGCTGGT
GATATGTCAA ATCTTGAAAA TGAAGTTAAG AGATGTGTTT CTGGAAGAGC TGATTACATT
CATCTAGATG TCATGGATGG TCAATTTGTT CCAAACAAAA CATTTGATCA TATTAAAATC
AAAGAATTAC GTCCACTTAC CGTAATCCCA TTTGATTCTC ATTTGATGAT TAACGAACCT
GTAAAACATG TTAGAGATTA CATTGATGCA GGTAGTGATA TCATTACTGT ACATGCAGAA
GTAACTGATG AATCTAGTTT TGGAGAAATT CATGATGTAT TAAAACAAAA TCAAGTTGGT
ATTGGTTTTG CAATAAATCC TGATACGGAA TTACCTGAAT GGTCTTACAA ATTCATACCT
TCACTTGATC AGCTTATTGT GATGTCTGTA GTTCCCGGAA AATCTGGTCA GAAATACATT
GAGGAAACTC ATGCAAAAAT GGCTAGATTG AATACTATTC TAAACGAGCA TAATTTTTCC
GGATACATTG AGGCTGATGG CGGAGTAAAT CTTGAAAATA TAGGCTCGGT TTTTGCAGAT
GGAGCACGTG CTTTTGTTGG AGGAGGAGCA ATTATCGGAC AACAAGATGT TCGTGCTGCA
ATTAGAGACT TTAGAACTGA AGTATTGTCT TCTAGAAGAC AACTATTGCT TGACAAAGCA
AATGATCTAG GGGGCACTGA ATTAGTTAAC AAATGGATTG GATTGCATGT TGTTGGTGAA
AAACAAGAAC AAATTAAAAA AATTGCAGAG GAGAGAGGAT ACCTTTGA
 
Protein sequence
MGLNYYQIKK HVLRARKLVI KATNTAGSGH PGGSFSMAEI LGCLFNKYLK FDPKNPQWED 
RDRLVLSKGH AAPGLFSNMA VAGYFPESEL ETLRKFGSKL QGHPDLKCPG VEFCGGSLGT
GLSYSVGIAL AGKIDSKDYH VYTIIGDGES DEGQVWEAAM TAAKYKVDNL TVFLDRNFIQ
QDSYTEKIMP LDKKLETDDL SEMWKDASRW KTGDKWRSFG WNVIEIDGHR VEQIDAAITK
ANTIKGVPTI IISRTIKGKA VEHMEDNPAW HGKAPDSDVV PIINMELDSQ FMIAPSIIAG
DMSNLENEVK RCVSGRADYI HLDVMDGQFV PNKTFDHIKI KELRPLTVIP FDSHLMINEP
VKHVRDYIDA GSDIITVHAE VTDESSFGEI HDVLKQNQVG IGFAINPDTE LPEWSYKFIP
SLDQLIVMSV VPGKSGQKYI EETHAKMARL NTILNEHNFS GYIEADGGVN LENIGSVFAD
GARAFVGGGA IIGQQDVRAA IRDFRTEVLS SRRQLLLDKA NDLGGTELVN KWIGLHVVGE
KQEQIKKIAE ERGYL