Gene Nmar_0371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0371 
Symbol 
ID5773357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp337790 
End bp339022 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content37% 
IMG OID641316000 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_001581705 
Protein GI161527879 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATA AGATAGATAA GCACAGTATG AGAAACATCA TCTTTGTTGT ATCTATAATG 
ATTTTACTTG GAGGTACAAG TGCATATGCA GAATCTTTTC CAGAGATTGG GGTAAAAGTT
GATGTGATTG CAGACAACCT CAAAATTCCA TGGGGAATTG ACTTTGCACA AGATGGACGA
ATTTTCTTTA CAGAAAGACC AGGTACAGTA AATGTGATTG AAGATGGACA AGTAAGCCAG
ATCATGTCTC GAGGTGTGGG AGGAGGAGAA GGCGGAATGC TAGGAATTGC ACTAGATCCA
GAATTTGAGA AAAATCACTA CGTCTATGTG TACTATACGT ATAATGAACT ACTTGGAATC
AAGAACAGAT TAGTACAATA TGTTGAATCA GACAACAAAC TAAATCATGA GAAAATTTTG
CTTGAAGATA TTCCTGGTGC ACCGTATCAC GATGGTGGTC GAATAAAATT CGGACCAGAT
GAAATGTTGT ATGTTACAAC AGGGGATGCA GTAGAACCAG AACTTTCACA AAACTTGAAT
TCAGTTGCAG GAAAAATCTT GAGAATCAAG TCAGATGGAA CAATTCCTGA AGATAATCCG
TTTGGTTCAG CAATCTACTC CATTGGACAT CGTAATCCGC AAGGAATTGC ATGGGACAAG
TCTGGAAATT TAATTGCAAC AGAACATGGA CCTTCTGGAT GGCGTGGAGT TGCACATGAT
GAAATCAATT GGATAGTATC AGGTGCAAAC TATGGATGGC CAGATGTTAT TGGTGATGAA
ACATTAGAAG GTGCAACAAA TCCAATTTTG CATTCAGGTG ATGATACTTG GGCTCCTTCA
GGTTCTACAT TTTACTATGG AGACGACATG CCAATGTTTG ATGGAAAATA TTTTGTTGCA
GCACTTAAAG GACAACATAT TCACGTCATA GAATTTGATG AGAGTTACAA TGTGTTATTT
CACGGAGAAT TATTTTCAGG AGAGTTTGGA AGAATTAGAG ATGTTGCAAA TGGTCCGGAT
GGATTATACT TTATGACAAG TAATCAAGAT GGAAGAGGCA ATCCAAATCT CTACGATGAT
AAAATTTTGA GAATTTCTCC ATTGTATAAC TATGAAAACA ATTCATGGGT ACAAAACATC
TCAGAATGGT ACATGAAAGG AGAAATTTCA AAGGAAGAAT CAATTAATGC TCATTCATAT
CTAATTGAAA GAGGAACAAT TTCTCAAAAT TAA
 
Protein sequence
MYDKIDKHSM RNIIFVVSIM ILLGGTSAYA ESFPEIGVKV DVIADNLKIP WGIDFAQDGR 
IFFTERPGTV NVIEDGQVSQ IMSRGVGGGE GGMLGIALDP EFEKNHYVYV YYTYNELLGI
KNRLVQYVES DNKLNHEKIL LEDIPGAPYH DGGRIKFGPD EMLYVTTGDA VEPELSQNLN
SVAGKILRIK SDGTIPEDNP FGSAIYSIGH RNPQGIAWDK SGNLIATEHG PSGWRGVAHD
EINWIVSGAN YGWPDVIGDE TLEGATNPIL HSGDDTWAPS GSTFYYGDDM PMFDGKYFVA
ALKGQHIHVI EFDESYNVLF HGELFSGEFG RIRDVANGPD GLYFMTSNQD GRGNPNLYDD
KILRISPLYN YENNSWVQNI SEWYMKGEIS KEESINAHSY LIERGTISQN