Gene Nmar_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1035 
Symbol 
ID5773297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp909195 
End bp910340 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content38% 
IMG OID641316677 
Producthypothetical protein 
Protein accessionYP_001582369 
Protein GI161528543 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1980] Archaeal fructose 1,6-bisphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATA TGAAAATTAC AGTTTCAGTT ATCAAAGCCG ATGTCGGCGG TGTCGGAGGA 
CATACAAAAC CTAGTGACGG ATTATTAGAC GCAATTAGAA ATACCGTTAA AAATTCAGCA
GATTTGCTTA TCGATTATTA CATTGGATAT TGTGGTGATG ACACCCATAT CGTAATGTCT
CACACTCATG GTGTAGACAA TCAACAAATT CACAAACTAG CATGGGATGC ATTCATGGCA
GGAACTCAAG TTGCAAAAGA AGAGGGATTG TATGGTGCAG GACAAGACTT GCTCAAAGAC
TCTTTCTCTG GAAACGTAAA AGGAATGGGT CCAGGAGTTG CAGAAATGGA ATTTGAAGAA
AGACCAAATG AAGCATTTAC AGTATTTGCA GCTGACAAAA CAGAACCAGG TGCATTCAAC
TATCCAATTT ACAGAATGTT TGTAGATGCA CTAAGTAACA CAGGATTAAT TGTAAACAAG
AATCTTGCAG ACGGGGTTAA AATTAATATC ATGGATGTTG AAAAGGCTCA GATTGCAGAG
TTGCAATTAT GGGAAGATAA ACCAACAATT GAAGCAGCAT TAATGTATCC AGGTAGATAC
GTTGTAGATT CAGTTACAAC AAAAGATGGA GAACCAATTC TTGCCGCATC AACTGATAGA
TTACACAATA TTGCAGGAAC ATATGTTGGA AAAGACGATC CAATTTGTGT TGTCAGAACA
CAAAAGAAAT TCCCTGCAAC TGAAGAAGTA GGAAGTGTGT TTAACAATCC ACATTTTGTT
GCAGGAAACA CAAGAGGAAG TCATAATATG CCATTAATGC CTGTAAAACT AAACTCTGCA
GCTACAATCA ACTTTTGTAT TCCAATCGTT GAGGCACTTG TATTTAGTAT GCATAACGGA
AAGTTTACAG GACCATTTGA TGGATTCTCA ACTCCAGATT GGGATCTAAT CAGAGAGAGA
GCAACAGAGA AAGCCATGGC AATTAGAAGC CAAGGATTTA TCCATCCAGC AACACTTGTA
CCATCAGAAC TAGAATATGC TGAAGGTTAT AGAGCTAGAA TGGATGTTCT TGAAAGTAAG
ATGAAACCAA TGGAAGGAAC TGATTCTAGC GGTGACAGAA AAGAGAATTA CGAAGATCCA
GATTAG
 
Protein sequence
MENMKITVSV IKADVGGVGG HTKPSDGLLD AIRNTVKNSA DLLIDYYIGY CGDDTHIVMS 
HTHGVDNQQI HKLAWDAFMA GTQVAKEEGL YGAGQDLLKD SFSGNVKGMG PGVAEMEFEE
RPNEAFTVFA ADKTEPGAFN YPIYRMFVDA LSNTGLIVNK NLADGVKINI MDVEKAQIAE
LQLWEDKPTI EAALMYPGRY VVDSVTTKDG EPILAASTDR LHNIAGTYVG KDDPICVVRT
QKKFPATEEV GSVFNNPHFV AGNTRGSHNM PLMPVKLNSA ATINFCIPIV EALVFSMHNG
KFTGPFDGFS TPDWDLIRER ATEKAMAIRS QGFIHPATLV PSELEYAEGY RARMDVLESK
MKPMEGTDSS GDRKENYEDP D