Gene Nmar_1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1220 
Symbol 
ID5773651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1121044 
End bp1122180 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content34% 
IMG OID641316864 
Productdehydrogenase (flavoprotein)-like protein 
Protein accessionYP_001582554 
Protein GI161528728 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR02032] geranylgeranyl reductase family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACTACG ATGTAGTTGT TGCAGGGGGT AGTGTTGCAG GATTACTTTG TGCAAGAGAG 
ATTGCTGCAG ATGGTTTTTC AGTATTAGTT ATAGAAGAAG ATTATGAAAT AGGCACACCA
GAACATTGCG GTGGTTTAGT TAGTATTTCA GGACTACAAG AGTTAGGAGT AATTCCATTT
AGAAAGACAT TTGATCATAT GATAGAATCA GCTGAAATAA CTGCTCCTAA CGGAAAAAGT
TTTTCGATAA ACTCAAAAAA TCAAAAGGTC ATAGAAATTA GTAGAAGAGA ACTGGACAAA
CAAATTGCTT TTCAAGCCCA AAAAAATGGA GCTACAATCA AAGTCAGAAC TAGTTTTCAA
GAGATGACCA ATACGGGAGT TAGAACAAAT GAAGAAAATA TAGATTGTAA AATTTTTGTT
GATGCAAGAG GAGTATCATC TTTAATTCAT AAAGACAGAA CAGGGATTTT ATCATCTGCA
CAATACGAAA TCTATGCAGA CTGGATAAAA AAAGGAAAAG TAGAAGTGAT CTTTGATCAA
GACAAATATC CAGGTTTCTT TGCATGGATA ATTCCATCAA ATGAAGGGAA GGGAAAGATT
GGTGTTGCAG GAAAAGGCAT CAATGTTTCA GAAACAATGG ACAAAATTCT TGCAGAGAAA
GGCAAGCATT CTACAATAAG AAAGATTTTT GCGCCAATTT GGGTAAAAGG ACCAATTGAA
AAATTTGTAG AAGGAAATAC CGTAATTGTA GGAGATGCTG CAGGACAAGC AAAACCAACT
ACTGCAGGAG GTATTTTTAC TAGCGGTATG GGAGGAGTTT ATGCAGGACA AGCAATATCA
GAATTTCTAA AAACAGATGA AAAATCAAAA CTAGAAGTAT ATCAAACTAG ATGGACAGAT
AGATTTGGGA AAGAGTTTGA AAAACAAATT TTTGCAAGAA AAATTTTAGA AAGACTAGAC
AATAACACAA TCAACAAATT ATTTGAATCA ATAACACCTG AAATCCTAAA AGATATTTCA
GAAAAAGATG ATTTTGATTT TCATACAGGT TCAATTGTAA AACTATTAGG ATTAAAGGGA
TCAATCAAAA CAGCTCAAAC ACTAATCGGT GGAGAATTCA AAAAACTACT TCGATAA
 
Protein sequence
MYYDVVVAGG SVAGLLCARE IAADGFSVLV IEEDYEIGTP EHCGGLVSIS GLQELGVIPF 
RKTFDHMIES AEITAPNGKS FSINSKNQKV IEISRRELDK QIAFQAQKNG ATIKVRTSFQ
EMTNTGVRTN EENIDCKIFV DARGVSSLIH KDRTGILSSA QYEIYADWIK KGKVEVIFDQ
DKYPGFFAWI IPSNEGKGKI GVAGKGINVS ETMDKILAEK GKHSTIRKIF APIWVKGPIE
KFVEGNTVIV GDAAGQAKPT TAGGIFTSGM GGVYAGQAIS EFLKTDEKSK LEVYQTRWTD
RFGKEFEKQI FARKILERLD NNTINKLFES ITPEILKDIS EKDDFDFHTG SIVKLLGLKG
SIKTAQTLIG GEFKKLLR