Gene Nmar_0523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0523 
Symbol 
ID5773307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp466837 
End bp467889 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content37% 
IMG OID641316156 
Productalcohol dehydrogenase 
Protein accessionYP_001581857 
Protein GI161528031 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCG CCAAAATCCC AGGTCCAAAT GAACCTCTAA CAATATCTGA AACTGAAAAC 
CCAAAACCAT CTGGAACCCA GGTATTACTT AAAGTAAAAT CTGAAGGTGT CTGTCATAGT
GATTTGCATC TGTGGGAAGG TGGATATGAC CTTGGAGATG GTCAATTTTT GAAAGTAACT
GATCGTGGTG TAAAATACCC TGTAACGCCT GGACATGAAA TTGTTGGAAC TATTGAAGAG
ATTGGAGAAA ATGTTTCAAA TGTAACTGTA GGTGATGATG TTCTAGTTTT TCCTTGGATG
GGCTGTGGTG AATGCCCTGC ATGTAAAGTT GGTAATGAAA ATCTATGTGA TGCTCCAAAA
TCGATGGGGC TTTTCCAAAA TGGTGGTTAT GCTGATTATG TTTTAGTTCC GAATTCTAAA
TATTTAGCAA AACTTGATGG TGTTGATCCT GATGCTGCAA CTTCACTTGC ATGTTCTGGA
TTAACTGCAT ACACTGCTAT CAAAAAAGCA AATCAAAATT CTCCAGAATT CATTGTAATT
GTAGGAGCTG GTGGATTGGG ATTGATGGGA GTTCAAATTG CTAGTGAGAT TACTAATGCA
AAAATCATTT GTGTTGATTT AGATGATGCA AAATTAGCAA CGGCAAAAGA AATGGGTGCT
CATTTTACTG TAAATTCTAA AGATTCTGAA ACTGTTCAAA AAATAATGTC AATATGTAAT
GATAAGGGTG CAGATAGTGT TGTTGACTTT GTTAATGCTC CACCAACTGT AAAGACTGGC
TTAGCAGTGT TAAGAAAAAG AGGAAATCTT GTTCTAGTTG GATTATTTGG TGGCTCACTA
GAATTGTCTC TCGTTACAAT TCCTCTAAAA TCAATTACCA TTCAAGGTGC ATACACTGGA
AATTACAATG ACATGGTTGA ACTACTTGGA CTTGCAAGAA AAGGAACCAT AAACCCAGTT
ATTTCAAAAA GATATTCTCT TGATGAAGCA AATTCTGCAT TACAGGATCT TAAAGATCGT
AAAATCCTTG GACGTGCAGT CATCAATCCA TGA
 
Protein sequence
MKSAKIPGPN EPLTISETEN PKPSGTQVLL KVKSEGVCHS DLHLWEGGYD LGDGQFLKVT 
DRGVKYPVTP GHEIVGTIEE IGENVSNVTV GDDVLVFPWM GCGECPACKV GNENLCDAPK
SMGLFQNGGY ADYVLVPNSK YLAKLDGVDP DAATSLACSG LTAYTAIKKA NQNSPEFIVI
VGAGGLGLMG VQIASEITNA KIICVDLDDA KLATAKEMGA HFTVNSKDSE TVQKIMSICN
DKGADSVVDF VNAPPTVKTG LAVLRKRGNL VLVGLFGGSL ELSLVTIPLK SITIQGAYTG
NYNDMVELLG LARKGTINPV ISKRYSLDEA NSALQDLKDR KILGRAVINP