Gene Noc_2551 details


Gene Information

Locus tag: Noc_2551
Symbol:
ID: 3704554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2901256
End bp: 2902254
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 53%
IMG OID: 637739030
Product: zinc-containing alcohol dehydrogenase superfamily protein
Protein accession: YP_344534
Protein GI: 77166009
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
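
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 2901256
END_BP = 2902254
GENE_LENGTH_BP = 999
PROTEIN_LENGTH_AA = 332


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: 2902254 - 2901256 + 1 = 999 bp, and 999 / 3 - 1 = 332 aa.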


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCCA TAATCATGAC AGCAACGGGA GGCCCCGACG TCTTGCAACT GCAAGAGTTG 
CCGAAGCCTA CAATCCGTCA ACCCGGGGAA GTTCTGGTTC AGCTCAAAGG GGCTGGAATT
AACCCGGTGG ACACCAAATT GCGAACTCGA GGTACTTTCT ACCCAGACCG CTCCCCGACA
ATCCTCGGTT GCGATGGGGC GGGCGTTGTG GATGCTGTGG GCAGAGAGGT TAAAAATTTC
CAGAAGGGAG ATGAGGTTTA TTTCTGCTTC GGGGGAATTG GAGGTCCGGA AGGGAACTAT
GGGGAATATG CGGTAGTAGA CCATCGCTTT ATCGCTAAAA AACCAAGAAC GCTCTCCTTT
GCCGAAGCTA GCGCTGCCCC CCTGGTTTTG ATAACCGCTT GGGAAGCACT GCATGATCGG
GCGCGAATCC AGCCAGAGGA TACAGTATTG ATTCATGGCG GCGCAGGCGG TGTAGGCCAT
GTAGCCATTC AATTAGCCAA ACAGACCGGT GCTCGGGTCT GCGTCACCGT GAGCTGCGAA
GAAAAAGAGG AACTTGCCTG CTCCTTGGGA GCAGACCATA TCATCAACTA TCGCCAAACC
GATTTCGTTG AAGCCATTAT GGAATGGACC AGCGGTAAAG GGGTGGACGT GGTATTTGAT
ACGGTGGGGG GAGAAATTTT TGAAAAGAGC TGTGGAGCCG TCGCCATGTA TGGAGATTTA
GTCACCCTCT TACAGCCGAG TGCCAACATA AATTGGAATA CGGCGCGTGC GCGTAATCTC
CGCTTTAGTC TGGAATTGAT GCTGACTCCT ATGCACCGGG GCCTTATCTC TGCCTTAGAA
CATCAAGCAG ATATTCTGCA TTGCTGCGCT GAATTATTCG ACTCCGAGCG TCTTCGGCTT
CACTTCCAGC AAACCTTTCC CCTAGCGGAA GCAGCGGCTG CCCACCGTTT GCTGGAACGG
GGAGGAATGA TGGGTAAATT AGCCCTTGAG ATGGGTTAG
 
Protein sequence
MKAIIMTATG GPDVLQLQEL PKPTIRQPGE VLVQLKGAGI NPVDTKLRTR GTFYPDRSPT 
ILGCDGAGVV DAVGREVKNF QKGDEVYFCF GGIGGPEGNY GEYAVVDHRF IAKKPRTLSF
AEASAAPLVL ITAWEALHDR ARIQPEDTVL IHGGAGGVGH VAIQLAKQTG ARVCVTVSCE
EKEELACSLG ADHIINYRQT DFVEAIMEWT SGKGVDVVFD TVGGEIFEKS CGAVAMYGDL
VTLLQPSANI NWNTARARNL RFSLELMLTP MHRGLISALE HQADILHCCA ELFDSERLRL
HFQQTFPLAE AAAAHRLLER GGMMGKLALE MG
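
The relationship between the gene sequence and the protein sequence can be illustrated with a short, dependency-free sketch. It assumes the codon assignments of translation table 11, which for elongation codons match the standard genetic code, and it strips the spaces that group the sequence into 10-base blocks. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers written for this example, not calls from an existing library. Only the first line and the last nine bases of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 is assumed
# to share these assignments for non-start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to display 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence, as displayed in the record.
    first_line = (
        "ATGAAAGCCA TAATCATGAC AGCAACGGGA GGCCCCGACG TCTTGCAACT GCAAGAGTTG"
    )
    print(translate(first_line))   # MKAIIMTATGGPDVLQLQEL
    print(translate("ATGGGTTAG"))  # MG (last nine bases of the gene)
```

The first call reproduces the first 20 residues of the protein sequence, and the second reproduces its final two residues, with the trailing TAG read as the stop codon. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.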