Gene Noc_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2046 
Symbol 
ID3705022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2355175 
End bp2356176 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content58% 
IMG OID637738521 
Productzinc-containing alcohol dehydrogenase superfamily protein 
Protein accessionYP_344036 
Protein GI77165511 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID[TIGR02823] putative quinone oxidoreductase, YhdH/YhfP family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTT TTCGGGGTCT GCGCATTGAC CGGGAGAAGG AGGGGATCCA GGCTCGCTTA 
GAAACCCTGC ATTTAGAGGA TCTTTCCCCC GGTTCGGTGG TGATCCGGGC TTATTATTCC
AGCGTTAATT ACAAGGATGC CTTGGCGGCC ACCGGCAAGG GTAAGATTAT GCAGCGTTTC
CCCCTGGTGG GCGGCATTGA TGTTTCAGGC GTGGTGGAGA GTTCGACCGA TCCCCGCTGT
CGTCCGGGGG ACAAGGTGTT GGTCACTGGC TATGGGTTAG GCAGTGATCA TGACGGAGGT
TATGCCGGCT ATGTTCGGGT GCCGGCGGAC TGGGTGGTGC CTTTGCCGGA GGGCTTAAGC
CTGTATGACG CCATGGCGTT GGGGACCGCG GGCTTTACCG CTGCCCTAGC CATCCAGCGG
ATGGAGGACA ATGGGCAGCG ACCGGATCGA GGTCTCGTCC TGGTCACGGG GGCGACAGGC
GGCGTGGGGA ATCTGGCCAT CAATATGCTG GCCGGGCTCG GTTACCCGGT GGTGGCTCTG
ACCGGTAAGC GGGAGGCAGT GGAAGATTTA AAAACCTTGG GCGCAAGCCA AATCTTATTT
CGACAAGAAT TAGAAATGGG CCAACGTCCC CTGGAAAAAG GGCAATGGGG CGGGGCCGTG
GATGTCGTCG GAGGAGATAT GCTGAGTTGG CTTACCCGGA CTGTGCTGCC CTGGGGCAAC
ATCGCCAGTA TTGGTCTAGC GGGGGGGAGT GAGCTGCACA CCACGGTTAT GCCTTTTATT
CTGCGGGGCG TGAGCCTGTT GGGGATTTCT TCCGCGGACT GTCCCATGCC CTTGCGCCAG
CATATTTGGC AACGGTTAGC CACCGATTTG CGGCCTAGGC ACCTTAATCA AATTGTCACC
GGAATGGTTT CCCTGGAGGA ATTATTACCC ATTTTTGAAG GCATGCTGGC GGGAGCTCAT
CGGGGAAGAA CGGTGGTAAA AATCAGGGAC GATGAGGGTT AG
 
Protein sequence
MESFRGLRID REKEGIQARL ETLHLEDLSP GSVVIRAYYS SVNYKDALAA TGKGKIMQRF 
PLVGGIDVSG VVESSTDPRC RPGDKVLVTG YGLGSDHDGG YAGYVRVPAD WVVPLPEGLS
LYDAMALGTA GFTAALAIQR MEDNGQRPDR GLVLVTGATG GVGNLAINML AGLGYPVVAL
TGKREAVEDL KTLGASQILF RQELEMGQRP LEKGQWGGAV DVVGGDMLSW LTRTVLPWGN
IASIGLAGGS ELHTTVMPFI LRGVSLLGIS SADCPMPLRQ HIWQRLATDL RPRHLNQIVT
GMVSLEELLP IFEGMLAGAH RGRTVVKIRD DEG