Gene Noc_1394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1394 
Symbol 
ID3706080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1544221 
End bp1545333 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content51% 
IMG OID637737888 
Productzinc-containing alcohol dehydrogenase superfamily protein 
Protein accessionYP_343417 
Protein GI77164892 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0239994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAT CAAGAGCAGC CATTGCATGG GGACCGAAAC AACCCCTTTC CCTGGAAGAA 
GTGGAGGTAA TGCCGCCGCA AAAAGGGGAA GCATTGGTTC GTATCGCCGC CACTGGCGTG
TGCCATACCG ATGCGTTTAC TTTGTCAGGC GAAGACCCGG AAGGTGTTTT TCCCGCTATC
CTCGGTCATG AAGGAGGAGG GATCGTAGAG GCTATTGGCG AAGGCGTGAC CAGCGTCGCG
GAAGGAGACC ATGTCATCCC CTTATATACG CCAGAGTGTG GCGGATGTAA ATTCTGCCTA
TCCGGCAAAA CTAATCTATG CCAAAAAATT CGGGCGACTC AAGGAAAGGG GCTAATGCCA
GATGGCACCA CCCGCTTCTA CAAAGCGGGC CAGCCCATCT ACCACTATAT GGGCTGCTCC
ACTTTCTCCG AGTACACCAT ATTACCTGAA ATCTCGCTTG CGAAAGTGAA CAAGGAGGCT
CCTCTGGAAG AGGTTTGCCT GCTGGGCTGT GGCGTCACCA CAGGTATAGG GGCCGTCATG
AACACCGCTA AAGTGGAAGA AGGCGCCACG GTGGCTATTT TTGGTCTGGG AGGAATTGGT
TTGGCAGCCA TTATTGGCGC CACTATGGCC AAAGCCAGCC GCATTATCGG GATCGATATC
AATGAAGGTA AGTTTGAACT GGCCCGTAAG CTGGGGGCGA ACGACTGTAT CAATCCCCAA
AACTACGATC GGCCTATCCA GGAAGTAATC ATCGAACTGA CTGGCGGTGG CGTGGATTAT
TCCTTTGAAT GTATTGGTAA TGTTAAGGTT ATGCGTTCTG CATTGGAGTG TTGCCACAAA
GGCTGGGGGG AATCGGTGAT TATTGGCGTT GCCGGCGCTG GCCAGGAAAT CTCTACCCGC
CCATTTCAGT TGGTCACTGG ACGGGTATGG CGAGGTTCTG CATTTGGTGG CGTTAAAGGC
CGCTCTGAGT TACCAGAATA TGTAGAACGT TACCTGAAAG GCGAATTCAA ACTCGATGAC
TTTATTACCC ATACCATGGG GCTGGAAGAC ATCAATAAGG CTTTCGATCT AATGCATCAA
GGCAAGAGTA TTCGTAGCGT CATTCATTAT TAA
 
Protein sequence
MIKSRAAIAW GPKQPLSLEE VEVMPPQKGE ALVRIAATGV CHTDAFTLSG EDPEGVFPAI 
LGHEGGGIVE AIGEGVTSVA EGDHVIPLYT PECGGCKFCL SGKTNLCQKI RATQGKGLMP
DGTTRFYKAG QPIYHYMGCS TFSEYTILPE ISLAKVNKEA PLEEVCLLGC GVTTGIGAVM
NTAKVEEGAT VAIFGLGGIG LAAIIGATMA KASRIIGIDI NEGKFELARK LGANDCINPQ
NYDRPIQEVI IELTGGGVDY SFECIGNVKV MRSALECCHK GWGESVIIGV AGAGQEISTR
PFQLVTGRVW RGSAFGGVKG RSELPEYVER YLKGEFKLDD FITHTMGLED INKAFDLMHQ
GKSIRSVIHY