Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0592 |
Symbol | |
ID | 3706804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 637749 |
End bp | 638954 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637737100 |
Product | zinc-containing alcohol dehydrogenase superfamily protein |
Protein accession | YP_342641 |
Protein GI | 77164116 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGG TTGTTTTTCA CGGGATTGGT GATATTCGCC TCGATGAAGT ACCGGAGCCG CAGATCAAAG ACCCGACCGA TGCGGTAATA CGGGTTACTG CCAGCGCTAT CTGTGGCACC GATCTGCACA TGGTGCGCGG CACCATGGGC GGCATGGAAA ACGGCACTAT TCTCGGGCAC GAAGCTGTCG GGGTGGTGGA AGCCCTCGGT AAGGGTGTAC GCAATCTTAA AGAAGGCGAT CGTGTAGTTG TACCCTCCAC CATTGCTTGC GGCTATTGCG CCTACTGCCG TGCGGGCTAT CATGCCCAGT GCGACAATGC CAACCCGCAA GGCTTATTAG CCGGCACCGC TTTCTTCGGC GGGCCGAAAG CCTCCGGCCC CTTCCATGGC CTGCAGGCGG AGAAGGCGCG GATTCCTTTT GCCAATGCCG GTTTGGTAAA ACTGCCCGAT GAAGTCAGCG ACGATGAGGC CATCCTGGTT TCCGATATCT TCCCCACGGC TTATTTTGGC GCTGAATTGG CCGAGATTAA AGCCGGCGAC ACTGTGGCTG TATTTGGCTG CGGTCCGGTA GGACAGTTTG TTATTACCAG CGCGCAGCTA CTGGGCGCCG GACGAATACT CGCCGTGGAC TCTGTGCCTT CCCGTCTCGA AATGGCTCGA ACCCAAGGCG CGGAAATCAT CGACTTCAAC GCCGAGGATC CGGTGGCAAC CATCAGGGAT TTGACCGGCG GCATCGGCGT GGATCGCGCC ATCGACGCTG TAGGCGTGGA TGCTGAGCGC CCGCACCATG GGCCGGCTGC CAAGCAAACG GATGCGCAAC AGGCGCAATT TGAGCAGGAA CTCAAGGAAA TTGCTCCCGA GAACCATCCC CAAGGCGGCC ACTGGCATCC TGGAGATGCT CCCTCCCAGG CCTTGCGTTG GGCGGTTGAA AGCTTGGCTA AAGCGGGCAC CCTATCGATT ATCGGAGTCT ACCCACCTAA TGATCGCTTT TTTCCCATCG GTCAGGCGAT GAACAAGAAC CTTACCCTCA AGATGGGCAA TTGCAATCAC CGTAAATATA TTCCCATGCT GGTCAACTTG GTGCACACCG GGGTGGTCAA TCCCGCTGCG GTGCTCACCC AGCAGGAGCC GCTCACGGCG GTAATCGATG CCTACCAAAA CTTCGACCAG CGCAAATCCG GCTGGATCAA GGTGGAACTG AAATAG
|
Protein sequence | MKAVVFHGIG DIRLDEVPEP QIKDPTDAVI RVTASAICGT DLHMVRGTMG GMENGTILGH EAVGVVEALG KGVRNLKEGD RVVVPSTIAC GYCAYCRAGY HAQCDNANPQ GLLAGTAFFG GPKASGPFHG LQAEKARIPF ANAGLVKLPD EVSDDEAILV SDIFPTAYFG AELAEIKAGD TVAVFGCGPV GQFVITSAQL LGAGRILAVD SVPSRLEMAR TQGAEIIDFN AEDPVATIRD LTGGIGVDRA IDAVGVDAER PHHGPAAKQT DAQQAQFEQE LKEIAPENHP QGGHWHPGDA PSQALRWAVE SLAKAGTLSI IGVYPPNDRF FPIGQAMNKN LTLKMGNCNH RKYIPMLVNL VHTGVVNPAA VLTQQEPLTA VIDAYQNFDQ RKSGWIKVEL K
|
| |