Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0391 |
Symbol | |
ID | 3706562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 432095 |
End bp | 433459 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637736903 |
Product | aldehyde dehydrogenase |
Protein accession | YP_342447 |
Protein GI | 77163922 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.641534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTTG AATCCGTTAA TCCCGCCACC GGCCAATCCC TCAAAACCTT TGATGCTTGG GATCAGAATG CCATTGATGG CGTCCTTCAG CAGGTCCAAG AGGCGAGTCC CCTCTGGGCT GCTCGCGATC TTTCGGAGCG TTGCCGTTTA TTACGGGCGG CAGCCCAGCA GTTGCGAGAG CGCAAGGAGG ACTTGGCCCG CCTTATTACC CTGGAGATGG GCAAGCTGCT CGGCGAGGCC CGGGCAGAGA TCGAAAAATG CGCCTGGGTC TGTGAGTATT ACGAGGAGCA TGCCCCCCGC TTTCTCGCCG ATGAAGTGAT CGAAAGCGAT GCCCGCCGCA GCTATGTTGC GCTCCAGCCG TTAGGTACTG TATTGGCGAT TATGCCCTGG AATTTTCCTT TTTGGCAGGT CTTTCGTTTT TGCGCGCCTG CTCTAGTGGC TGGAAATACG GCGGTGCTGA AACATGCTGC TAATGTTCCC CAGTGTGGGC TTGCCATTGA GCAGACCTTA CTTGAGGCGG GTTTTCCCCC AGGGGTGTTT CGTACCTTAC TGGTGAGTTC ATCCCAGACG GCAAAGGTAA TTGCTGATCC GAGAGTCCAG GGTGTAACCC TCACAGGTAG CGAGGCAGCC GGGCGTAAAG TGGCCGAGTG CGCCGGCCGG CATTTGAAAA AAACAGTACT GGAGCTGGGA GGCGCAGATC CTTTTATTGT GCTGGCTGAT GCCGATCTAG AGCAGGCGGT TCCCGTAGCG GTGCAATCCC GTTTTATCAA TGGAGGGCAA AGCTGCATTG CCGCTAAACG CTTTATCGTT ATGGAGGAAA TAGCGGATGA GTTTATTGCC CGCTTTCAAG CTGATTTAGA AGCACTGCAG CCTGGTGATC CGCTGGATGA ACAGACTACC CTGGCGCCAT TGGCCCGATT GGATCTGCGG GCGGGGCTTC ATCAGCAGGT GACGGCGAGC ATTCAGCAGG GAGCAGTGGC GGTTGCCGGA TGCCAGCCAT TGCCAGGGAC GGGAACCTAC TATGCCCCTT CTATCCTAGA CCGGGTTCAG CCAGGCATGC CAGCCTTTGA TGAGGAACTT TTCGGCCCCG TGGCCGCCAT TATTCGCGTA GCGAACGAAG CAGAAGCGGT GGCGATGGCC AATGCTTCCC GTTACGGCCT TGGAGGCAGT GTCTGGAGCC AAAATACTTC CCGGGCTGAG CGCCTGGCCC TGGAGCTGCG ATGCGGAGCG GCTTTCGTCA ACGGGCTGGT AAAAAGCGAT CCTCGCTTGC CCTTTGGCGG GATCAAGTGT TCCGGTTATG GACGGGAACT GTCTTGGCAT GGGATGCGGG AGTTTACTAA CCAGAAGACC CTATGGATCA AGTGA
|
Protein sequence | MAFESVNPAT GQSLKTFDAW DQNAIDGVLQ QVQEASPLWA ARDLSERCRL LRAAAQQLRE RKEDLARLIT LEMGKLLGEA RAEIEKCAWV CEYYEEHAPR FLADEVIESD ARRSYVALQP LGTVLAIMPW NFPFWQVFRF CAPALVAGNT AVLKHAANVP QCGLAIEQTL LEAGFPPGVF RTLLVSSSQT AKVIADPRVQ GVTLTGSEAA GRKVAECAGR HLKKTVLELG GADPFIVLAD ADLEQAVPVA VQSRFINGGQ SCIAAKRFIV MEEIADEFIA RFQADLEALQ PGDPLDEQTT LAPLARLDLR AGLHQQVTAS IQQGAVAVAG CQPLPGTGTY YAPSILDRVQ PGMPAFDEEL FGPVAAIIRV ANEAEAVAMA NASRYGLGGS VWSQNTSRAE RLALELRCGA AFVNGLVKSD PRLPFGGIKC SGYGRELSWH GMREFTNQKT LWIK
|
| |