Gene Noc_0391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0391 
Symbol 
ID3706562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp432095 
End bp433459 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content57% 
IMG OID637736903 
Productaldehyde dehydrogenase 
Protein accessionYP_342447 
Protein GI77163922 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.641534 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTG AATCCGTTAA TCCCGCCACC GGCCAATCCC TCAAAACCTT TGATGCTTGG 
GATCAGAATG CCATTGATGG CGTCCTTCAG CAGGTCCAAG AGGCGAGTCC CCTCTGGGCT
GCTCGCGATC TTTCGGAGCG TTGCCGTTTA TTACGGGCGG CAGCCCAGCA GTTGCGAGAG
CGCAAGGAGG ACTTGGCCCG CCTTATTACC CTGGAGATGG GCAAGCTGCT CGGCGAGGCC
CGGGCAGAGA TCGAAAAATG CGCCTGGGTC TGTGAGTATT ACGAGGAGCA TGCCCCCCGC
TTTCTCGCCG ATGAAGTGAT CGAAAGCGAT GCCCGCCGCA GCTATGTTGC GCTCCAGCCG
TTAGGTACTG TATTGGCGAT TATGCCCTGG AATTTTCCTT TTTGGCAGGT CTTTCGTTTT
TGCGCGCCTG CTCTAGTGGC TGGAAATACG GCGGTGCTGA AACATGCTGC TAATGTTCCC
CAGTGTGGGC TTGCCATTGA GCAGACCTTA CTTGAGGCGG GTTTTCCCCC AGGGGTGTTT
CGTACCTTAC TGGTGAGTTC ATCCCAGACG GCAAAGGTAA TTGCTGATCC GAGAGTCCAG
GGTGTAACCC TCACAGGTAG CGAGGCAGCC GGGCGTAAAG TGGCCGAGTG CGCCGGCCGG
CATTTGAAAA AAACAGTACT GGAGCTGGGA GGCGCAGATC CTTTTATTGT GCTGGCTGAT
GCCGATCTAG AGCAGGCGGT TCCCGTAGCG GTGCAATCCC GTTTTATCAA TGGAGGGCAA
AGCTGCATTG CCGCTAAACG CTTTATCGTT ATGGAGGAAA TAGCGGATGA GTTTATTGCC
CGCTTTCAAG CTGATTTAGA AGCACTGCAG CCTGGTGATC CGCTGGATGA ACAGACTACC
CTGGCGCCAT TGGCCCGATT GGATCTGCGG GCGGGGCTTC ATCAGCAGGT GACGGCGAGC
ATTCAGCAGG GAGCAGTGGC GGTTGCCGGA TGCCAGCCAT TGCCAGGGAC GGGAACCTAC
TATGCCCCTT CTATCCTAGA CCGGGTTCAG CCAGGCATGC CAGCCTTTGA TGAGGAACTT
TTCGGCCCCG TGGCCGCCAT TATTCGCGTA GCGAACGAAG CAGAAGCGGT GGCGATGGCC
AATGCTTCCC GTTACGGCCT TGGAGGCAGT GTCTGGAGCC AAAATACTTC CCGGGCTGAG
CGCCTGGCCC TGGAGCTGCG ATGCGGAGCG GCTTTCGTCA ACGGGCTGGT AAAAAGCGAT
CCTCGCTTGC CCTTTGGCGG GATCAAGTGT TCCGGTTATG GACGGGAACT GTCTTGGCAT
GGGATGCGGG AGTTTACTAA CCAGAAGACC CTATGGATCA AGTGA
 
Protein sequence
MAFESVNPAT GQSLKTFDAW DQNAIDGVLQ QVQEASPLWA ARDLSERCRL LRAAAQQLRE 
RKEDLARLIT LEMGKLLGEA RAEIEKCAWV CEYYEEHAPR FLADEVIESD ARRSYVALQP
LGTVLAIMPW NFPFWQVFRF CAPALVAGNT AVLKHAANVP QCGLAIEQTL LEAGFPPGVF
RTLLVSSSQT AKVIADPRVQ GVTLTGSEAA GRKVAECAGR HLKKTVLELG GADPFIVLAD
ADLEQAVPVA VQSRFINGGQ SCIAAKRFIV MEEIADEFIA RFQADLEALQ PGDPLDEQTT
LAPLARLDLR AGLHQQVTAS IQQGAVAVAG CQPLPGTGTY YAPSILDRVQ PGMPAFDEEL
FGPVAAIIRV ANEAEAVAMA NASRYGLGGS VWSQNTSRAE RLALELRCGA AFVNGLVKSD
PRLPFGGIKC SGYGRELSWH GMREFTNQKT LWIK