Gene Noc_2605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2605 
Symbol 
ID3704360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2958648 
End bp2960132 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content54% 
IMG OID637739086 
Productaldehyde dehydrogenase 
Protein accessionYP_344588 
Protein GI77166063 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000129908 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGT CATCCCCATT GCCCCTCACC CACAAAGCGC AGGTAGAGGT ACGCCAAACG 
CGCTTGCTGA TTGATGGCAA ATTTTGCGAC AGCCTCAGCG GTAAGACCTT TGCCACCATC
GATCCCGCTA CTGAAGAAGT CATTGCCCAG GTGGCGGAGG GCGATGCGGA AGATATTGAC
CTGGCGGTTC AAGCAGCCCG CAAAGCTTTC GATAGCGGCC CCTGGCATCG GATGGACGCA
CGGGAACGGG GCCGCCGAAT GTTGAAGTGG GCCGATTTGA TTGAAGCTCA TATGGAAGAG
CTAGCTAAAC TTGAAGTCTT GGACAATGGG AAACCCATCA ACGAGGCCCT AGGCTACGAT
ATTCCCAGCG CCGCGGCTAC GCTTCGTTAT TTTGCCGGCT GGGCGGATAA AATCCATGGT
AAAACCATTC CTATTTCCGG GCCTTTCTTC ACTTATACGC GCCGGGAGCC GGTGGGGGTT
TGCGGGCTCA TCATTCCCTG GAATTTTCCT CTGGCAATGG CGGCTTGGAA GTTAGGCCCT
GCCCTAGCCA CCGGCTGCAC CGCTATTCTA AAGCCGGCGG AACAAACTCC CCTGACGGCC
CTTCGTGCCG GTGAGTTGGC CCTGGAGGCA GGTATTCCCC CGGGAGTCCT CAATATCGTG
CCCGGTTTTG GCCCTACGGC GGGCGCAGCC CTGGTACAGC ACCCTCTGGT TGAGAAAATC
GCCTTCACGG GAGAATATAA AACCGCCCAA ACGATTAAGC AGGCCACAGT CAACAGCATG
AAACGCCTGT CCTTTGAGCT GGGAGGTAAA AGCCCTAACA TCATCTTTAA TGATGCTAAT
CTCGAAGAAG CTATTACGGG TTCCTTTGGG GCTATTTTTC TAAATCAAGG ACAAAATTGT
TGTGCGGGTA GCCGCGCCTT TGTACAAAAC AATATTTATG ATGAATTCGT GGAGCAATTT
GCCGAGAAGG CAGCCAAGCG TAAACTGGGC GATCCCTTCG ATCCCGCTAC CGAGCATGGA
GCCCAGATCG ACAAGGCCCA GTTCGATAAA ATCATGCACT ATATCGCCCT TGGCAAGGAG
CAAGGCGCGG AATGTGTCAC CGGCGGCGAG CGGGCCTTTG AGCGAGGTTA TTTTATCCAA
CCCACCATCT TCAAGGAAGT CAATGAAAAT ATGGCCATCG CAACGGACGA GATTTTTGGG
CCAGTGGCTA GTGTGCTACG CTTCAAGAAC ATTAATGAGG TCATTGAAAA AGCCAATAAC
ACCCCGTTTG GCCTCGCGGC GGCCGTATGG ACCCAAGATA TCGACAAAGC CAACGCGGTA
GCCGCAGGGG TAAAAGCGGG AACCGTCTGG GTCAACTGCT ACAATATTGT CGATCCAGCG
GCGCCTTTTG GCGGCTTTAA ACTATCCGGG CTCGGCCGGG AACTAGGCGA ACAAGCCCTG
GATGCCTACA CGGAAACCAA AACGGTCACT GTATTGCGGA GGTAA
 
Protein sequence
MSQSSPLPLT HKAQVEVRQT RLLIDGKFCD SLSGKTFATI DPATEEVIAQ VAEGDAEDID 
LAVQAARKAF DSGPWHRMDA RERGRRMLKW ADLIEAHMEE LAKLEVLDNG KPINEALGYD
IPSAAATLRY FAGWADKIHG KTIPISGPFF TYTRREPVGV CGLIIPWNFP LAMAAWKLGP
ALATGCTAIL KPAEQTPLTA LRAGELALEA GIPPGVLNIV PGFGPTAGAA LVQHPLVEKI
AFTGEYKTAQ TIKQATVNSM KRLSFELGGK SPNIIFNDAN LEEAITGSFG AIFLNQGQNC
CAGSRAFVQN NIYDEFVEQF AEKAAKRKLG DPFDPATEHG AQIDKAQFDK IMHYIALGKE
QGAECVTGGE RAFERGYFIQ PTIFKEVNEN MAIATDEIFG PVASVLRFKN INEVIEKANN
TPFGLAAAVW TQDIDKANAV AAGVKAGTVW VNCYNIVDPA APFGGFKLSG LGRELGEQAL
DAYTETKTVT VLRR