Gene Noc_1710 details


Gene Information

Locus tag: Noc_1710
Symbol:
ID: 3704632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1913666
End bp: 1915207
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 55%
IMG OID: 637738191
Product: aldehyde dehydrogenase
Protein accession: YP_343712
Protein GI: 77165187
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
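
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming 1-based inclusive coordinates (an assumption, although it reproduces the listed gene length) and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 1913666
END_BP = 1915207


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1542 513
```

Both results agree with the Gene Length (1542 bp) and Protein Length (513 aa) fields.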


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAC TTAAGGATTT GGGTCTAGAA GATTTCAATC CGGGGGTGTG TTGGGGGCCG 
GGGTGGTGGT CTGGAGCGGA TTCCCGTCGG CGAATCGATT CCAGCAACCC AGCCACTGAA
AAGCCGATTG CTAGTGTTGG GGCAGCGACC GCGGCCGATG TGGAAACCCT CATAGGTGCC
TCCTGGGAGA ATTTTAGAAC CTGGCGTGCC GTCCCCGCGC CGGTACGGGG AGATTTGGTG
CGCCGCCTGG GTGAGTCTCT GCGGGTCCAT AAAGATCGCC TGGGCAGCCT GGTGAGCTTG
GAGACGGGTA AGATCAAGGA AGAAGGGGAT GGGGAAGTAC AGGAAATGAT TGATATGGCG
GATTTTGCCG TGGGGCAATC CCGGATGCTC TATGGCAAGA CCATGCACTC GGAACGTCCC
AGCCATCGGA TGTATGAACA ATGGCATCCC TTAGGACCGG TAGGGGTGAT TACCGCCTTT
AATTTTCCGG TTGCCGTATG GGCCTGGAAT GCCCTGATTG CCGCTATTTG CGGCAATACA
GTCATTTGGA AGCCCTCTCC CAAGGCGCCC TTAACGGCCG TCGCTGTGCA GCACCTTTGT
AATCGAGTGA TGGAGGAAGC CGGTTATCCA GGGGTGTTTA ACCTCTTGGT GACCGATGAG
AATCCACTGG CTGAAAGTTT GGTGCAAGAC CGGCGGATTC CTTTGATTTC TTTTACTGGC
TCCACCAAAG TAGGACGGTG GGTGAGTCGG TTAGTGGCTG CGCGGTTGGG ACGAAGCCTG
CTGGAACTCT CTGGCAATAA CGCAGTGATT GTCGATGAAA CCGCTGATCT CGACTTGGCA
GTGCCGGCGG TAGTTTTTGG GGCGGTCGGC ACGGCGGGGC AGCGTTGCAC CACCACACGG
CGTCTGATTG TGCATGAAAA CTGCTATGAG GAGCTGATTT CCCGGCTAAT CCATGCTTAC
CGGCAATTGC CCATTGGCGA TCCCTTAGAT AGAAAAACCT TAGTGGGACC CCTTATTGAT
GCCGAGGCCG TGGCAAAGTT TAGCGATGCC ATAGCAACGC TTAAGCAGCG GGGAGGTGAA
ATTCTCTACG GTGGCCGTGT GCTTGAGAGG GGTGGATATT TTGTCGAGCC TACCTTGGTT
CGGGCCGAAA ATCATTGGGA AATGGTGCAG CGGGAAACTT TTGCTCCCAT CCTTTACCTT
ATTCCCTTTA AAACCCTGGA GGAGGCGATT GCACTGAACA ATGCCGTACC TCAAGGGTTT
TCCTCGTCAT TATTTACCAC TCATCTCCAG CATGCCGAAC GATTTCTCTC CCACTGGGGC
AGTGATTGTG GTATCGCCAA CATTAATATG GGTACGTCAG GGGCTGAGAT CGGCGGGGCT
TTTGGGGGTG AGAAGGAAAC TGGAGGAGGT CGAGAAGCGG GCTCGGATGC TTGGAAAAAC
TATATGCGGC GGCAAACCAA TACCATCAAT TGGGGCACGG AATTGCCCCT GGCTCAAGGA
ATTCGCTTCG AGCTGGAGGG GGAGAGCCCA CCAGAGAGAT GA
 
Protein sequence
MKLLKDLGLE DFNPGVCWGP GWWSGADSRR RIDSSNPATE KPIASVGAAT AADVETLIGA 
SWENFRTWRA VPAPVRGDLV RRLGESLRVH KDRLGSLVSL ETGKIKEEGD GEVQEMIDMA
DFAVGQSRML YGKTMHSERP SHRMYEQWHP LGPVGVITAF NFPVAVWAWN ALIAAICGNT
VIWKPSPKAP LTAVAVQHLC NRVMEEAGYP GVFNLLVTDE NPLAESLVQD RRIPLISFTG
STKVGRWVSR LVAARLGRSL LELSGNNAVI VDETADLDLA VPAVVFGAVG TAGQRCTTTR
RLIVHENCYE ELISRLIHAY RQLPIGDPLD RKTLVGPLID AEAVAKFSDA IATLKQRGGE
ILYGGRVLER GGYFVEPTLV RAENHWEMVQ RETFAPILYL IPFKTLEEAI ALNNAVPQGF
SSSLFTTHLQ HAERFLSHWG SDCGIANINM GTSGAEIGGA FGGEKETGGG REAGSDAWKN
YMRRQTNTIN WGTELPLAQG IRFELEGESP PER
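
The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it uses the standard codon assignments (translation table 11 shares these with the standard code and differs in which start codons it permits), and it translates only the first 30 nt and the final line of the gene sequence. The `translate` helper and variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last line of the gene sequence, copied from the record.
first_block = "ATGAAATTAC TTAAGGATTT GGGTCTAGAA"
last_block = "ATTCGCTTCG AGCTGGAGGG GGAGAGCCCA CCAGAGAGAT GA"

print(translate(first_block))  # MKLLKDLGLE
print(translate(last_block))   # IRFELEGESPPER
```

The two outputs match the start and the end of the listed protein sequence. Each full sequence line holds 60 nt (20 codons), so the last line begins in frame.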