Gene Noc_0467 details


Gene Information

Locus tag: Noc_0467
Symbol:
ID: 3706638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 501414
End bp: 503060
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 62%
IMG OID: 637736976
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_342520
Protein GI: 77163995
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
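The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 501414
end_bp = 503060

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values match the reported 1647 bp and 548 aa.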


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGGATG ACGCCTTGCA TGTGCCGCGC GGCAAGGACG GGCGCGCCCC CGATCCCCTG 
CGGGTCGGCG GTTGGGTGCC GATGCGACAA TTCGGTGAGA ATGAGGAAGT GGATTTCGCC
ATTATCGGCA CCGGCGCCGG CGGCGGCACG TTGGCGGCCA AGCTGGCCGA AGCGGGCTTT
TCGGTCATCG GTTTCGATGC CGGCGCATGG TGGCGGCCCC TTGAAGAATT TGCCTCCGAC
GAAACCCACC AAGGCAAACT CTATTGGACC GACGAGCGCC TTTGCGACGG CGATAATCCG
CTGACGCTCG GCAGCAATAA TAGCGGCAAG GCGGTTGGCG GCTCAACGGT GCATTTTGCA
ATGGTGTCGC TGCGCATGCG TCCTGAGCGG TTCAAGTCGC GCACCCTGCT CGGCTATGGC
GCTGACTGGC CGCTCGACTG GCGCGAGATG TGGCATTATT ACACCGAAGT CGAGCAGGCG
CTGAAAATAG CCGGGCCGGT GACTTATCCC TGGGGTCCGC CGCGGCCACG TTATCCGTAC
CGCGCGCACG AGATTAATGC TGCCGGCTGG GTGCTGGCGA AAAGCTGTGA GGCGATGGGT
ATTCCCTGGT CGGAGACACC GCTTGCGACC GTATCCGCCC CGCGCGGCCG CTCGCATCCG
TGCGTCTATC GTGGCTTCTG TGTGACGGGC TGTTCCACCA ACGCCAAGCA GAGCGCGCTG
ATCACTTGGA TTCCCCGCGC CGTAAAAGCC GGTGCGGAGA TCAGGGATCT AGCGATGGTT
GGGCGTATTG AAACCAATGA CGCCGGGCGT GCCACCGGTG TGCATTATTA CCGCGAAGGC
GCTTGGCGCT TCCAGCGCGC GCGCAATGTC GTAGTCGCAG GTTATGGGAT TGAAACTCCA
AGGCTGCTGC TGAATTCGGC AAACGCGCGT TACCCCGATG GATTAGCGAA CAGCTCCGGG
CTGGTTGGTA AGTATCTCAT GGCGCAGACC AACCAGGGCG TCTTTGGCGT GATGGAGGAT
GAGATCCGCT GGTACAAGGG ACCGCCTTCT CTGACCCTGA CGGAGCATTG GAACTACACC
GATGAGGGCA AGGATTTTTT CGGCGGTTAC GCCTACATGG CTCAAGGTCC GTTGCCACAA
GCCTGGGCTG CGACGCAGGC TGGCAATCGC GGCCTGTGGG GCGATGCGCT GCTGCGCGAG
ATGGAAAAAT ATAATCACCA GGCGGGGCTT AAGATCGTCG GCGAAGTGCT GCCGCAGGAG
CGCAACTGCG TCACTCTTGC CGACGAAAAG GATCAATACG GATTGCCTCT CGCGCGGGTA
ACCTACTCGC TTTGCGACAA TGACCAGGCG CTGGTGAAAC ACGCGGTCGA TTTCATGTCC
CAAAGCCTCG CCGCAATCGG TGCCGGCGAC ATCTGGGCCG AGAGCGACGA CACCTGCCAT
CTAGGGGGCA CTGCACGCAT GGGCGACGAT CCGCGCAGCA GCGTCATCGA CGCTGACTGT
CGCTCGTGGG ACATTCCCAA TTTGTGGGTC TGCGACGGCT CGGTGTTTCC GGTCGTCGGC
GGTGTCAATC CATCGCTGAC CATCCAGGCC ATTGCCTGCC GCACTGCTGA TCGAATTCGG
GCGATGGCGG CGCAAGGCAC GCTTTGA
 
Protein sequence
MSDDALHVPR GKDGRAPDPL RVGGWVPMRQ FGENEEVDFA IIGTGAGGGT LAAKLAEAGF 
SVIGFDAGAW WRPLEEFASD ETHQGKLYWT DERLCDGDNP LTLGSNNSGK AVGGSTVHFA
MVSLRMRPER FKSRTLLGYG ADWPLDWREM WHYYTEVEQA LKIAGPVTYP WGPPRPRYPY
RAHEINAAGW VLAKSCEAMG IPWSETPLAT VSAPRGRSHP CVYRGFCVTG CSTNAKQSAL
ITWIPRAVKA GAEIRDLAMV GRIETNDAGR ATGVHYYREG AWRFQRARNV VVAGYGIETP
RLLLNSANAR YPDGLANSSG LVGKYLMAQT NQGVFGVMED EIRWYKGPPS LTLTEHWNYT
DEGKDFFGGY AYMAQGPLPQ AWAATQAGNR GLWGDALLRE MEKYNHQAGL KIVGEVLPQE
RNCVTLADEK DQYGLPLARV TYSLCDNDQA LVKHAVDFMS QSLAAIGAGD IWAESDDTCH
LGGTARMGDD PRSSVIDADC RSWDIPNLWV CDGSVFPVVG GVNPSLTIQA IACRTADRIR
AMAAQGTL
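
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. The example below is an illustration, not the tool used to produce this record: it builds the codon table from the compact amino-acid string for NCBI translation table 11 (the table named in the record), and it assumes the first codon, GTG, is an alternative start codon read as methionine. The function name `translate` and the variable names are hypothetical. Only the first 60 bases of the gene sequence are used.

```python
from itertools import product

# Codon table for NCBI translation table 11, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is an initiation codon and is read as
    methionine (M), even when it is an alternative start such as GTG.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)

# First 60 bases of the gene sequence listed above.
first_60 = "GTGTCGGATG ACGCCTTGCA TGTGCCGCGC GGCAAGGACG GGCGCGCCCC CGATCCCCTG"
print(translate(first_60))
```

The printed residues can be compared with the first 20 residues of the protein sequence above (MSDDALHVPR GKDGRAPDPL).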