Gene Noc_0844 details

Gene Information

Locus tag: Noc_0844
Symbol: glcE
ID: 3707149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 921882
End bp: 922952
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 55%
IMG OID: 637737346
Product: glycolate oxidase FAD binding subunit
Protein accession: YP_342887
Protein GI: 77164362
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID:
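
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS of this length, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(921882, 922952)
print(length)                           # 1071, matches "Gene Length"
print(expected_protein_length(length))  # 356, matches "Protein Length"
```

Under these assumptions the reported gene length and protein length agree with the listed coordinates.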


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCGCA CTCATGACAG CAGCGAAGAA CTTCAGGAAA CCGTCCGTAC TGCGGCAGCG 
GCTCAAACCC CTTTGCGAAT TATAGGTGGT AATACCAAAG TTTTTTATGG CCGTCACCTA
AAAGGTACGC CCCTGAACGT AGGCAAACAC CAGGGCATCA CCAGCTACGA ACCCACCGAA
CTCGTCGTCA CTGCCCGTGC GGGAACACCG TTGGCCGAAA TTGAAACGCT GCTTGCCGAG
CAGGGCCAAA TGCTTACTTT TGAACCCCCT TACTTCGGCT CCAACGCTAC CCTAGGCGGC
GCTGTAGCTA GCGGCCTCTC CGGCCCACGC CGACCCTATG GGGGAGCAGT CCGGGATATG
ATATTAGGGG TACAGATTAT TAACGGCAGG GGCCAGGTGC TACGCTTTGG CGGACGGGTG
ATGAAAAACG TAGCTGGCTA TGATATCTCC CGCTTGATGG CAGGTAGCCT TGGTACCCTA
GGGGTGCTGC TGGAAGTCTC TCTAAAGGTT TCGCCCCGTC CTCCCATGGA AATCACACTC
TCCCAAGAAC GGGATGCCCA CAATGCCATT CGCCTGTTCA ATGTCTGGGC AGCCCAGCCC
CTTCCCCTTT CAGCGTGTGC TTTTGACGGA GAGCATCTTT ACGTACGTCT ATCCGGCTCT
GAAGAGAGCA TCCAGGCTGC GCGCAACAAG GTTGGAGGCG ATAAACTGAG CGACAGCACA
AAGTTCTGGG AAAAAGTGCG AGAACAAACC CATCTTTTTT TCCAACGAAG CACAGCCCCT
CTTTGGCGCT GGTCGGTACC CGCCACAACA CCACCCATCG ATCTGCCGGG GGAATGGAAA
ATTGGCTGGG GCGGAGCCCA ACGCTGGTTT CGCTCCGAAT TGACTGCAGA AACGATACGC
CCTGCCGCCG AGAGCATCAA CGGTTATGCC ACTCTATTCC GTGGTGGAGA TCGTAGCGGC
GAGGTCTTTC ACCCCCTATC TCCCCCACTC ATGGCGCTAC ATCAACATCT CAAACGAGCC
TTCGATCCCC ATGGCATTCT GAATCCAGGA CGAATGTACC AGGAATTCTA G
 
Protein sequence
MMRTHDSSEE LQETVRTAAA AQTPLRIIGG NTKVFYGRHL KGTPLNVGKH QGITSYEPTE 
LVVTARAGTP LAEIETLLAE QGQMLTFEPP YFGSNATLGG AVASGLSGPR RPYGGAVRDM
ILGVQIINGR GQVLRFGGRV MKNVAGYDIS RLMAGSLGTL GVLLEVSLKV SPRPPMEITL
SQERDAHNAI RLFNVWAAQP LPLSACAFDG EHLYVRLSGS EESIQAARNK VGGDKLSDST
KFWEKVREQT HLFFQRSTAP LWRWSVPATT PPIDLPGEWK IGWGGAQRWF RSELTAETIR
PAAESINGYA TLFRGGDRSG EVFHPLSPPL MALHQHLKRA FDPHGILNPG RMYQEF
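
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the blocks of ten characters are separated by whitespace only, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The helper names clean, gc_percent and translate are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are the same for
# the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGATGCGCA CTCATGACAG CAGCGAAGAA"
print(translate(first_blocks))  # MMRTHDSSEE, the first block of the protein sequence
```

Passing the full gene sequence to gc_percent and translate would allow the GC content and protein sequence fields to be checked in the same way.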