Gene Noc_2822 details

Gene Information

Locus tag: Noc_2822
Symbol:
ID: 3705571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3198590
End bp: 3199585
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 52%
IMG OID: 637739298
Product: dihydroorotate dehydrogenase 1
Protein accession: YP_344799
Protein GI: 77166274
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein
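
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3198590
end_bp = 3199585

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 996 331
```

Under these assumptions the computed values agree with the reported Gene Length (996 bp) and Protein Length (331 aa).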


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0209539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAAA CATTCAATAC TGACTTAAGT GATACCGATT GGGCAAGGCT AAAGGTTGAT 
TTTTGTGGGC TGGAGCTGCA AAGCCCCTTA GTATTGCTTT CAGGTTGCGT CGGTTTTGGG
GAAGAATATA CTCGGGTAGT GGGTTTCTCC AACCGGGAGG TGGGAGCGGT ATGTCTCAAG
GGAACCACGG CGGCTCCCCG CTTGGGAAAT GCCCTCCATC GGATTTATGA AACGCCCATG
GGCATGCTCA ATGCCATTGG CCTGCAAAAT CCCGGCGTAG ATTATGTAGT CGATCATATC
TTGCCAGCGC TTGACTTTAG CGAAACCCGC TATATCGCCA ATGTTTCTGG CTCCACTATT
GAAGAGTATA CGGCAGTCAC CCGCCGCTTC GACAATTCCC CAATTGATGC CATAGAAATC
AATATTTCTT GCCCTAATGT AAAAGAAGGG GGCGTTGCTT TTGGCAACGA TCCCCATATG
TCGGCGCGGG TGGTGGAGGC CTGTCGAAAG GTGACCCGTA AACCCCTGAT CACCAAGCTT
TCCCCTAACC AAACCTCAAT AGAAGAAAAT GCCCGTCGCT GTATCGAAGC GGGAACGGAT
GGGTTTGCCG TCATCAATAC CTTGATGGGA ATGGCCATTG ATATAGAGCA GCGCACTCCG
CTTCTCGGAA ATATCCAGGG GGGATTGTCG GGGCCCGCCA TAAAGCCGAT TGCCTTACTC
AAGGTGCGTC AAGTCTATCA GGCATGCCGG GCGCATGGCA TCCCAATTAT TGGGCAGGGG
GGAGTCGCTT CTGGCAAAGA TGCTCTGGAA TTTCTCATTG CGGGCGCTAC TACGGTGGGA
GTAGGTACCG CCTTGTTTTA TGACCCTTTG CTTTGCGCCA AAATCAACGC GGAAATTGTA
GCTTACCTCA AGCGCCATGA CTTGAGAGCG GTGGCGCAAT TGACGGGCAG CTTGCGTTTA
GCGGAGGAAG TCTCGGACTG TGTTGTGAGT GGCTAA
 
Protein sequence
MAETFNTDLS DTDWARLKVD FCGLELQSPL VLLSGCVGFG EEYTRVVGFS NREVGAVCLK 
GTTAAPRLGN ALHRIYETPM GMLNAIGLQN PGVDYVVDHI LPALDFSETR YIANVSGSTI
EEYTAVTRRF DNSPIDAIEI NISCPNVKEG GVAFGNDPHM SARVVEACRK VTRKPLITKL
SPNQTSIEEN ARRCIEAGTD GFAVINTLMG MAIDIEQRTP LLGNIQGGLS GPAIKPIALL
KVRQVYQACR AHGIPIIGQG GVASGKDALE FLIAGATTVG VGTALFYDPL LCAKINAEIV
AYLKRHDLRA VAQLTGSLRL AEEVSDCVVS G
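
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helper names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons). Only the first 30 bases and the last 6 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
head = "ATGGCTGAAA CATTCAATAC TGACTTAAGT"
# Last six bases of the gene sequence listed above.
tail = "GGCTAA"

print(translate(head))  # MAETFNTDLS
print(translate(tail))  # G*
print(round(gc_fraction(head), 3))  # 0.333
```

The translated fragment matches the first ten residues of the protein sequence, and the final codon translates as a stop, which is why the protein ends at the preceding glycine. The GC fraction printed here describes only the 30-base fragment; the 52% figure in the record refers to the whole gene.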