Gene Noc_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2167 
Symbol 
ID3704841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2500374 
End bp2501423 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content51% 
IMG OID637738643 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_344157 
Protein GI77165632 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTGC TTGTTACCGG CGCAACGGGT TTTATCGGCC GGCACCTAGT GGCAGCGCTG 
TTATCGAGAA AAGAAACAAT CCGGATATTA GCAAGAAATG TGGAACAGGC CGAGGCAATA
TGGAGTCCGG GTTTACTAGA GGTGGTGCGG GGAGATTTTG CCAATGCCTT AACGCTTGGG
GATTTGTGTG AAGGAGTGGA TATTGTTTTT CACTTGGCTA GCGGCAGTTT TGCCGAAAAC
GATAAGACCG GCGAGGCAGA GTGGCTCCAC CAAAAAGTAG CGGTGGAGGG GACTAAAGAG
CTTCTGAGAC GGGCTGCAAG AGCTGGAGTT AAACGATTTA TTTTTGTGAG TAGCGTCAAG
GCCATGGGTG AGGGAGGAAG TAGCCACCTT GATGAAGCCA GTCCAGAGCT TCCCCAAAGC
GCCTATGGCC GAGGGAAGCT AGCTGCCGAA CGGGCGGTGC TGGAGGCTGG CCGTACTTAC
GGTATGCATG TTTGCAACTT GCGTCTTCCC ATGGTGTATG GCAGCGATGG CAAGGGGAAT
CTCCCCCGAA TGATGGCCGC AATTGATCGG GGGTGGTTTC CACCACTGCC GGAAGTAAAA
AACCGCCGTT CCATGGTGCA TGTAAATGAT GTCGTGCAGG CGCTATTGCT CACAGCCGAA
AATCCCAAGG CAAGCCATCA AACTTATATC GTGACGGATG GTTGTACTTA TTCTTCTCGC
CAGATTTATA TATTGCTTTG CCAGGCCTTA GGCCGTTCTA TTCCGGGGTG GCGTTTTCCG
GCCGGGTTGC TGCAAATGGC TGCCAAAGCG GGGGATCTTG CCGGGTGGGT TCTCAAGAGA
GAAGTGCCTT TGAATTCCCA AGTGTTATAC AAATTAATGG GTTCAGCATG GTATAGCTGC
GCCAAAATAC AAAGAGAACT CGGTTACCGT CCTCGATATT CCCTAGAAAC CGCATTGCCC
GAGATTCTTA AAGCTCTTTG TTTAGTGACT TCTTCTCAAA GATCCTTGGC ATTGTCTCCT
GATAGGAGGG AAGAGGGGAA GCGGAGCTAA
 
Protein sequence
MKVLVTGATG FIGRHLVAAL LSRKETIRIL ARNVEQAEAI WSPGLLEVVR GDFANALTLG 
DLCEGVDIVF HLASGSFAEN DKTGEAEWLH QKVAVEGTKE LLRRAARAGV KRFIFVSSVK
AMGEGGSSHL DEASPELPQS AYGRGKLAAE RAVLEAGRTY GMHVCNLRLP MVYGSDGKGN
LPRMMAAIDR GWFPPLPEVK NRRSMVHVND VVQALLLTAE NPKASHQTYI VTDGCTYSSR
QIYILLCQAL GRSIPGWRFP AGLLQMAAKA GDLAGWVLKR EVPLNSQVLY KLMGSAWYSC
AKIQRELGYR PRYSLETALP EILKALCLVT SSQRSLALSP DRREEGKRS