Gene Noc_0470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0470 
Symbol 
ID3706641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp504875 
End bp505978 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content59% 
IMG OID637736979 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_342523 
Protein GI77163998 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCC GGAGCCTTGA AGTTCCCGTC GAAAAATTGG AAGTTTCCGC TTACCAGATT 
CCTACCGATT TTCCCGAGGC GGATGGAACG CTGGGCTGGG ATTCAACAAC CATCATCGTG
GTCAAACTCC ATGGCGGCGG TAAAGCAGGA CTGGGTTACT CCTACGGAAG TAAGGCGGTA
GCGGTTTTAA TTGATAATCA ACTCCGCAAA ACGGTCATCG GCCAGGATGC CATGGCCATT
GCCGGCAGAT GGCAAGCCAT GGTGAAAGCC ATCCGTAACC TGGGCCGGCC CGGCATCTGC
TCCATGGCCA TTGCCGCCGT GGATACCGCC CTGTGGGATC TGAAAGCCCG TCTCCTTGAT
TTGCCTTTGG TCACCCTGCT GGGCGCGGCG CGGGCGGAGG CGCCGGTTTA TGGCAGCGGC
GGCTTTACCA GTTATTCCCC TGAACAGCTT CAGCAACAAC TGGGCGGCTG GGCAAACGAG
GGGATTCAGG CGGTCAAAAT GAAAGTAGGA AGCGATCCGG AGCAGGACCC GAAACGGGTA
CAGCTTGCAC GGGAGGCTAT CGGTGAGGGG GTGGCGTTGT TTGTGGATGG CAATGGCGCC
TATGGGCGTA AACAGGCGCT GGCCCTAGCC GACTCCTTTA CCAAATACAG GGTCACGTGG
TTTGAAGAAC CGGTTTCATC TGATGATCTG GAGGGCCTTC GCTTGCTTCG CGACCGGGGT
CCAGCCGGGA TGGATATCGC CGCCGGAGAG TACGGCTATG ATCAGTATTA CTTCCGCCGT
ATGTTGGCTG CTGGCGCGGT GGACGTGCTG CAAGCGGACG CCACCCGCTG CGCCGGCATC
ACCGGCTTTA TGGCGGCCAG CGCCCTTTGC CAGGGCTATG GGATTCCCCT TTCCGCCCAT
ACAGCGCCTT CCCTCCACGC CCATCCTGTC TGCGCGCTCC CCCATATCCG CCCCCTGGAG
TATTTCCACG ATCATGTCCG CATTGAAGCG ATGTTCTTCG ACGGCGTTCT TAAACCCGTA
AACGGCGCCC TCCAGCCGGA TACTTCCCGT CCTGGATTGG GCCTAGAGTT ACGGGAGGCG
GATGCCGTCC AGTATGCGGT TTAA
 
Protein sequence
MNARSLEVPV EKLEVSAYQI PTDFPEADGT LGWDSTTIIV VKLHGGGKAG LGYSYGSKAV 
AVLIDNQLRK TVIGQDAMAI AGRWQAMVKA IRNLGRPGIC SMAIAAVDTA LWDLKARLLD
LPLVTLLGAA RAEAPVYGSG GFTSYSPEQL QQQLGGWANE GIQAVKMKVG SDPEQDPKRV
QLAREAIGEG VALFVDGNGA YGRKQALALA DSFTKYRVTW FEEPVSSDDL EGLRLLRDRG
PAGMDIAAGE YGYDQYYFRR MLAAGAVDVL QADATRCAGI TGFMAASALC QGYGIPLSAH
TAPSLHAHPV CALPHIRPLE YFHDHVRIEA MFFDGVLKPV NGALQPDTSR PGLGLELREA
DAVQYAV