Gene Noc_1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1438 
Symbol 
ID3706046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1594976 
End bp1596094 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content53% 
IMG OID637737928 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_343457 
Protein GI77164932 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCATT GGATCACTTT ACCGCGGATG GAGGGGATCA CATCCAGGCA GGCCCATGCT 
GATCTTCCGT CTGATAGCTA TGAACGGGAG TTGGGCAAGG AGGGCTTCTA TGGTCCCTCA
ACCCAAATGT ATCACCGGCA CCCGCCGACG GGTTGGGTGG AGGTAGAAGG CCCCTTGCGG
CCGCGAGCTT TTGATACGAC CCGGCTTAAA TCCTATCAGG CCTCCCCCTG GGAGGCTTTT
CCTCTCTTTA GCAATGAGCA TTTGCAATGG CGTTTTTGGA GGACAAAGGG TTCCATGGAC
CATTTGGCGC GTAACGCCGA TGGGGATGAG CTGTTATTTA TCCATGAAGG CGACGGGGAC
TTATATTGCG ACTATGGTCA TTTAGCGTTC CAGGAAGGGG ATTATATACT TCTACCCCGG
GGGACTCTGT GGCGGGTAGA AACCGAGAAG CCCCTGGGTG TGCTCTTGCT GGAGGCAACG
GGAGATAGTT ATCGGCTTCC TGACAAAGGG ATTGCGGGCA GTCATGCCGT CTTTGACCCG
GCGGTGCTGG ATACCCCTAC TATTAATGAA CATTTCTTGG CTCAACAGAC GGAAAGCGAA
TGGCGGGTTG TGGTCAAGCG GCGGGATGTT TTGAGTACTA TCATTTATCC GTTCAATCCT
CTGGATGCGG TGGGTTGGCA TGGAACTTTG CTTCCGGTGC GGCTTAATTG GCGGGACATC
CGACCTATCA TGAGTCATCG TTATCATATT CCGCCCTCGG TCCACACCAC CTTTACCACC
AGCCGGTTTG TGGTCTGTAC TTTCTGTCCT CGCCCTATGG AAAGCGACCC CGGGGCTTTA
AAGGTGCCTT TTTTCCATAA CAATGATGAT TATGATGAGG TTATTTTCTA CCATAAGGGG
GAGTTCTTCT CCCGGGATAA TATCCACCCG GGTATGATGA CCTTGCATCC TAGCGGTATC
ACCCACGGCC CCCATCCCAA GGCTTTTGCA GCAGGAAAGA AGGCGCTCCA TAAAGAAACC
AATGAAGTTG CGGTTATGGT GGATGCCCGG GACGCTTTGG CGGTGGCGGA ATTTCCCTCC
GGGGTGGAGT GGGTTGGTTA TGTGGATTCC TGGAAATGA
 
Protein sequence
MSHWITLPRM EGITSRQAHA DLPSDSYERE LGKEGFYGPS TQMYHRHPPT GWVEVEGPLR 
PRAFDTTRLK SYQASPWEAF PLFSNEHLQW RFWRTKGSMD HLARNADGDE LLFIHEGDGD
LYCDYGHLAF QEGDYILLPR GTLWRVETEK PLGVLLLEAT GDSYRLPDKG IAGSHAVFDP
AVLDTPTINE HFLAQQTESE WRVVVKRRDV LSTIIYPFNP LDAVGWHGTL LPVRLNWRDI
RPIMSHRYHI PPSVHTTFTT SRFVVCTFCP RPMESDPGAL KVPFFHNNDD YDEVIFYHKG
EFFSRDNIHP GMMTLHPSGI THGPHPKAFA AGKKALHKET NEVAVMVDAR DALAVAEFPS
GVEWVGYVDS WK