Gene Information |
Locus tag | Noc_1438 |
Symbol | |
ID | 3706046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1594976 |
End bp | 1596094 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637737928 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_343457 |
Protein GI | 77164932 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCATT GGATCACTTT ACCGCGGATG GAGGGGATCA CATCCAGGCA GGCCCATGCT GATCTTCCGT CTGATAGCTA TGAACGGGAG TTGGGCAAGG AGGGCTTCTA TGGTCCCTCA ACCCAAATGT ATCACCGGCA CCCGCCGACG GGTTGGGTGG AGGTAGAAGG CCCCTTGCGG CCGCGAGCTT TTGATACGAC CCGGCTTAAA TCCTATCAGG CCTCCCCCTG GGAGGCTTTT CCTCTCTTTA GCAATGAGCA TTTGCAATGG CGTTTTTGGA GGACAAAGGG TTCCATGGAC CATTTGGCGC GTAACGCCGA TGGGGATGAG CTGTTATTTA TCCATGAAGG CGACGGGGAC TTATATTGCG ACTATGGTCA TTTAGCGTTC CAGGAAGGGG ATTATATACT TCTACCCCGG GGGACTCTGT GGCGGGTAGA AACCGAGAAG CCCCTGGGTG TGCTCTTGCT GGAGGCAACG GGAGATAGTT ATCGGCTTCC TGACAAAGGG ATTGCGGGCA GTCATGCCGT CTTTGACCCG GCGGTGCTGG ATACCCCTAC TATTAATGAA CATTTCTTGG CTCAACAGAC GGAAAGCGAA TGGCGGGTTG TGGTCAAGCG GCGGGATGTT TTGAGTACTA TCATTTATCC GTTCAATCCT CTGGATGCGG TGGGTTGGCA TGGAACTTTG CTTCCGGTGC GGCTTAATTG GCGGGACATC CGACCTATCA TGAGTCATCG TTATCATATT CCGCCCTCGG TCCACACCAC CTTTACCACC AGCCGGTTTG TGGTCTGTAC TTTCTGTCCT CGCCCTATGG AAAGCGACCC CGGGGCTTTA AAGGTGCCTT TTTTCCATAA CAATGATGAT TATGATGAGG TTATTTTCTA CCATAAGGGG GAGTTCTTCT CCCGGGATAA TATCCACCCG GGTATGATGA CCTTGCATCC TAGCGGTATC ACCCACGGCC CCCATCCCAA GGCTTTTGCA GCAGGAAAGA AGGCGCTCCA TAAAGAAACC AATGAAGTTG CGGTTATGGT GGATGCCCGG GACGCTTTGG CGGTGGCGGA ATTTCCCTCC GGGGTGGAGT GGGTTGGTTA TGTGGATTCC TGGAAATGA
|
Protein sequence | MSHWITLPRM EGITSRQAHA DLPSDSYERE LGKEGFYGPS TQMYHRHPPT GWVEVEGPLR PRAFDTTRLK SYQASPWEAF PLFSNEHLQW RFWRTKGSMD HLARNADGDE LLFIHEGDGD LYCDYGHLAF QEGDYILLPR GTLWRVETEK PLGVLLLEAT GDSYRLPDKG IAGSHAVFDP AVLDTPTINE HFLAQQTESE WRVVVKRRDV LSTIIYPFNP LDAVGWHGTL LPVRLNWRDI RPIMSHRYHI PPSVHTTFTT SRFVVCTFCP RPMESDPGAL KVPFFHNNDD YDEVIFYHKG EFFSRDNIHP GMMTLHPSGI THGPHPKAFA AGKKALHKET NEVAVMVDAR DALAVAEFPS GVEWVGYVDS WK
|
| |
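The length fields in the Gene Information table can be cross-checked against the coordinates. The Python sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence above ends in TGA). Both assumptions are inferred from the values in this record, not stated by it. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1594976
END_BP = 1596094


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the gene length
    includes one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1119
    print(protein_length(length_bp))  # 372
```

With the values from this record, the two checks give 1119 bp and 372 aa, which match the Gene Length and Protein Length fields. `gc_percent` accepts a sequence in the space-separated block layout used in the Sequence table, so the Gene sequence value can be pasted in directly to compare against the GC content field.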