Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_1439 |
Symbol | |
ID | 3933886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 1406329 |
End bp | 1407711 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637903789 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_509381 |
Protein GI | 89053930 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAC AAGCCACACG CAGACCGATG ATCCAGGCCC CCAGCCACCC CGGCCCCCAT GACGGCTATA TGCCGGGCTT CGGCAATGAC TTTGAGACAG AGGCGCTGCC GGGTGCGCTG CCGCAGGGGA TGAACTCGCC CCAGAAGGTA AATTACGGCC TCTACGGCGA ACAGCTCTCC GGCACCGCCT TCACCGATGT GCGGCCTGAG CGGACGTGGT GTTACCGCAT CCGTCCCTCC GTCAAGCACT CACACCGTTA CTCCAAGATC GACCTGCCCT ACTGGCACTC CGCGCCGACC ATTGACCCCG ATGTGATCTC ACTAGGTCAG TACCGCTGGG ACCCTGTTCC CCATTCCGAC ACGCCGCTGA CCTGGCTCAC CGGCATGCGC ACCATGACCA GCGCGGGGGA TGTGAACACG CAGGTCGGCA TGGCGACCCA TGTCTATCTG GTCACGCAAA GCATGGTGGA CGATTATTTC TACTCCGCTG ACAGTGAGAT GCTGGTTGTC CCGCAGGAGG GCCGCCTGCG CTTCTGCACC GAGCTTGGCA TCATCGACGT GGAGCCGCAG GAGATCGCCA TTCTGCCACG CGGTCTTGTG TACCGGGTGG AGGTGCTGGA CGGCCCCGCG CGCGGCTTTG TTTGCGAAAA CTACGGCGCG AAGTTTGACC TTCCGGGGCG CGGCCCCATT GGCGCGAACT GCATGGCCAA CCCGCGCGAC TTCAAGGCCC CCGTCGCGGC CTATGAGGAC CGCGAAGTGC CGTCCACAAT CACGATCAAA TGGTGCGGCC AGTTCCACAC GTCGAAGATC GCGCAGAGCC CGCTGGATGT CGTGGCCTGG CACGGCAATT ACGCGCCCTA CAAATACGAT CTGAAGACCT ATTGCCCCGT CGGCGCGATC CTGTTCGACC ACCCGGACCC GTCGATCTTC ACGGTGCTGA CCGCGCCATC GGGCCAACCG GGCGTCGCCA ACATCGACTT CGTGCTGTTC CGCGAGCGCT GGATGGTGGC CGAAAACACG TTCCGCCCGC CGTGGTATCA CAAGAACATC ATGTCCGAAC TGATGGGCAA CATCTACGGC CAATATGACG CCAAGCCCAA GGGGTTCGTG CCCGGCGGTA TCTCCCTGCA CAACATGATG ATCCCCCACG GCCCCGACAA AAACGCCTTC GAGGGCGCGT CAAACGCCGA CCTTCAGCCG CAGAAGCTCG ATAACACAAT GTCCTTCATG TTCGAGACCC GCTTCCCCCA ACACCTCACG GCCTTTGCGG CGAATGAGGC CCCGTTGCAG GACGACTACA TCGACTGCTG GGAAACGCTG GAGAAGAAGT TTGATCCCTC GCAGCGCCCC GATGCGGGTC ACGGGACGCC GGGCAAAAAA TGA
|
Protein sequence | MNEQATRRPM IQAPSHPGPH DGYMPGFGND FETEALPGAL PQGMNSPQKV NYGLYGEQLS GTAFTDVRPE RTWCYRIRPS VKHSHRYSKI DLPYWHSAPT IDPDVISLGQ YRWDPVPHSD TPLTWLTGMR TMTSAGDVNT QVGMATHVYL VTQSMVDDYF YSADSEMLVV PQEGRLRFCT ELGIIDVEPQ EIAILPRGLV YRVEVLDGPA RGFVCENYGA KFDLPGRGPI GANCMANPRD FKAPVAAYED REVPSTITIK WCGQFHTSKI AQSPLDVVAW HGNYAPYKYD LKTYCPVGAI LFDHPDPSIF TVLTAPSGQP GVANIDFVLF RERWMVAENT FRPPWYHKNI MSELMGNIYG QYDAKPKGFV PGGISLHNMM IPHGPDKNAF EGASNADLQP QKLDNTMSFM FETRFPQHLT AFAANEAPLQ DDYIDCWETL EKKFDPSQRP DAGHGTPGKK
|
| |