Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_4258 |
Symbol | |
ID | 5060743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 4826486 |
End bp | 4827652 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640476520 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001161064 |
Protein GI | 145596767 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0597877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTCT ACCGCCGCGT CGGCGAGGTG CCGCGCAAGC GCCACACCCA GTTCCGTCAA CCCGACGGCC ACCTCTACGC CGAGGAGTTG ATGGGCCAGG AAGGCTTCTC CGCCGACTCC TCGCTGCTCT ACCACCGGTA CGCACCGACC GCGATCGTGG CTGCCGAGGT GTTCAGCCCA CCGACGGTCA CGGGCACGCC GAACCTTCCA CTCAAGCCCC GCCACCTGCG CACCCACCAA CTCGACGCGG CCGGTGCCGA CCCGGTGCTC GGCCGGCGCT ACCTGCTCGG CAACGACGAC GTCCGGATCG CGTACGTGCT AGCCGACCAG CCATCCCCGC TGTTCCGCGA CGCCACCGGC GACCACTGCC TCTATGTCGA GTCCGGCGCC CTGCGGGTCG AGTCCCCCTT CGGCCCACTC GACGCCGTCG CCGGCGACTA CGTGATCATC CCGACCTCGA CCATCCACCG GCTCGTTCCC ACCGGCACCG AACCCGTCCG ACTGTTGGCG ATCGAGGCGG CCGGACATGT CGGCCCGCCC AAGCGTTACC TGTCGGTGCG CGGCCAATTC CTGGAACACT CGCCGTACTG CGAGCGGGAC GTCCGCGGAC CGGACACACC GCTGCTCGTT GACGGCGAGG ACGTCGACGT CCTGGTCCGG CACCGACGCG GCTGGACCCG GCACGTCTAC GCCAATCACC CGTTCGACGT GGTCGGCTGG GACGGACACC TCTACCCCTG GGCATTCTCC ATCCACGACT TCGAGCCGAT CACCGGCCGG ATCCACCAGC CCCCGCCGGT GCACCAGACG TTCCAGGGCC CGAACTTCGT GATCTGCTCG TTCGTCCCCC GCAAGGTTGA CTACCACCCC GACGCCATCC CGGTGCCGTA CAACCACCAC AACGTCGACT CCGACGAGGT GCTCTTCTAC ACCGGGGGCA ACTACGAGGC GCGGCGCGGC TCCGGCATCG GGCAGGGTTC GATCTCGCTA CACCCCTCAG GCTTCACCCA CGGACCGCAG CCCGGTGCCG CCGAGCGGTC GATCGGCGTC GACTACTTCG ACGAACTCGC CGTCATGGTC GACACCTTCC GCCCGCTGGA ACTCTGCGAC GCGGCCGCGG CCTGCGAAGA CGACGCGTAC GCCTGGACCT GGGCGCGGCG CGGGTAG
|
Protein sequence | MPFYRRVGEV PRKRHTQFRQ PDGHLYAEEL MGQEGFSADS SLLYHRYAPT AIVAAEVFSP PTVTGTPNLP LKPRHLRTHQ LDAAGADPVL GRRYLLGNDD VRIAYVLADQ PSPLFRDATG DHCLYVESGA LRVESPFGPL DAVAGDYVII PTSTIHRLVP TGTEPVRLLA IEAAGHVGPP KRYLSVRGQF LEHSPYCERD VRGPDTPLLV DGEDVDVLVR HRRGWTRHVY ANHPFDVVGW DGHLYPWAFS IHDFEPITGR IHQPPPVHQT FQGPNFVICS FVPRKVDYHP DAIPVPYNHH NVDSDEVLFY TGGNYEARRG SGIGQGSISL HPSGFTHGPQ PGAAERSIGV DYFDELAVMV DTFRPLELCD AAAACEDDAY AWTWARRG
|
| |