Gene Strop_4258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4258 
Symbol 
ID5060743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4826486 
End bp4827652 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content69% 
IMG OID640476520 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001161064 
Protein GI145596767 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0597877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTCT ACCGCCGCGT CGGCGAGGTG CCGCGCAAGC GCCACACCCA GTTCCGTCAA 
CCCGACGGCC ACCTCTACGC CGAGGAGTTG ATGGGCCAGG AAGGCTTCTC CGCCGACTCC
TCGCTGCTCT ACCACCGGTA CGCACCGACC GCGATCGTGG CTGCCGAGGT GTTCAGCCCA
CCGACGGTCA CGGGCACGCC GAACCTTCCA CTCAAGCCCC GCCACCTGCG CACCCACCAA
CTCGACGCGG CCGGTGCCGA CCCGGTGCTC GGCCGGCGCT ACCTGCTCGG CAACGACGAC
GTCCGGATCG CGTACGTGCT AGCCGACCAG CCATCCCCGC TGTTCCGCGA CGCCACCGGC
GACCACTGCC TCTATGTCGA GTCCGGCGCC CTGCGGGTCG AGTCCCCCTT CGGCCCACTC
GACGCCGTCG CCGGCGACTA CGTGATCATC CCGACCTCGA CCATCCACCG GCTCGTTCCC
ACCGGCACCG AACCCGTCCG ACTGTTGGCG ATCGAGGCGG CCGGACATGT CGGCCCGCCC
AAGCGTTACC TGTCGGTGCG CGGCCAATTC CTGGAACACT CGCCGTACTG CGAGCGGGAC
GTCCGCGGAC CGGACACACC GCTGCTCGTT GACGGCGAGG ACGTCGACGT CCTGGTCCGG
CACCGACGCG GCTGGACCCG GCACGTCTAC GCCAATCACC CGTTCGACGT GGTCGGCTGG
GACGGACACC TCTACCCCTG GGCATTCTCC ATCCACGACT TCGAGCCGAT CACCGGCCGG
ATCCACCAGC CCCCGCCGGT GCACCAGACG TTCCAGGGCC CGAACTTCGT GATCTGCTCG
TTCGTCCCCC GCAAGGTTGA CTACCACCCC GACGCCATCC CGGTGCCGTA CAACCACCAC
AACGTCGACT CCGACGAGGT GCTCTTCTAC ACCGGGGGCA ACTACGAGGC GCGGCGCGGC
TCCGGCATCG GGCAGGGTTC GATCTCGCTA CACCCCTCAG GCTTCACCCA CGGACCGCAG
CCCGGTGCCG CCGAGCGGTC GATCGGCGTC GACTACTTCG ACGAACTCGC CGTCATGGTC
GACACCTTCC GCCCGCTGGA ACTCTGCGAC GCGGCCGCGG CCTGCGAAGA CGACGCGTAC
GCCTGGACCT GGGCGCGGCG CGGGTAG
 
Protein sequence
MPFYRRVGEV PRKRHTQFRQ PDGHLYAEEL MGQEGFSADS SLLYHRYAPT AIVAAEVFSP 
PTVTGTPNLP LKPRHLRTHQ LDAAGADPVL GRRYLLGNDD VRIAYVLADQ PSPLFRDATG
DHCLYVESGA LRVESPFGPL DAVAGDYVII PTSTIHRLVP TGTEPVRLLA IEAAGHVGPP
KRYLSVRGQF LEHSPYCERD VRGPDTPLLV DGEDVDVLVR HRRGWTRHVY ANHPFDVVGW
DGHLYPWAFS IHDFEPITGR IHQPPPVHQT FQGPNFVICS FVPRKVDYHP DAIPVPYNHH
NVDSDEVLFY TGGNYEARRG SGIGQGSISL HPSGFTHGPQ PGAAERSIGV DYFDELAVMV
DTFRPLELCD AAAACEDDAY AWTWARRG