Gene Strop_4256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4256 
Symbol 
ID5060741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4824343 
End bp4825539 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content72% 
IMG OID640476518 
Productfumarylacetoacetase 
Protein accessionYP_001161062 
Protein GI145596765 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0938214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGGG TAACCGGCCT GGAGGGCTCG CCGTACGGGC TGCACAACCT GCCGTACGGG 
GTGTTCCGTA CCGCCGACCG CGAGCCGCGG GTGGGCGTAC GCGTCGGCAC CCACGTACTG
GACCTCGGTG GCGCCGAGCA GGCCGGCCTG GTGTTTGCCG CCGGCACACT GAGTCGCCCT
CGGCTGAACG ACTTCATGGC GCTCGGACGT CCGCAGTGGA CGGCGGTACG GCAGCGGATC
GTCGAGCTGC TCACCGACAG CACCCACCGG GCGGCCGTCG AGCCGCTGCT CCTGGCACTG
GCGAACGTGG AGCTCCTGCT CCCCTTCGAC GTGGCGGACT ACGTCGACTT CTACTCCTCC
GAGCAGCATG CCAGCAACGT CGGTCAGATC TTCCGGCCGG GCCAGCCGCC GCTGCTGCCC
AACTGGAAGC ACCTGCCGAT CGGCTACCAC GGCCGGGCCG GCACCGTCGT GGTCTCCGGC
ACCCCGATCG TCCGACCGTG CGGCCAGCGG GCGACCCCGC AGGGCCCGGT CACCGGCCCC
TCGGTCCGCC TCGACATCGA GGCGGAGGTC GGCTTTGTGG TGGGTGTACC CAGTCCGTTG
GGCAGCCGCG TGCCGGTCGG TGACTTCGCC GACCACGTGT TCGGCGTGGT GCTGGTCAAC
GACTGGTCCG CCCGGGACAT CCAGGCCTGG GAGTACCAGC CCCTCGGACC GTTCCTCGGC
AAGTCCTTTG CCACCTCGGT CGCGGCCTGG GTGACGCCGC TGGAGGCGCT TGGCGCGGCG
TTCGTGCCGG CGCCGGAGCA GGATCCGCCG GTCGCTGACT ACCTGCGCGA CGAGCCCCAC
CTCGGGCTGG ATCTGCGTCT GTCGGTGGAG TGGAACGGCG AGCGGGTGAG CGAGCCGCCG
TTCGCCACGA TGTACTGGAC CCCGGCGCAA CAGCTGGCCC ACCTGACCGT CAACGGAGCG
GCCCTGCGCA CCGGCGACCT CTACGCCTCC GGCACCGTCT CCGGCGCCGA GCGCGGTCAG
GTCGGCTCGT TCCTCGAGCT GACCTGGGGC GGCGCGGAGC CGGTCCGGAT CGGTGGCAGC
GAACGCACTT TCCTGGCCGA CGGCGACACC ATCACCATCA CCGGCACCGC GCCTGGCCCG
AACGGCACGA CCGTCGGCCT CGGCGAGGTC ACCGGCACCG TCATCGCCCC CCGCTGA
 
Protein sequence
MTWVTGLEGS PYGLHNLPYG VFRTADREPR VGVRVGTHVL DLGGAEQAGL VFAAGTLSRP 
RLNDFMALGR PQWTAVRQRI VELLTDSTHR AAVEPLLLAL ANVELLLPFD VADYVDFYSS
EQHASNVGQI FRPGQPPLLP NWKHLPIGYH GRAGTVVVSG TPIVRPCGQR ATPQGPVTGP
SVRLDIEAEV GFVVGVPSPL GSRVPVGDFA DHVFGVVLVN DWSARDIQAW EYQPLGPFLG
KSFATSVAAW VTPLEALGAA FVPAPEQDPP VADYLRDEPH LGLDLRLSVE WNGERVSEPP
FATMYWTPAQ QLAHLTVNGA ALRTGDLYAS GTVSGAERGQ VGSFLELTWG GAEPVRIGGS
ERTFLADGDT ITITGTAPGP NGTTVGLGEV TGTVIAPR