Gene Strop_2490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2490 
Symbol 
ID5058953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2800060 
End bp2801229 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content69% 
IMG OID640474748 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001159314 
Protein GI145595017 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAACCTT TCGCGGGCGG GGTGCCCGCA CGACCGGCAC GGGGGATCTG TGGGAAAGGG 
TGGGAGATGG ACATCCGTGG CATAGACCAC ATCGAATTCT ATGTGGGTGA CGCCAGTCAG
GCTGCCTTCT ACTTCGGCAA CGGGGTGGGG ATGCGGCTCT GTGGTCAGGG CGGCCCGGAG
ACCGGGCTGA CCGGGCAGCG GTCGCTGCTG CTACGGCACG CCGACATCCG GTTGCTGCTC
ACCTCCGGGC TGACCGTCGA TCATCCGGCG GCGGAATATG TGCGGCGGCA CGGGGATGGC
ATCGCTGTGG TCGCCTTGGC GGTTGACGAC GCCACCAAGG CGTACGCCGA ACTGCTGGAC
AGGGGCGCGG TCGGTGCGCT GCCACCCACC ACCGTCACCA GCGCGGACGC GGAGGTCGTC
ATCGCCGAGG TGGAAGGTTT CGCCGACGTG CGGCACCGGT TGGTCGAGCG CCGTCGGGGC
GGTCGTGACT TCCTGCCGGG CCTGGCGGAG CTGCCGCCGG TAGCGGAGAC CGCCGAGGAC
CTGCTCGTCG AGATCGACCA TCTGGCGGTG TGCGTGCCGC CCGGGCAGCT CGCCGAGACG
GTATGCGGCT ACCGGGAGGT CTTCGGCTTC GACGAGATCT TCCACGAGTA CGTGGAGGTC
GGCGGCCAGG CGATGAACTC CACCGTGGTG CAGTGCCCGT CCGGGCGGGT GACACTGGTG
CTCCTCGAGC CGGACACCAA CCGGCGGGCT GGGCAGATCG ATGCATTCCT TGCCCAGCAC
TCGGGCGCGG GAGTGCAGCA CCTGGGGTTG CGTACCAACG ACATCATCGA GGCGATCGGC
GCGATGCGTC AGCGAGGGCT TCGGTTCGCG CGCACGCCGG CGGCCTACTA CGACGACCTT
GAGACCCGGG TCGGCCGGGT CGACGGGTCT GTGGACCAGT TGCGGGAGTT CGGTGTGCTG
GTCGACCGGG ACCACGACGG TCAACTGCTG CAGATCTTCA CGGAGTCGAT GCACGTGCGC
CGCACGCTCT TCCTGGAGCT GATCGAGCGG CGTGGGGCAC AGACCTTCGG CAGCGGCAAT
ATCAAGGCGC TCTACGAGGC CAAGGAGCGG GAACTGGCGG TGGCGGCCTC GGCCGGCGGT
GTCGCCGCTA CCCAGGAGGT GACGGGATGA
 
Protein sequence
MQPFAGGVPA RPARGICGKG WEMDIRGIDH IEFYVGDASQ AAFYFGNGVG MRLCGQGGPE 
TGLTGQRSLL LRHADIRLLL TSGLTVDHPA AEYVRRHGDG IAVVALAVDD ATKAYAELLD
RGAVGALPPT TVTSADAEVV IAEVEGFADV RHRLVERRRG GRDFLPGLAE LPPVAETAED
LLVEIDHLAV CVPPGQLAET VCGYREVFGF DEIFHEYVEV GGQAMNSTVV QCPSGRVTLV
LLEPDTNRRA GQIDAFLAQH SGAGVQHLGL RTNDIIEAIG AMRQRGLRFA RTPAAYYDDL
ETRVGRVDGS VDQLREFGVL VDRDHDGQLL QIFTESMHVR RTLFLELIER RGAQTFGSGN
IKALYEAKER ELAVAASAGG VAATQEVTG