Gene Svir_31830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSvir_31830 
Symbol 
ID8388507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharomonospora viridis DSM 43017 
KingdomBacteria 
Replicon accessionNC_013159 
Strand
Start bp3452118 
End bp3453302 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content64% 
IMG OID644977210 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_003134983 
Protein GI257057151 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.905898 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATC CAGCACTCGA CGACGTCAGC TACGACCAAC TCCGACAACT CGTCGGTCTG 
GTCGACCACG ATCCGACCAA GGACCCCTTC CCCGTCAAGG CGATGGACGC GGTGGTCTTC
GTGGTCGGTA ACGCCACCCA GACCGCGCAC TTCTACCAGT CGGCGTTCGG CATGGACCTC
GTCGCCTACT CCGGACCGGA AACGGGCAAC CCCGAGTACA AATCGTTCGT CCTCAAGTCG
GGTTCCGCGC GGTTCGTGGT CAACGGCGGG GTGAAGCCGG ACTCGCCGCT GCTGGACCAC
CACCGCAAGC ACGGTGACGG CGTCATCGAC CTCGCGCTCG AAGTAGCCGA TGTGGACAAG
TGCGTCGAAC ACGCCAGGGC GAACGGGGCC ACGATCCTGG ACGAACCGTA CGAGGTCTCC
GACGAACACG GCACCGTACG CATGGCGGCC ATAGCGGCCT ACGGCGACAC CCGGCACACG
CTCGTGGACC GCTCCCGCTA CTCCGGCCCC TACCTGCCGG GATACGAGGC GCGTACCCGC
AGCGTGCCCA AGCCCGAGGG AGCACCGAAA CGGCTGTTCC AGGCCATCGA CCACTGTGTC
GGCAACGTCG AACTCGGCAA GATGGACGAA TGGGTGGGGT TCTACCACCG GGTCATGGGC
TTCGTGAACA TGGCCGAGTT CGTGGGTGAC GACATCGCCA CCGAGTATTC GGCGTTGATG
AGCAAGGTGG TCGCCAACGG TAACCACCGC GTCAAGTTCC CGCTCAACGA ACCCGCCATC
GGCAAGAAGA AGTCGCAGAT CGACGAGTTC CTCGAGTTCT ACGACGGCGC CGGCTGCCAG
CACATCGCGT TGGCCACCAA CGACATCGTC GGCACGGTCC AGGCGATGCG TCAGGCGGGT
GTGGAATTTT TGGACACGCC GGATTCGTAC TACGACGATC CGGAGTTGCG TGCCCGCATC
GGCGAGGTGC GGGTGCCGAT CGAGACGCTG AAGGAACACC GCATCCTCGT CGACCGCGAC
GAGGACGGCT ATCTGCTCCA GATCTTCACC AAACCGATCG GTGACCGACC CACCGTGTTC
TACGAACTCA TCGAGCGACA CGGCTCGCTC GGTTTCGGGA AGGGCAACTT CAAAGCCCTG
TTCGAGGCCA TCGAGCGGGA ACAGGCCCGT CGCGGCAACC TCTGA
 
Protein sequence
MANPALDDVS YDQLRQLVGL VDHDPTKDPF PVKAMDAVVF VVGNATQTAH FYQSAFGMDL 
VAYSGPETGN PEYKSFVLKS GSARFVVNGG VKPDSPLLDH HRKHGDGVID LALEVADVDK
CVEHARANGA TILDEPYEVS DEHGTVRMAA IAAYGDTRHT LVDRSRYSGP YLPGYEARTR
SVPKPEGAPK RLFQAIDHCV GNVELGKMDE WVGFYHRVMG FVNMAEFVGD DIATEYSALM
SKVVANGNHR VKFPLNEPAI GKKKSQIDEF LEFYDGAGCQ HIALATNDIV GTVQAMRQAG
VEFLDTPDSY YDDPELRARI GEVRVPIETL KEHRILVDRD EDGYLLQIFT KPIGDRPTVF
YELIERHGSL GFGKGNFKAL FEAIEREQAR RGNL