Gene Ksed_10550 details


Gene Information

Locus tag: Ksed_10550
Symbol:
ID: 8372563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Kytococcus sedentarius DSM 20547
Kingdom: Bacteria
Replicon accession: NC_013169
Strand:
Start bp: 1079124
End bp: 1080347
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 67%
IMG OID: 644991335
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_003148864
Protein GI: 256824904
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
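
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the reported 1224 bp) and that the CDS is complete, ending in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1079124, 1080347)
print(length)                     # 1224
print(protein_length_aa(length))  # 407
```

Both values agree with the Gene Length and Protein Length fields listed in the record.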


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.00901383
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.579453
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACA CCACCCTGAC CGCCCTGACG AACCAGGAGC AGGCCGAGAA CCTGTCCGTC 
GAGCAGCTCA AGCAGCTGGT CGGCCTGGTG GAGTACGACG GCTCCAACGA CCCCTTCCCG
GTGACCGGCT GGGACTCCAT CGTCTTCGTG GTGGGCAACG CCACGCAGGC TGCGCACTTC
TACCAGTCGG CTTTCGGCAT GGAGCTGGTC GCCTACTCCG GCCCCGAGAA CGGCAACCGC
GACCACAAGG CGTTCGTCCT GAAGTCGGGC AACATCAAGT TCGTGCTGAA GGGCGCGGTG
GACCCCCAGT CCCCGCTGCT GGACCACCAC CGCGCGCACG GTGACGGCGT GGTGGACATC
TCCCTGGAGG TGCCGGACGT CGACCAGTGC ATCGAGCACG CCCGCTCGGT GGGTGCCACG
GTGCTCCAGG AGCCCACGGA CCTGAGCGAC GACCACGGCA CCGTGCGCGT CGGCGCCATC
GCGACCTACG GGGAGACCCG GCACACCCTC GTCCAGCGGG AGGTCGACGG GACCCGCTAC
GCCGGCCCCT ACCTGCCGGG CTACGAGGCG CGCGAGGGCA CCTACGTCAA GCGCGAGGGC
TCGCCGAAGC GCCTGTTCCA GGCCCTGGAC CACATCGTCG GCAACGTCGA GCTCGGCAAG
ATGGATGAGT GGGTGGAGTT CTACCACCGC GTCATGGGCT TCACGGACAT GGCCGAGTTC
GTGGGCGACG ACATCGCCAC CGACTACTCC GCGCTGATGT CCAAGGTGGT GGCCAACGGC
AACCACCGCG TGAAGTTCCC GCTCAACGAG CCGGCGATCG CCAAGAAGAA GTCGCAGATC
GATGAGTACC TGGAGTTCTA CGGCTGCGCC GGTGCCCAGC ACCTGGCCCT GGCCACGAAC
GACATCATCA CGACCGTCGA CCGCATGCGT GCCGAGGGCG TCGAGTTCCT GGCCACCCCG
GACTCCTACT ACGAGGACCC GGAGCTGCGT GAGCGCATCG GCAACGTGCG CGTCCCCATC
GAGGAGCTGC AGAAGCGCGG CATCCTGGTG GACCGCGACG AGGACGGCTA CCTGCTGCAG
ATCTTCACCA AGCCGATCGG CGACCGCCCC ACGGTGTTCT TCGAGTTGAT CGAGCGCCAC
GGCTCGCTGG GCTTCGGCAT CGGCAACTTC AAGGCGCTGT TCGAGGCCAT CGAGCGCGAG
CAGGAGCTGC GCGGCAACTT CTGA
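
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helper below (a hypothetical name, not taken from any tool used to build this record) strips the layout and computes the G+C fraction, the quantity reported as GC content in the record. Only the first row of the sequence is used here for brevity; pasting the full sequence works the same way.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row of the gene sequence, copied with its block spacing intact.
first_row = "ATGACTGACA CCACCCTGAC CGCCCTGACG AACCAGGAGC AGGCCGAGAA CCTGTCCGTC"
print(f"{gc_fraction(first_row):.0%}")
```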
 
Protein sequence
MTDTTLTALT NQEQAENLSV EQLKQLVGLV EYDGSNDPFP VTGWDSIVFV VGNATQAAHF 
YQSAFGMELV AYSGPENGNR DHKAFVLKSG NIKFVLKGAV DPQSPLLDHH RAHGDGVVDI
SLEVPDVDQC IEHARSVGAT VLQEPTDLSD DHGTVRVGAI ATYGETRHTL VQREVDGTRY
AGPYLPGYEA REGTYVKREG SPKRLFQALD HIVGNVELGK MDEWVEFYHR VMGFTDMAEF
VGDDIATDYS ALMSKVVANG NHRVKFPLNE PAIAKKKSQI DEYLEFYGCA GAQHLALATN
DIITTVDRMR AEGVEFLATP DSYYEDPELR ERIGNVRVPI EELQKRGILV DRDEDGYLLQ
IFTKPIGDRP TVFFELIERH GSLGFGIGNF KALFEAIERE QELRGNF
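
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library; it assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard table in its permitted start codons, not in the assignments used here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order: first base varies slowest, third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the original block spacing.
print(translate("ATGACTGACA CCACCCTGAC CGCCCTGACG"))  # MTDTTLTALT
```

The output matches the first ten residues of the protein sequence; applying `translate` to the full gene sequence follows the same steps.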