Gene Ksed_10650 details

Gene Information

Locus tag: Ksed_10650
Symbol:
ID: 8372573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Kytococcus sedentarius DSM 20547
Kingdom: Bacteria
Replicon accession: NC_013169
Strand:
Start bp: 1088622
End bp: 1089926
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 70%
IMG OID: 644991345
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_003148874
Protein GI: 256824914
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
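
The coordinates and lengths in this record can be checked against one another with a short sketch. It assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1088622, 1089926)
print(length, protein_length(length))  # 1305 434
```

Under these assumptions the result agrees with the listed gene length of 1305 bp and protein length of 434 aa.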


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value: 0.233818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.424884
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTACT ACCGCAGTGT CGGAACAATC CCCCCCAAGC GGCACACCCA GCACCGCACC 
CCCGAGGGGG GCCTGTACTA CGAGGAGCTG ATGGGGGAGG AGGGCTTCTC CTCGGACTCC
TCCCTGCTGT ACCACCGCAA CATCCCCTCG ACCATCACCG ACGCCCGCGT CTGGGAGGTG
CCGGACGCCT CGCTGACCCC CAACCACCCC CTGCGGCCCC TGCACCTGCG CCCCCACGAC
CTGTTCGGCG GCGACGAGCC CACCGAGGGG GTCGATGTGG TGACCGGTCG CCGGCTGCTG
ATGGGCAACG CCGACGTGCG GCTGAGCTAC GTGGTGGCGG ACACGGTCAG CCCCTGGTAC
CGCAACGCCA TCGGCGACGA GTGCCTGTAC GTCGAGCGCG GCCACGCCCG GGTGGAGACC
GTCTTCGGCG CCTTCGAGCT GGAGCAGGGT GACTATCTGA TCATGCCGCG GGCAACCACC
CACCGCTGGA TCCCGCGCGA TGCGGGGGAC GTCGGCTACA GCGAGCCGCT GCGTGTGTAC
GCCATCGAGG CCTCCAGCCA CATCGGTCCC CCCAAGCGCT TCCTCTCGCG GTTCGGCCAG
CTCCTGGAGC ACGCGCCCTA CTGCGAGCGG GACCTGCGCG GGCCGACCGA GCCGCTGCTG
GCCGAGGACA TCGGGGCGGA CCGGGCCGAG GAGACCGAGG TCTACATCCG GCACCGCGCC
ACCGGGGAGG GCGCCTCCGG TGGGCAGGGC GGCACGATCC ACACGGTCCC CTTCCACCCG
CTCGACGTGG CCGGCTGGGA CGGCTGCCTC TACCCGTACG TCTTCAACGT CTCTGACTAC
GAGCCGATCA CCGGCCGGGT GCACCAGCCG CCGCCCGCCC ACCAGGTCTT CGAGGGCCAC
AACTTCGTGG TGTGCAACTT CGTGCCCCGC AAGGTGGACT ACCACCCGTT GAGCATCCCG
GTGCCCTACT ACCACTCGAA CGTCGACTCC GACGAGGTCA TGTTCTACGT CGACGGGGAC
TACGAGGCGC GCAAGGGCAG CGGCATCAAA CAGGGCTCGA TCAGCCTGCA CCCGGGCGGC
CACGCGCACG GCCCCCAACC CGGCGCGTAC GAGAACTCGA TCGGGGCCGA GTACTTCGAC
GAGCTGGCCG TGATGGTGGA CACCTTCCGG CCCCTGGACC TCGGGGAGGG AGGGCTGGCG
TGCGACGACG GCCGCTACGC GTGGTCCTGG CACTCCACGG CCCAGGCCTC GCAGTCCGAG
CAGCGGGAGC GGGCGCGCCA GGAGCCGCCC ACCGCGTCTG ACTGA
 
Protein sequence
MAYYRSVGTI PPKRHTQHRT PEGGLYYEEL MGEEGFSSDS SLLYHRNIPS TITDARVWEV 
PDASLTPNHP LRPLHLRPHD LFGGDEPTEG VDVVTGRRLL MGNADVRLSY VVADTVSPWY
RNAIGDECLY VERGHARVET VFGAFELEQG DYLIMPRATT HRWIPRDAGD VGYSEPLRVY
AIEASSHIGP PKRFLSRFGQ LLEHAPYCER DLRGPTEPLL AEDIGADRAE ETEVYIRHRA
TGEGASGGQG GTIHTVPFHP LDVAGWDGCL YPYVFNVSDY EPITGRVHQP PPAHQVFEGH
NFVVCNFVPR KVDYHPLSIP VPYYHSNVDS DEVMFYVDGD YEARKGSGIK QGSISLHPGG
HAHGPQPGAY ENSIGAEYFD ELAVMVDTFR PLDLGEGGLA CDDGRYAWSW HSTAQASQSE
QRERARQEPP TASD
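
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the two differ in permitted start codons), so the table is built from the standard NCBI ordering. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces and newlines allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCGTACT ACCGCAGTGT CGGAACAATC"))  # MAYYRSVGTI
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function allows the whole translation to be compared in the same way.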