Gene Ksed_12540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_12540 
Symbol 
ID8372762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp1291110 
End bp1292114 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content77% 
IMG OID644991531 
Productshikimate 5-dehydrogenase 
Protein accessionYP_003149051 
Protein GI256825091 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.00205236 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.478593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGAGC CCCCGGGCAC GGGTGGTGCA GCGGGCCGGG CGCCGCACGC GGTCGTCGTG 
GGTTCCCCGG TGGCGCACTC GCTCTCGCCG CTGCTGCACC GTGCGGCCTG GCAGGAGCTG
GGTCTGGGGG CGGGGAGCTA CGAGCGCGTC GAGGTGCCCG CGGGAGAGCT GGCGCAGAGC
GTCGAGCGCC TGCACTCGCG CGAGCAGGTG ACAGCGCTGT CGGTCACGAT GCCGCTCAAG
GAGGAGGCCC TCGCCCTGGG GCTCGCGGGC GCGGGTGCGT CGGAGGTGGC CGTCCTGGCC
GGCGGGGCGA ACACGCTCGT GCTGGGCGGT GACGGGTGGG CCGCGAGCAA CACCGACGTG
CCGGCGCTGG CGCGCGTGAT CGGCCGGGCC TGCGGGCAGC GGTCCACTCC GGGGGCCGAC
CCGGCCGCAC CGCACGAGTC CGAAGTGGTG CCGGGGGAGG TAGTGCTGCT CGGGTCGGGT
GCCACGGCCC GTTCGGGGCT GTTGGCGCTG CACTGGTGCG GCGCGCGGGA GGTCACGGTG
GCGGTGCGCG ACGCGGTCCG CCGGGACTTC TCGCGGCTGG CCGACGAGCT CGGCATCCGG
GTGGCGGTGG TGCGCCTGGA CGCCCTGGCG CCGACGCTGC GCGAGGCGAC GGCTGCACCG
GGCAGCCTGC TCGTGGTGAA CACCCTGCCC GCCGGCTGGT GGGAGACCCT GGCGGCGCCC
GCCTGGGCAC CCGGTGGGCG GGCGCACGGG GGGCTGCACC AGCGGGACGC CCCCGGGCCG
GATCACCCGG CGGCGGGGGT CGTGTGGGTG GACGCCCAGT ACGCAGGCTG GCCGCACCCG
TGGGCGGCGG CCGCCGAGGA CGCCGGGGCT CAGGTGCTCA GCGGATTGCA CATGCTCGTG
GAGCAGGCCG TGGACCAGGT GGAGCTCATG ACCGGCCTGC GCCCCACGGC CGAGGCCACC
GCCTCCCTGC TGCCGCCGGA GCTCCGCGCG CTGCATAGGG TGTGA
 
Protein sequence
MSEPPGTGGA AGRAPHAVVV GSPVAHSLSP LLHRAAWQEL GLGAGSYERV EVPAGELAQS 
VERLHSREQV TALSVTMPLK EEALALGLAG AGASEVAVLA GGANTLVLGG DGWAASNTDV
PALARVIGRA CGQRSTPGAD PAAPHESEVV PGEVVLLGSG ATARSGLLAL HWCGAREVTV
AVRDAVRRDF SRLADELGIR VAVVRLDALA PTLREATAAP GSLLVVNTLP AGWWETLAAP
AWAPGGRAHG GLHQRDAPGP DHPAAGVVWV DAQYAGWPHP WAAAAEDAGA QVLSGLHMLV
EQAVDQVELM TGLRPTAEAT ASLLPPELRA LHRV