Gene Ksed_18840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_18840 
Symbol 
ID8373389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp1959023 
End bp1960069 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content76% 
IMG OID644992140 
Product5-enolpyruvylshikimate-3-phosphate synthase 
Protein accessionYP_003149651 
Protein GI256825691 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA ACCCGACGGG GCGCGAGGAC CACACCGGTG GCACCTCGCG CCCCGCGGCG 
TACCCCGCCC CCACCGCGAC CGGCCCACTG GACGCCACCG TGCACGTCCC GGGCTCCAAG
TCCTGGACCA ACCGCTGGCT GGCCCTGGCT GCCCTGGCCA GCGGCCCCAG CACCCTGCAC
TCGCCCTTGG ACGCCCGCGA CACCCGCCTG ATGGCCAGCG CCCTGCACCC CAGCGACCCC
GACGACGCCA CCGTCGAGGT GGACGCCAGC GCCTCCTCCC AGTTCGTCAG CGCCCTGCTG
CTGGCCGGCT GCACGGTGCA CTCCGGCCTG ACGGTCCGGG CCACCGGCAC CGTGCCGAGC
CGCCCCCACA TCGACATGAC CTGCCACGCC CTGCAGCAGG TGGGCGTCGT GGCCGAGCAA
CGCGACGAGA CCAGCTGGTG GGTGGAGGGC AAGCGCCCCG ACCCCTTCGA GGTGACGGTG
GAGCCGGACC TCTCCAGCGC CGCGGTGTTC GCCGCGGCCG CCGCCGTGGC CGGCGGCACC
GTCACCCTGC CCGGGTGGCC CCGCTCCACC ACGCAGGCCG GCGACACCAT CCGCGACCTC
CTGACCCGGA TGGGGGCGCG CTGCGAGCTG ACCGAGGCCG GCCTGCGCGT CACCGGCGGC
GAGCTCCACG GCATCGAGGC CGACCTCTCC GCCGCAGGCG AGCTGACGCC CGTGGTCGCG
GCCACGGCAG CCCTGGCGGA CTCCCCCAGC CGGCTCACCG GCATCGGCCA CCTGCGCGGC
CACGAGACCG ACCGGCTGGC CGCCCTGGCC ACCGAGATCA ACGCCCTGGG CGGCGAGGTG
CGCGAGCTGC CCGACGGCCT GGAGATCACC CCCCGGCCGC TGCACGGGGG ATCGTTCGCG
ACCTACCACG ACCACCGGCT GGCGATGGCG GGCGCGCTGC TCGGGCTGCG CGTGCCCGGC
ATCGAAGTGC AGGACATCGC CACCACCGCC AAGACCGTGC CCGGGTTCGA CCACCTCTGG
GGCGCCATGC TGGAGGGGGA CACCTGA
 
Protein sequence
MSSNPTGRED HTGGTSRPAA YPAPTATGPL DATVHVPGSK SWTNRWLALA ALASGPSTLH 
SPLDARDTRL MASALHPSDP DDATVEVDAS ASSQFVSALL LAGCTVHSGL TVRATGTVPS
RPHIDMTCHA LQQVGVVAEQ RDETSWWVEG KRPDPFEVTV EPDLSSAAVF AAAAAVAGGT
VTLPGWPRST TQAGDTIRDL LTRMGARCEL TEAGLRVTGG ELHGIEADLS AAGELTPVVA
ATAALADSPS RLTGIGHLRG HETDRLAALA TEINALGGEV RELPDGLEIT PRPLHGGSFA
TYHDHRLAMA GALLGLRVPG IEVQDIATTA KTVPGFDHLW GAMLEGDT