Gene Ksed_20590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_20590 
Symbol 
ID8373563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp2137686 
End bp2138708 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content66% 
IMG OID644992308 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_003149818 
Protein GI256825858 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.556688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGATTG CCCAGCGCCC CACCCTCTCC GAGGAGAAGG TCTCCGAGGC CCGTTCCCGG 
TTCACCATCG AGCCGCTGGA GCCCGGCTTC GGCTACACCC TCGGCAACTC GCTCCGCCGC
ACCCTGCTCT CGAGCATCCC GGGTGCCGCG GTCACCAGCA TCCGCATCGA CGGTGTGCTG
CACGAGTTCT CCACCGTTCC CGGTGTGAAG GAGGACGTCA CCGAGCTCAT CCTCAACATC
AAGTCCCTCG TCTTCTCCTC GGAGCACGAC GAGCCCGTGG TGGCCTACCT GCGCAAGCAG
GGTTCGGGTG AGATGACCGG TGCCGACATC AGCTGCCCGG CAGGTGTCGA GGTGCACAAC
CCCGACCTCT ACCTGGGTGC GCTGAACGAC GAGGGTGCGA TCGACCTCGA GCTCACCATC
GAGCGTGGCC GCGGCTACGT CTCGGCGCAG CAGAACAAGG GCGGCGAGCA GGAGATCGGC
CGGATCCCGG TCGACTCCAT CTACTCGCCG GTGCTGTCGG TCACCTACAA GGTGGAGGCC
ACCCGTGTCG AGCAGCGCAC CGACTTCGAC AAGCTGATCG TCGACGTCGA GACCAAGAAC
TCCATGTCCC CGGCCGATGC CATGGCCTCG GCCGGCAAGA CGCTGGTGGA GCTCTTCGGT
CTGGCGCGCG ATCTCAACGT CGAGGCCGAG GGCATCGAGA TGGGCACCGT GCAGACGGAC
GCCTCGCTGG CCGCCGACCT GGCGTTGCCG GTGGAGGAGC TCAACCTGTC CGTGCGTTCC
TACAACTGCC TGAAGCGCGA GGGCATCCAC ACCGTGGGTG AGCTCGTGGC ACGCAGCCAG
GCGGACCTGC TGGACATCCG CAACTTCGGC AACAAGTCCA TCGACGAGGT GCAGGTCGAG
CTCCACAAGC TCGGTCTGGC CCTCAAGGAC ACGCCGGCCG ACTTCGACCC GTCCACCATC
GTGCTCGACC GCGACGAGGA CGAGGCCGCC GACGACGAGG TCCTCGAGGA CGAGCAGTAC
TGA
 
Protein sequence
MLIAQRPTLS EEKVSEARSR FTIEPLEPGF GYTLGNSLRR TLLSSIPGAA VTSIRIDGVL 
HEFSTVPGVK EDVTELILNI KSLVFSSEHD EPVVAYLRKQ GSGEMTGADI SCPAGVEVHN
PDLYLGALND EGAIDLELTI ERGRGYVSAQ QNKGGEQEIG RIPVDSIYSP VLSVTYKVEA
TRVEQRTDFD KLIVDVETKN SMSPADAMAS AGKTLVELFG LARDLNVEAE GIEMGTVQTD
ASLAADLALP VEELNLSVRS YNCLKREGIH TVGELVARSQ ADLLDIRNFG NKSIDEVQVE
LHKLGLALKD TPADFDPSTI VLDRDEDEAA DDEVLEDEQY