Gene Sare_4286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4286 
Symbol 
ID5706998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4861451 
End bp4862473 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content66% 
IMG OID641273705 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001539058 
Protein GI159039805 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.665586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00623467 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCATCT CCCAGCGACC GTCCCTCTCC GAGGAGTCGA TCAACGAGAC CCGGTCCCGG 
TTCACCATCG AACCGCTGGA GCCCGGCTTC GGCTACACCC TGGGCAACTC GCTGCGCCGG
ACGCTGCTGT CGTCCATTCC CGGCGCGGCG GTGACCTCGA TCAAGATCGA TGGTGTGCTG
CACGAGTTCA CCACGATCCC GGGGGTCAAG GAGGATGTGG TCGAACTCGT CATGAACATC
AAGGAGCTGT GCGTCAGCTC CGAGCATGAC GAGCCGGTCA GCATGTACCT GCGCAAGCAG
GGCCCGGGTG ACGTGACCGC GGGTGACATC CAGCCCCCGG CCGGTGTCTC GGTACACAAC
CCGGACCTGA AGCTCGCCAC CCTGAACGGT AAGGGCCGGC TCGACATGGA GCTGACCGTC
GAGCGGGGCC GTGGTTACGT CACCGCGGCG CAGAACAAGC AGGCCGGCGC CGAGATCGGT
CGGATCCCGG TCGACTCGAT CTACTCACCG GTACTGCGGG TCACCTACCG GGTCGAGGCG
ACCCGAGTCG AGCAGCGGAC CGACTTCGAT CGGCTGATCA TCGACGTCGA GTCCAAGCCG
TCGATGGGGC CACGTACGGC CCTGGCCTCG GCCGGCTCCA CGCTGGTCGA ACTCTTCGGC
CTGGCCCGCG AGCTGGACGA GACCGCGGAG GGTATCGACA TCGGGCCGTC CCCGCAGGAC
GCCCAGCTGG CGGCGGACCT GGCGCTGCCG ATCGAGGAGC TGGACCTCAC CGTCCGCTCC
TACAACTGCC TCAAGCGCGA GGGCATCAAC TCCGTTGGTG AGCTCATCGG GCGTACCGAG
GCTGACCTCC TCGACATCCG TAACTTCGGT CAGAAGTCGA TCGACGAGGT CAAGATGAAG
CTCGCCGGGA TGGGACTGGG GCTGAAGGAC TCGGCCCCGA ACTTCGACCC GGCGAACGTC
GTGGACGCCT TCGGTGAGGC TGACTACGAC ACCGAGGACT ACCGCGAGAC TGAGCAGCTG
TAA
 
Protein sequence
MLISQRPSLS EESINETRSR FTIEPLEPGF GYTLGNSLRR TLLSSIPGAA VTSIKIDGVL 
HEFTTIPGVK EDVVELVMNI KELCVSSEHD EPVSMYLRKQ GPGDVTAGDI QPPAGVSVHN
PDLKLATLNG KGRLDMELTV ERGRGYVTAA QNKQAGAEIG RIPVDSIYSP VLRVTYRVEA
TRVEQRTDFD RLIIDVESKP SMGPRTALAS AGSTLVELFG LARELDETAE GIDIGPSPQD
AQLAADLALP IEELDLTVRS YNCLKREGIN SVGELIGRTE ADLLDIRNFG QKSIDEVKMK
LAGMGLGLKD SAPNFDPANV VDAFGEADYD TEDYRETEQL