Gene Acel_0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0334 
Symbol 
ID4486120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp345197 
End bp346222 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID639729101 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_872094 
Protein GI117927543 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0665548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000496522 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCTCATCG CACAACGACC AACCATTACC GAAGAACCTG TCCACGACAC GCGGTCACGT 
TTCGTGATCG AACCTCTCGA ACCGGGTTTC GGCTACACCC TGGGCAATTC GCTGCGGCGG
ACGCTGCTCT CGTCCATTCC GGGAGCCGCG GTGACGAGCC TGCGCATTGA CGGCGTCCTG
CACGAGTTCT CCACCGTGCC GGGAGCCAAA GAGGACGTCA CCGAGATGAT CCTCAACATC
AAAGAGCTGG TCGTCTCGTC CGAGCACGAC GACCCGCAGG TCATCTACCT CCGCAAGCAA
GGCCCGTGCG AAGTGACGGC GGCGGACATC GTCGCCCCCG CCGGGGTCGA GGTGCACAAT
CCCGACCTGC ACATCGCGAC CCTGAACGAC AAGGGCAAGC TCGAGATCGA GATGGTCGTG
GAGCGGGGTC GTGGTTACGT GCCTGCGGCG CAGAACAAGC TTCCCGGCCA CGAGATCGGC
CGTATTCCCA TTGACTCGAT CTACTCCCCG GTGCTGAAGG TGACGTACAA GGTCGAGGCC
ACCCGTGTCG AGCAGCGCAC CGACTTCGAC CGGTTGATCA TGGACGTCGA GACGAAACCG
TCGATGCGGC CGCGGGACGC CATGGCCAGT GCCGGCAAGA CACTCGTCGA GCTCTTCGGC
CTGGTCCGCG AGCTGAATGT GGACGCTGAA GGCATCGACA TCGGCCCGTC GCCGTCCGAT
GCCGCGCTCG CCGCCGATCT CGCCCTCCCC ATCGAGGATC TCAACCTCAC CGTCCGGTCC
TACAACTGCT TGAAGCGGGA GGGGATCCAC ACGGTCGGCG AGCTCGTCGC GAGGAGCGAG
GCGGATCTCC TGGACATCCG GAATTTCGGC CAGAAGTCGA TCGAAGAGGT GAAGACGAAG
CTGGCGGAGA TGGGTCTGTC GCTCAAGGAC TCTCCGCCGG GATTCGACCC CGGCCGGGTG
GTCTACTCCG CTTCCCGTTC TGACTACGAC GAGGATCAGC GGTATATCGA GACGGAGCAG
CTGTAG
 
Protein sequence
MLIAQRPTIT EEPVHDTRSR FVIEPLEPGF GYTLGNSLRR TLLSSIPGAA VTSLRIDGVL 
HEFSTVPGAK EDVTEMILNI KELVVSSEHD DPQVIYLRKQ GPCEVTAADI VAPAGVEVHN
PDLHIATLND KGKLEIEMVV ERGRGYVPAA QNKLPGHEIG RIPIDSIYSP VLKVTYKVEA
TRVEQRTDFD RLIMDVETKP SMRPRDAMAS AGKTLVELFG LVRELNVDAE GIDIGPSPSD
AALAADLALP IEDLNLTVRS YNCLKREGIH TVGELVARSE ADLLDIRNFG QKSIEEVKTK
LAEMGLSLKD SPPGFDPGRV VYSASRSDYD EDQRYIETEQ L