Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0334 |
Symbol | |
ID | 4486120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 345197 |
End bp | 346222 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639729101 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_872094 |
Protein GI | 117927543 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0665548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.000496522 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCTCATCG CACAACGACC AACCATTACC GAAGAACCTG TCCACGACAC GCGGTCACGT TTCGTGATCG AACCTCTCGA ACCGGGTTTC GGCTACACCC TGGGCAATTC GCTGCGGCGG ACGCTGCTCT CGTCCATTCC GGGAGCCGCG GTGACGAGCC TGCGCATTGA CGGCGTCCTG CACGAGTTCT CCACCGTGCC GGGAGCCAAA GAGGACGTCA CCGAGATGAT CCTCAACATC AAAGAGCTGG TCGTCTCGTC CGAGCACGAC GACCCGCAGG TCATCTACCT CCGCAAGCAA GGCCCGTGCG AAGTGACGGC GGCGGACATC GTCGCCCCCG CCGGGGTCGA GGTGCACAAT CCCGACCTGC ACATCGCGAC CCTGAACGAC AAGGGCAAGC TCGAGATCGA GATGGTCGTG GAGCGGGGTC GTGGTTACGT GCCTGCGGCG CAGAACAAGC TTCCCGGCCA CGAGATCGGC CGTATTCCCA TTGACTCGAT CTACTCCCCG GTGCTGAAGG TGACGTACAA GGTCGAGGCC ACCCGTGTCG AGCAGCGCAC CGACTTCGAC CGGTTGATCA TGGACGTCGA GACGAAACCG TCGATGCGGC CGCGGGACGC CATGGCCAGT GCCGGCAAGA CACTCGTCGA GCTCTTCGGC CTGGTCCGCG AGCTGAATGT GGACGCTGAA GGCATCGACA TCGGCCCGTC GCCGTCCGAT GCCGCGCTCG CCGCCGATCT CGCCCTCCCC ATCGAGGATC TCAACCTCAC CGTCCGGTCC TACAACTGCT TGAAGCGGGA GGGGATCCAC ACGGTCGGCG AGCTCGTCGC GAGGAGCGAG GCGGATCTCC TGGACATCCG GAATTTCGGC CAGAAGTCGA TCGAAGAGGT GAAGACGAAG CTGGCGGAGA TGGGTCTGTC GCTCAAGGAC TCTCCGCCGG GATTCGACCC CGGCCGGGTG GTCTACTCCG CTTCCCGTTC TGACTACGAC GAGGATCAGC GGTATATCGA GACGGAGCAG CTGTAG
|
Protein sequence | MLIAQRPTIT EEPVHDTRSR FVIEPLEPGF GYTLGNSLRR TLLSSIPGAA VTSLRIDGVL HEFSTVPGAK EDVTEMILNI KELVVSSEHD DPQVIYLRKQ GPCEVTAADI VAPAGVEVHN PDLHIATLND KGKLEIEMVV ERGRGYVPAA QNKLPGHEIG RIPIDSIYSP VLKVTYKVEA TRVEQRTDFD RLIMDVETKP SMRPRDAMAS AGKTLVELFG LVRELNVDAE GIDIGPSPSD AALAADLALP IEDLNLTVRS YNCLKREGIH TVGELVARSE ADLLDIRNFG QKSIEEVKTK LAEMGLSLKD SPPGFDPGRV VYSASRSDYD EDQRYIETEQ L
|
| |