Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0432 |
Symbol | rpoA |
ID | 3927273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 409729 |
End bp | 410754 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637901556 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_507250 |
Protein GI | 88658298 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATC ATTGGAGTAA ATTAACTAAG CCTTCTTCTA TTAAGGTAGT TCCTAGTGGT GCTTCTCCAA ATAAGGCAGA TTTAATTATT GAACCTCTTG AGAGTGGTTT TGCTTTAACT CTAGGTAATG CTTTAAGAAG AGTTATGATG TCTTCTTTGC GTGGTTTTGC TGTTTATGGT GTTGAAATAG AGAATGTTTT ACACGAATTT ACTTCGATTT CTGGGGTTAG AGAAGATGTA ACAGATATTC TTTTAAATAT TAGCATGATG AGAGTAAAGC TTTCTGGTAT GAATAGTAAA GTTTTGTCTC TTAAAGTTAA AGGACCATGT GAAGTAAGAT CTGGAATGAT ACCTGACACT GATGATTGCA TTATACTCAA TAAGGATTTG TTAATTTGCA CTTTAGATCA AGGTGTAGAT TTTAATATAA AAATGTATGT AAATGCTGGT AAAGGTTATG TTCCTGCTGT AAAGCGTAAG TCAGTTAATA AACTTGGTGA TATACCAATA AATTTCATTG CAACTAATGC TCTTTATAGC CCTATAAAAA AGGCGTCTTT TAAAGTAGAA AGTAGTCGTA TTGGTCAATT TACAGATTAT GATCGATTGA TTATGTCTGT GGAAACAGAT GGGTCTATTT TACCTGATGA AGCAGTAGCA TTAGCTGCAA GAATTTTACA AGATCAATTT CAACCATTTA TTAATTTTGA TGAAACTGAT GAGCCTCATA AGAAAGTTGA TGCTAAAGAT ACATTGCCTT ATGATTCTAA TCTGTTACGT AAAGTTGATG AGTTAGAACT TTCAGTAAGG TCTTATAATT GTTTGAAAAA TGATAATATT ACTTATATAG GAGATTTAGT CCAAAAGACT GAATCTGATA TGTTAAGAAC TCCAAATTTT GGTAGAAAGT CGCTTAATGA GATTAATGAG CTTTTAGCAA GTATGAATTT GCATTTAGGA ATGAAAATTG CGAATTGGCC TCCTGAATCC ATTGAGAGTT TAAGTAAACA ATATAGTGAA GAATGA
|
Protein sequence | MSDHWSKLTK PSSIKVVPSG ASPNKADLII EPLESGFALT LGNALRRVMM SSLRGFAVYG VEIENVLHEF TSISGVREDV TDILLNISMM RVKLSGMNSK VLSLKVKGPC EVRSGMIPDT DDCIILNKDL LICTLDQGVD FNIKMYVNAG KGYVPAVKRK SVNKLGDIPI NFIATNALYS PIKKASFKVE SSRIGQFTDY DRLIMSVETD GSILPDEAVA LAARILQDQF QPFINFDETD EPHKKVDAKD TLPYDSNLLR KVDELELSVR SYNCLKNDNI TYIGDLVQKT ESDMLRTPNF GRKSLNEINE LLASMNLHLG MKIANWPPES IESLSKQYSE E
|
| |