Gene ECH_0432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0432 
SymbolrpoA 
ID3927273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp409729 
End bp410754 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content32% 
IMG OID637901556 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_507250 
Protein GI88658298 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATC ATTGGAGTAA ATTAACTAAG CCTTCTTCTA TTAAGGTAGT TCCTAGTGGT 
GCTTCTCCAA ATAAGGCAGA TTTAATTATT GAACCTCTTG AGAGTGGTTT TGCTTTAACT
CTAGGTAATG CTTTAAGAAG AGTTATGATG TCTTCTTTGC GTGGTTTTGC TGTTTATGGT
GTTGAAATAG AGAATGTTTT ACACGAATTT ACTTCGATTT CTGGGGTTAG AGAAGATGTA
ACAGATATTC TTTTAAATAT TAGCATGATG AGAGTAAAGC TTTCTGGTAT GAATAGTAAA
GTTTTGTCTC TTAAAGTTAA AGGACCATGT GAAGTAAGAT CTGGAATGAT ACCTGACACT
GATGATTGCA TTATACTCAA TAAGGATTTG TTAATTTGCA CTTTAGATCA AGGTGTAGAT
TTTAATATAA AAATGTATGT AAATGCTGGT AAAGGTTATG TTCCTGCTGT AAAGCGTAAG
TCAGTTAATA AACTTGGTGA TATACCAATA AATTTCATTG CAACTAATGC TCTTTATAGC
CCTATAAAAA AGGCGTCTTT TAAAGTAGAA AGTAGTCGTA TTGGTCAATT TACAGATTAT
GATCGATTGA TTATGTCTGT GGAAACAGAT GGGTCTATTT TACCTGATGA AGCAGTAGCA
TTAGCTGCAA GAATTTTACA AGATCAATTT CAACCATTTA TTAATTTTGA TGAAACTGAT
GAGCCTCATA AGAAAGTTGA TGCTAAAGAT ACATTGCCTT ATGATTCTAA TCTGTTACGT
AAAGTTGATG AGTTAGAACT TTCAGTAAGG TCTTATAATT GTTTGAAAAA TGATAATATT
ACTTATATAG GAGATTTAGT CCAAAAGACT GAATCTGATA TGTTAAGAAC TCCAAATTTT
GGTAGAAAGT CGCTTAATGA GATTAATGAG CTTTTAGCAA GTATGAATTT GCATTTAGGA
ATGAAAATTG CGAATTGGCC TCCTGAATCC ATTGAGAGTT TAAGTAAACA ATATAGTGAA
GAATGA
 
Protein sequence
MSDHWSKLTK PSSIKVVPSG ASPNKADLII EPLESGFALT LGNALRRVMM SSLRGFAVYG 
VEIENVLHEF TSISGVREDV TDILLNISMM RVKLSGMNSK VLSLKVKGPC EVRSGMIPDT
DDCIILNKDL LICTLDQGVD FNIKMYVNAG KGYVPAVKRK SVNKLGDIPI NFIATNALYS
PIKKASFKVE SSRIGQFTDY DRLIMSVETD GSILPDEAVA LAARILQDQF QPFINFDETD
EPHKKVDAKD TLPYDSNLLR KVDELELSVR SYNCLKNDNI TYIGDLVQKT ESDMLRTPNF
GRKSLNEINE LLASMNLHLG MKIANWPPES IESLSKQYSE E