Gene ECH_0760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0760 
SymbolrpoD 
ID3927201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp766191 
End bp768059 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content33% 
IMG OID637901878 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_507558 
Protein GI88657966 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.856067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATC TACAAACAGA TAAAGAATTA CTCAGAACTC TTGTGAGTAA GGGACTCAAA 
CAAGGATTTG TTACATTTAA CGATATCAAC GATGTTTTTT CTGACTACGT GCTGTCATCT
GACAACATAG ATGAAACTAT ATCAATGCTT CAGGATTCAG GTATTAATGT TCTTGAACAA
AGTGATGAAG ACGAGGCAAA TAGCATAATA GAAGAAAACA GAATCAAAGA TGAAATGGAC
GAAGATGATG TATCAGACAC CAGTATAAAA TCTTGGGATT TTGGACAAAC AGATGACCCT
ATACGCATGT ACCTATGTGA AATGAGTTCT GTTGAATTAC TATCTCGTGA AGGAGAAATT
GAAATTGCCA AAAAGATCAA ATCAGAAAAA GTAAATATGC TCAGGTCACT AGTAGAATCC
CCAATAGTAT TACGTACATT TATGTCATGG AGGGATGACT TAGTAAATGA ACAAATAATG
TTAAGAGATT TAATAGATTT AGATGCAAAT TACAGGTATG AATTTCCTGA AAAATTTTCT
GACACTGATG ATGCTAGTGT TTTAGGTTAT GATAAAGATT TAATGGATGA ACCTGATGAA
GATCCAGATA TACCAGAAGA TGAAGATGAA GAAAACATTT CAATAAATGC TTCAATATTA
GAAATGGAAA ATGCATTGCT ACCCAAAGTA GTAAGCATCT TAGATTCAGT AATTGCATCA
GCAGAAAAAA TTTTAGAACT CAAAAAACAA TATCAAGGTA AAATTAACCA AGATATTGAA
AAACAATACA ATGACTTACA CGATAGCATA TGGGAAATGA TATATCAAAT TAAACTTAGT
AATTCCGCAG TCCTATCAAT CACACAACAA ATATATAGCT TAAGTAAATC AATAGCAGCA
GAAGAGGCAA AAATTATTTC CCTTGCAGAA AGCTATGGTA TACAACGCAA AGACTTCTTA
GATGCATATA ATACTAATTC TGTATTGCAA AAAAAAGGCA CTTCTCCTCA ATGGGATAAC
ATGTTACTCA ATGAAGAGAG TAATATAGTA AGTATGTACA GTAAAATTAA GCTACTGTCT
GGAGAAAATA ACCTTACCGA ATTCAAAGCA TTAGTCACAA AAATACAAAA GCACGAACGT
GCAGCAAATC AAGCAAAGCA AGAAATGATA AAAGCAAATC TAAGGTTAGT AGTATCGATT
GCAAAAAAAT ATTCAAATCG AGGTTTACAG TTTCTAGATT TAGTACAAGA AGGGAATATT
GGCTTAATGA AAGCAGTAGA TAAATTTGAT TACAAACGTG GATATAAATT CTCAACTTAT
GCAACGTGGT GGGTGAGGCA AGCTATCACT AGAGCAATAG CAGATCAGGC ACGAACTATT
AGAATTCCTG TACACATGAT TGAAACAGTA AATAAAATCA ATCGCACACT AAGACAAATG
CTACACGAAA TGGGTAGAGA ACCAACATTA GAGGAATTAT CAGCAAGATT AAACATTAAT
GTAGATAAGA TACGTAAGGT AATGAAAATA GTTAAAGACC CCGTAAGCTT AGAAAGCCCC
ATAGGAGATG ATGACAGTAG TACCTTTGGT GATTGCATAG AAGATAAACG TGCAGTAAAG
CCAGAAGATG CAGCAGTACT TGCAGACTTA CGAGAAATAA CTACAAAAGT ATTATCAACA
TTAACACCAA AAGAAGAGAG AATCTTACGT ATGCGGTTTG GAATAGGTAA AGGTGGAAAA
GACCATACCT TAGAAGAAGT AGGAAAACTA TTCAATGTAA CAAGAGAACG TATACGACAA
ATTGAAGCAA AAGCTTTACG TAAACTACGT CATCCAAGTC GTGCAAGAAA GCTTAGAGGA
TTTTTCTAA
 
Protein sequence
MKDLQTDKEL LRTLVSKGLK QGFVTFNDIN DVFSDYVLSS DNIDETISML QDSGINVLEQ 
SDEDEANSII EENRIKDEMD EDDVSDTSIK SWDFGQTDDP IRMYLCEMSS VELLSREGEI
EIAKKIKSEK VNMLRSLVES PIVLRTFMSW RDDLVNEQIM LRDLIDLDAN YRYEFPEKFS
DTDDASVLGY DKDLMDEPDE DPDIPEDEDE ENISINASIL EMENALLPKV VSILDSVIAS
AEKILELKKQ YQGKINQDIE KQYNDLHDSI WEMIYQIKLS NSAVLSITQQ IYSLSKSIAA
EEAKIISLAE SYGIQRKDFL DAYNTNSVLQ KKGTSPQWDN MLLNEESNIV SMYSKIKLLS
GENNLTEFKA LVTKIQKHER AANQAKQEMI KANLRLVVSI AKKYSNRGLQ FLDLVQEGNI
GLMKAVDKFD YKRGYKFSTY ATWWVRQAIT RAIADQARTI RIPVHMIETV NKINRTLRQM
LHEMGREPTL EELSARLNIN VDKIRKVMKI VKDPVSLESP IGDDDSSTFG DCIEDKRAVK
PEDAAVLADL REITTKVLST LTPKEERILR MRFGIGKGGK DHTLEEVGKL FNVTRERIRQ
IEAKALRKLR HPSRARKLRG FF