Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0760 |
Symbol | rpoD |
ID | 3927201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 766191 |
End bp | 768059 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901878 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_507558 |
Protein GI | 88657966 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.856067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATC TACAAACAGA TAAAGAATTA CTCAGAACTC TTGTGAGTAA GGGACTCAAA CAAGGATTTG TTACATTTAA CGATATCAAC GATGTTTTTT CTGACTACGT GCTGTCATCT GACAACATAG ATGAAACTAT ATCAATGCTT CAGGATTCAG GTATTAATGT TCTTGAACAA AGTGATGAAG ACGAGGCAAA TAGCATAATA GAAGAAAACA GAATCAAAGA TGAAATGGAC GAAGATGATG TATCAGACAC CAGTATAAAA TCTTGGGATT TTGGACAAAC AGATGACCCT ATACGCATGT ACCTATGTGA AATGAGTTCT GTTGAATTAC TATCTCGTGA AGGAGAAATT GAAATTGCCA AAAAGATCAA ATCAGAAAAA GTAAATATGC TCAGGTCACT AGTAGAATCC CCAATAGTAT TACGTACATT TATGTCATGG AGGGATGACT TAGTAAATGA ACAAATAATG TTAAGAGATT TAATAGATTT AGATGCAAAT TACAGGTATG AATTTCCTGA AAAATTTTCT GACACTGATG ATGCTAGTGT TTTAGGTTAT GATAAAGATT TAATGGATGA ACCTGATGAA GATCCAGATA TACCAGAAGA TGAAGATGAA GAAAACATTT CAATAAATGC TTCAATATTA GAAATGGAAA ATGCATTGCT ACCCAAAGTA GTAAGCATCT TAGATTCAGT AATTGCATCA GCAGAAAAAA TTTTAGAACT CAAAAAACAA TATCAAGGTA AAATTAACCA AGATATTGAA AAACAATACA ATGACTTACA CGATAGCATA TGGGAAATGA TATATCAAAT TAAACTTAGT AATTCCGCAG TCCTATCAAT CACACAACAA ATATATAGCT TAAGTAAATC AATAGCAGCA GAAGAGGCAA AAATTATTTC CCTTGCAGAA AGCTATGGTA TACAACGCAA AGACTTCTTA GATGCATATA ATACTAATTC TGTATTGCAA AAAAAAGGCA CTTCTCCTCA ATGGGATAAC ATGTTACTCA ATGAAGAGAG TAATATAGTA AGTATGTACA GTAAAATTAA GCTACTGTCT GGAGAAAATA ACCTTACCGA ATTCAAAGCA TTAGTCACAA AAATACAAAA GCACGAACGT GCAGCAAATC AAGCAAAGCA AGAAATGATA AAAGCAAATC TAAGGTTAGT AGTATCGATT GCAAAAAAAT ATTCAAATCG AGGTTTACAG TTTCTAGATT TAGTACAAGA AGGGAATATT GGCTTAATGA AAGCAGTAGA TAAATTTGAT TACAAACGTG GATATAAATT CTCAACTTAT GCAACGTGGT GGGTGAGGCA AGCTATCACT AGAGCAATAG CAGATCAGGC ACGAACTATT AGAATTCCTG TACACATGAT TGAAACAGTA AATAAAATCA ATCGCACACT AAGACAAATG CTACACGAAA TGGGTAGAGA ACCAACATTA GAGGAATTAT CAGCAAGATT AAACATTAAT GTAGATAAGA TACGTAAGGT AATGAAAATA GTTAAAGACC CCGTAAGCTT AGAAAGCCCC ATAGGAGATG ATGACAGTAG TACCTTTGGT GATTGCATAG AAGATAAACG TGCAGTAAAG CCAGAAGATG CAGCAGTACT TGCAGACTTA CGAGAAATAA CTACAAAAGT ATTATCAACA TTAACACCAA AAGAAGAGAG AATCTTACGT ATGCGGTTTG GAATAGGTAA AGGTGGAAAA GACCATACCT TAGAAGAAGT AGGAAAACTA TTCAATGTAA CAAGAGAACG TATACGACAA ATTGAAGCAA AAGCTTTACG TAAACTACGT CATCCAAGTC GTGCAAGAAA GCTTAGAGGA TTTTTCTAA
|
Protein sequence | MKDLQTDKEL LRTLVSKGLK QGFVTFNDIN DVFSDYVLSS DNIDETISML QDSGINVLEQ SDEDEANSII EENRIKDEMD EDDVSDTSIK SWDFGQTDDP IRMYLCEMSS VELLSREGEI EIAKKIKSEK VNMLRSLVES PIVLRTFMSW RDDLVNEQIM LRDLIDLDAN YRYEFPEKFS DTDDASVLGY DKDLMDEPDE DPDIPEDEDE ENISINASIL EMENALLPKV VSILDSVIAS AEKILELKKQ YQGKINQDIE KQYNDLHDSI WEMIYQIKLS NSAVLSITQQ IYSLSKSIAA EEAKIISLAE SYGIQRKDFL DAYNTNSVLQ KKGTSPQWDN MLLNEESNIV SMYSKIKLLS GENNLTEFKA LVTKIQKHER AANQAKQEMI KANLRLVVSI AKKYSNRGLQ FLDLVQEGNI GLMKAVDKFD YKRGYKFSTY ATWWVRQAIT RAIADQARTI RIPVHMIETV NKINRTLRQM LHEMGREPTL EELSARLNIN VDKIRKVMKI VKDPVSLESP IGDDDSSTFG DCIEDKRAVK PEDAAVLADL REITTKVLST LTPKEERILR MRFGIGKGGK DHTLEEVGKL FNVTRERIRQ IEAKALRKLR HPSRARKLRG FF
|
| |