Gene ECH_0922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0922 
SymboldnaE 
ID3927989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp945609 
End bp948971 
Gene Length3363 bp 
Protein Length1120 aa 
Translation table11 
GC content31% 
IMG OID637902039 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_507712 
Protein GI88658193 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAAA TATTTGTTCA TCTTAGGTCT CATAGTGATT ATTCCTTACT TCATGGAATG 
ATAAAAATAG ATTCTTTGGT GAACTTATGT GTGCAGTATA ATATGCCTGC ATTAGCATTA
ACAGATTCAG GTAATTTATT TGGATCTTTG GAATTTTCTG ATTATGCATC GAGTTTAGGT
GTACAGCCTA TTATAGGCTG TAACATAATG ATGAGCTATA ATGGTGATAG TATTGGAGAA
CTGGTATTAT TAGTAAAAGA TCAAATAGGA TATAATAATA TCGTTAATCT TGTTAGTAAT
TCATTTAAGG ATAGCCAGTC CAACAAAGTT AATAGAGTAG ATTTAGAAAA ACTTATTGAT
CTGAAAGATG GTATTATTGT TTTAACAGGA GGTCATGATG GATTTTTATC CCAACTTTTG
TTAGGTAATG TTATTGACTA TGGCATTATA GATAAGTTGT TACTTGCTTT TAATGGGAAT
CTTTATGTTG AACTTCAACG TCATGGAATG GAAGAGGAAA AAATCATAGA AAAAACACTT
GTTAATTTTG CTTATGAGAA AGATTTGCCA TTGGTTGCCA CGAATGATGT GTTGTTTGCT
AAGAAAAATG ATTTTTTAGC GCATGATGTT TTATCTTGTA TATCAGATGG TAATTATATT
GCACAGGAAG GCCGAAAGAT GTTCACAGAA GAGCATTATT TTAAGTCTTC TGATGAGATG
TATAGGTTGT TTAAGGATAT TCCTGAAGCA GTTTCTAATA CTGTGTTAAT AGCACAGCGT
TGTTCTTTTA TGCCAGAAAC CAGGAAACCT ATGTTACCTC ATTTTCCTTG TTCAAGTGGT
AAAAATGAGA GTCAAGAATT AATAAGTCAA GCTATTAATG GATTGGATAA TCGTTTAAAA
AGTAAAAGTT TATCTACAGA ACAAGTTTCC AAGTACTATG AAAGACTGCA TTATGAATTA
GATATTATTA TCTCAATGGA TTATGCTGGG TATTTTTTGA TAGTATCGGA TTTTATATGT
TGGAGTAAAA GAAATGATAT AATGGTAGGT CCTGGTAGAG GATCTGGAGC TGGTTCTCTT
GTTGCTTGGT CTTTGCAAAT TACTGATCTT GACCCTATAG AATTTGGTTT AATTTTTGAA
AGATTTTTAA ATCCTGACCG TATTTCAATG CCAGATTTTG ATATTGATTT TTGTCAGGAA
AAGAGAGATT ACGTTATAGA GTATGTAAGA AAGAAATATG GTTACGTTGC TCACATTATA
ACTTTTGGAA AATTACAAGC TAAGGCTGTG CTTCGTGATG TGGGTAGAGT TATGCAGATG
CCTTATTTCC AGGTGGATAG AATCTGTAAA ATGATTCCTC ACAACCCAGT TAAGCCTGTT
ACTTTATCTG AAGCAATAGA GATGGATAAA AATTTGCAGA AAGAGCAGGA TGATGATGAA
ACAGTTGCTA AATTATTAGA AATATCATTA AAGCTTGAAG GGCTGTATAG ACATGTTTCG
ATACATGCTG CAGGTATTGT AATTTGTGAT AGAGAATTGG AAGAATTACT GCCTTTATAT
TATGATAGCA CGTCTTCTCT TCCTATTACA CAGTATAATA TGAAATATAC TGAGAAAGCA
GGATTAGTGA AATTTGATTT TTTGGGATTG CGGACTTTAA CTGTAATAAA TCAAATTTGT
CATTTAGTTA ATAGGGGAGG TCACAGTGTT GATATATCAC GTATTCCTTT AAATGATAGA
AAAACATATG AAATGTTGTC TGCAGGTGAT TCTGTTGGTG TATTCCAGCT TGAAAGTTCG
GGTATGAGGG AAGTAATTAG TAAGCTGAAG CCAGATAATA TAAATGATAT TATAGCATTG
ATTTCTCTGT ATAGGCCAGG TCCTATGGAT AACATTTCTA TATATATAGC ACGCAAGCAT
GGGTTTGAAA AACCAGATTA TATACATCCA ATTTTAGAAG ATGTCCTTAG AGAAACTTTT
GGGGTTATAA TTTATCAAGA GCAAGTAATG GAAATTGCCA AGATTATGGC TGGATATAGT
TTAGGTGAAG CAGATTTGTT AAGGCGTGCT ATGGGAAAAA AGATTAAGGA GGAGATGGAT
AATCAACGCA GAACTTTTAT AAATGGTGCT GTTAGCAATG GAATAGAGGA AGAGAAGGCA
AGCTATATTT TTGATTTAGT AGCAAAATTT GCGGGATATG GGTTTAATAA ATCACATGCT
GCAGCTTATG CTTTGATCAG TTATCAGACA GCTTATTTGA AAGCTAATTA TACGTTGGAA
TTTTTTACTG CTTCGATGAA TTTAGATATA ATGGATAAGG ATAAGTTAGA AATGTTGTGT
CATCAAGCAA AGTTGCATGG TATTGAAATA TTACCTCCGG ATATTAACTC TTCTAAAGTA
TTGTTTACTA TTGAGGGTGC ATCGTCTATT AGATATGCCC TTGGAGCTCT TAAGAATGTA
GGACAGCATT CAGCAAAAGA GATAGTAGAC GATGTTGCTT ATAAAGACAT ATGGGATTTT
ATTGACAGAG TAAGTACTAA ATGTGTGCAC AAGAGAATAC TGGAAAGTAT TATTAAAGCG
GGGGTTCTGG ATAGTATTCA TAGCAATAGA AAGCAGCTTT TTGAATCAGT ATTTTTATTT
TTAGATATTA TTGAATATAA TAAGTATAAT GCTAATTTTA ACCAATTTAG TTTGTTTAAT
GATAAAAATC ATTATAAGTT ATCAGAAACT GATGATTGGA CTAAAGAAGA AAAGCTGAAT
AATGAGTTTT CCTCTATAGG TTTTTATCTT AATCATCATC CCATGGAAAA CTATAAGTAT
CTTTTAGACA AATTGAATAT AGGTTTTATT CACTACGATG ATAAATCTTC TTATAATAAT
GTGATTGTAG GTATAATTTC TAATGTTAAA GTGCGTTCTA CAAATAAGGA TAAGTTTGCA
GTTGTAACTC TTTCTGATCC TTTTAACATT CATGAAATAG TTTTTTATAA TGGAAATATA
ATAGAAGATA ATAAAGACTT ATTTACTACT GGTGCATCTG TAATTATTGA GATGGATAAT
ACTTTTTATA GTGCCTCAGT ACGATTACTT GGTAAAAATA TCTATAGTTT TGAAAACAAA
ATTTCTTCTA TTATAAAAAC TATGGTTATA CATGTTAATG CTAAGAATAG TGTTGTAATA
AAAGAGCTAT CTAGTTTGTT GCAGAATAGG GGATCTACTG TTGTATTAAT TGATCTGGTT
CTTCCAAATG ATCATGTGAC AATTCAATTG CCAAATAGCT TTTTAGTTAC TCCTTTAATT
TTTGCACAAA TTTTTAAGTT AAATTGGGTA AAGGATATTG AGATTAATAG CATTTTAGTT
TAA
 
Protein sequence
MSQIFVHLRS HSDYSLLHGM IKIDSLVNLC VQYNMPALAL TDSGNLFGSL EFSDYASSLG 
VQPIIGCNIM MSYNGDSIGE LVLLVKDQIG YNNIVNLVSN SFKDSQSNKV NRVDLEKLID
LKDGIIVLTG GHDGFLSQLL LGNVIDYGII DKLLLAFNGN LYVELQRHGM EEEKIIEKTL
VNFAYEKDLP LVATNDVLFA KKNDFLAHDV LSCISDGNYI AQEGRKMFTE EHYFKSSDEM
YRLFKDIPEA VSNTVLIAQR CSFMPETRKP MLPHFPCSSG KNESQELISQ AINGLDNRLK
SKSLSTEQVS KYYERLHYEL DIIISMDYAG YFLIVSDFIC WSKRNDIMVG PGRGSGAGSL
VAWSLQITDL DPIEFGLIFE RFLNPDRISM PDFDIDFCQE KRDYVIEYVR KKYGYVAHII
TFGKLQAKAV LRDVGRVMQM PYFQVDRICK MIPHNPVKPV TLSEAIEMDK NLQKEQDDDE
TVAKLLEISL KLEGLYRHVS IHAAGIVICD RELEELLPLY YDSTSSLPIT QYNMKYTEKA
GLVKFDFLGL RTLTVINQIC HLVNRGGHSV DISRIPLNDR KTYEMLSAGD SVGVFQLESS
GMREVISKLK PDNINDIIAL ISLYRPGPMD NISIYIARKH GFEKPDYIHP ILEDVLRETF
GVIIYQEQVM EIAKIMAGYS LGEADLLRRA MGKKIKEEMD NQRRTFINGA VSNGIEEEKA
SYIFDLVAKF AGYGFNKSHA AAYALISYQT AYLKANYTLE FFTASMNLDI MDKDKLEMLC
HQAKLHGIEI LPPDINSSKV LFTIEGASSI RYALGALKNV GQHSAKEIVD DVAYKDIWDF
IDRVSTKCVH KRILESIIKA GVLDSIHSNR KQLFESVFLF LDIIEYNKYN ANFNQFSLFN
DKNHYKLSET DDWTKEEKLN NEFSSIGFYL NHHPMENYKY LLDKLNIGFI HYDDKSSYNN
VIVGIISNVK VRSTNKDKFA VVTLSDPFNI HEIVFYNGNI IEDNKDLFTT GASVIIEMDN
TFYSASVRLL GKNIYSFENK ISSIIKTMVI HVNAKNSVVI KELSSLLQNR GSTVVLIDLV
LPNDHVTIQL PNSFLVTPLI FAQIFKLNWV KDIEINSILV