Gene ECH_0750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0750 
SymboltopA 
ID3927721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp754252 
End bp756714 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content31% 
IMG OID637901868 
ProductDNA topoisomerase I 
Protein accessionYP_507548 
Protein GI88657600 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAA TTATAGTAGA ATCCCCATCT AAAGCAAAAA CTATAAGTAA ATATTTAGGT 
AGCAAGTATA AAGTTGTTGC ATCATTTGGG CATATTAGGG ATTTTCCAGC AAAAAGTGGT
TCTGTTGATC CGGATAAAGA TTTCGCTATG ACTTATGAAA TTATTAGTAA GTCAGAAAAA
TATGTAAATA AGTTAATTCA GGTAGCTAAT AAGGAAACAC AAGGAATTTA TCTTGCAACT
GATCCAGATC GTGAGGGTGA AGCAATAGCA TGGCATATTA TAGAAGTATT GAAGGAAAAT
AATGCTATTG ATGATAATGT TGTAATTAAT AGAATGGTTT TTAATGAGGT AACTAAAAGT
GCTATTCTTG AATCTATAAA ACGCCCAAGA AGTATCAATA TGGATCTTGT GTATGCACAG
CAAGCACGTA GAGCATTAGA TTATTTAGTT GGTTTTACGT TATCTCCTTT ATTATGGAGA
AAACTTCCGG GTAGTAAATC TGCTGGTAGA GTTCAATCTG TTGCTTTAAG GTTAATATGT
GACAGGGAAA GTGATATTGA AAAGTTCATT ACACAAGAAT ATTGGGATAT TGAGGCTAAA
TGTTGCAATA CAGAAGGTAA GCAGTTTTCT GCATTTTTAA GTTGTTATGC TGGTAAAAAA
TTAGAGAAGT TTGATATTAA GAATGAAGAA GATGCACAAA AGTTGTCAGA AGAAGTAAGA
TCTCGTAGTT ATTCTGTTAT AAATGTTGAG AAAAAGCATA CAAAGCGTAA TCCTTATCCT
CCTTTTATTA CATCAAGTTT ACAGCAAGAA GCATCAACAA AGCTTGGATT TACTGCAAAA
AATACTATGC TTATTGCTCA AAAACTGTAT GAAGGAATTG ATGTTGGAGG GGAAATTGTT
GGACTCATTA CTTATATGAG GACGGATGGA TTTTATATTG CAGATGAAGC ATTAGAGCAT
ATTAGGAGTA TCATTGTTTC TATGTTTGGT AAAGAATATT TACCGGAATC TGCTCGCAAG
TATGTGAAAA AGGTAAAAAA TGCTCAAGAA GCACATGAAG CAATTAGACC AACGAATATT
ACAATTACTC CTGGTAGTTT ATCAAATTAT CTCACTCCAG AGCAACTCAA GCTTTATGAT
CTTATATGGA AAAGGACTGT TGCAAGTCAG ATGAATTCTG CTGTAATTGA TCAGGTTATA
GTAGAATTAG GGTCGCTGGA TAAAGTAGTA ACTTTGCGAG CCACAGGTTC TTCCTTGTTT
TTTGATGGAT TTTATAAGGT GTATGGTCAT GGTGAGGATG ATGATAGTAA AAATAATATG
CTACCGTTAC TTCAAAAAAA TGATAAATGC CCTTTAGATG AAGTGGTACC TCATCAACAT
TTTACTAATC CTCCTGCTAG ATATAATGAA GCAAGTTTAG TGAAGAAAAT GGAGGAAATA
GGGATAGGAA GGCCTTCTAC TTATGCTTCT ATTATTTCTG TATTGCAAGA TAGACAGTAT
GTTATACTAA ATCAAAAAAG ATTTTTTCCA ACAGAGAGAG GTAGAATAGT AAATGTTTTT
CTTGTTAACT TTTTTAGTCG CTATGTAGAA TATGATTTTA CTGCTGACCT TGAAGAAAAG
TTGGATTTAA TCTCTAATGG GAAGATTAAT TGGAAGGAAG TACTACATCA ATTCTGGTTT
CTGTTTATTG ATAATGTTAA TTCAGTAAAA AAACTAGAAT TTACTGAAGT ATTAAAAGTT
ATTAGTTGTG AGATGGAAAA TTATATTTTT TCTTCTGATA AATCTGAATC TAGTAGAGTA
TGTCCTGTAT GTAATGATGG AACATTAGAG CTTAATATTA GTAAGAATGG TATTTTTTTA
GGATGTTCTA AATATCCAGA GTGTAAGTAT ACAAAAAATG TTGGAAGTGG TATTGTTGAT
GATACAGACA ATACACAATA TCCTAAAATA TTAGGTATTG ATGCTGCAAC AGGACAAGAA
ATATATCTTA AAAAAGGTCC GTATGGTCTC TATTTACAGA TTGGTAATGA TAAAAGTGGT
AAACGTGCAG CTATTCCAAA AAATATAGAT GCTAATTTAT TAGATTTGCA AATGGCAGAA
AAAATATTAA GTTTACCTAT AAGCTTAGGT ACTCATCCGG ATACTGGTCA AGAAATTAAG
TTAGGACTTG GTAAGTTTGG TGCATACATA TTGTATGGTG GTAAGTATTT TTCAGTAAAA
AATGAGCATA ATTTTTTAGA TATTAGCATT GAAAACGCTA TTAATATTAT TGCGAATAAT
TCTACAAGTG GGGTTATAAA GTCTCTGGGT ATCACTGAAG ATAATAAAGA AGTATTCTTA
TGTAAAGGAA GATATGGATT CTATTTAAAA TATGATAAGT TGAATGTTGC TTTGAAGAAA
GATACAGATA TTGAACAAAT ATCTTTATCG GATGCTATTA AGCTCATATC AGAAAAATTA
TAA
 
Protein sequence
MSLIIVESPS KAKTISKYLG SKYKVVASFG HIRDFPAKSG SVDPDKDFAM TYEIISKSEK 
YVNKLIQVAN KETQGIYLAT DPDREGEAIA WHIIEVLKEN NAIDDNVVIN RMVFNEVTKS
AILESIKRPR SINMDLVYAQ QARRALDYLV GFTLSPLLWR KLPGSKSAGR VQSVALRLIC
DRESDIEKFI TQEYWDIEAK CCNTEGKQFS AFLSCYAGKK LEKFDIKNEE DAQKLSEEVR
SRSYSVINVE KKHTKRNPYP PFITSSLQQE ASTKLGFTAK NTMLIAQKLY EGIDVGGEIV
GLITYMRTDG FYIADEALEH IRSIIVSMFG KEYLPESARK YVKKVKNAQE AHEAIRPTNI
TITPGSLSNY LTPEQLKLYD LIWKRTVASQ MNSAVIDQVI VELGSLDKVV TLRATGSSLF
FDGFYKVYGH GEDDDSKNNM LPLLQKNDKC PLDEVVPHQH FTNPPARYNE ASLVKKMEEI
GIGRPSTYAS IISVLQDRQY VILNQKRFFP TERGRIVNVF LVNFFSRYVE YDFTADLEEK
LDLISNGKIN WKEVLHQFWF LFIDNVNSVK KLEFTEVLKV ISCEMENYIF SSDKSESSRV
CPVCNDGTLE LNISKNGIFL GCSKYPECKY TKNVGSGIVD DTDNTQYPKI LGIDAATGQE
IYLKKGPYGL YLQIGNDKSG KRAAIPKNID ANLLDLQMAE KILSLPISLG THPDTGQEIK
LGLGKFGAYI LYGGKYFSVK NEHNFLDISI ENAINIIANN STSGVIKSLG ITEDNKEVFL
CKGRYGFYLK YDKLNVALKK DTDIEQISLS DAIKLISEKL