Gene ECH_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0785 
SymboluvrA 
ID3927195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp799533 
End bp802418 
Gene Length2886 bp 
Protein Length961 aa 
Translation table11 
GC content30% 
IMG OID637901903 
Productexcinuclease ABC, A subunit 
Protein accessionYP_507583 
Protein GI88657960 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATT TGATTAGTAT ACGTAATGCT AAAGAACATA ATCTTCACAA CATAAATATA 
GATCTTCCAA AAAATAAGTT AATTGTAATA ACTGGAGTTA GTGGATCTGG AAAATCCAGT
CTCGCATTTG ATACAATATA TGCAGAAGGG CAGCGCAGAT ATGTAGAAAG TCTATCATCC
TATGCTCGTC AGTTTTTAGA TTCATGTACA AGACCAGAAG TAGAATCAAT CACAGGTCTT
TCTCCAACTA TTGCAATCAG TCAAAAGCTT TTATCGAAAA ATTCAAGATC TACTGTATCA
ACTACTACAG GAATTTATGA TTATTTAAGA ATCATGTATT CCAAGATAGG AACACCATAT
TCCCCAATAA CTGGATTACC TATAACAAAA CAATCACTTT CTCAAATAGT AGAAACTATC
TTGAAATTAC CAATAGGAAC AAAAATTTCA ATTTTAGGTA TACTAATAAG AGGGAAACGA
GGAGAACATA ATAAAGAAAT ATATGAAATC AAAAAACAGC ACTACAAACT ATTAAAAATT
GATGGTATTA CTTATGATAT TAACAATATT CCACAACTAG ATAGAAATAA AAAGCATGAT
ATTCATGTAA TCATAAATCA GATATCAGTT CTAGAAAATT CTATTGACAA TATAACTGAA
AACATAAAAA CTGCGTTAAA AATAGGTAAT GGTATAACTC ACATAGAAAT ACTTGATTTA
CCACAAGAAT ACAAAAGTGA CACGTATTAC AAGAATCAGG TACTGGTTTT TTCAGAAAAT
TTTGCATGTC CTGAAACAGA ATTTAGCATA GAAGAAGTAG AACCAAGATT ATTTTCATTC
AATACATCAT ATGGATCATG CAAAATTTGT AGTGGAATAG GAAAAAAATA TAGCGTTGAC
AAAAATCTAG TCATAAAAAA TGATAATTTA TCAATTTCAG AAGGAGCTAT ACATCCTATA
GGCCCAATTG ATATAAAAAC CATTAATTAC AAAATTAATT CGCAATTTAG CTATCATTAC
TCAAATATTA TTTTGTCCTT AGCAAAACAT TATAGTTTTG ATTTAACAAC ACCATGGAAA
AAATTAGATG ATAATATAAA AGATATAATT TTATTTGGCA GCAAAGAAGT ACAAATACCT
ACACATTATC AACACGAAGG ATATAAATAT TCTAGAAACA AACCATTTGA AGGCATTATA
AATATCTTAA ATAACAAAGA AGACATAGTT ATCGAAAAAT TAGCTGAACA ATATTCATCT
ATAAACATAT GTAATGAATG CCAAGGATTT CGACTAAGGC AGGAAGCACT ATCAGTAAAA
ATTGACAATA AACATATCGG AGAATTGACA AGGTTATCCG TTATAGACGC AATATCATGG
TGTAAAGCAC TACCGTCTAG ACTCAATGAT CAACAGAAGC ATATTTCACA TAAAATATTA
GACGAAATCA TTAAAAGACT AATATTTTTA AAAAATGTTG GATTAGGTTA TTTAACACTC
GATCGTGAAT CAGGAACATT ATCTGGAGGA GAAAGTCAAA GAATAAAACT AGCATCACAA
ATTGGATCCG GATTAACAGG AATAATTTAC GTATTAGATG AACCTTCAAT CGGGTTACAT
CAACGTGACA ACACGTTATT AATTAATACA TTAAAAACAT TACGCGATAT AGGAAACACA
GTCATAGTTG TAGAACATGA TGAAGAAACT ATCACAAATG CAGATTATGT AGTAGATATA
GGTCCAGGAG CAGGACAAAA TGGAGGTTAT GTAATTGCAA GTGGTACTCC AAAAGAAATA
ATGGATAATC ATAACAGTAT AACTGGCCAA TATCTCAGTA ACAAAAAAAT AATACCTAAC
TTCAAAAAGC ATAGAACTTT TTACAAATGG ATAGAAATCA TTGGAGCACA TGTAAACAAT
TTACAAAACA TTGATGTTAA AATTCCTTTA CATGCCTTTA CTTGCATTAC TGGAGTATCT
GGAAGTGGAA AATCTAGCTT AATAAACAAT ACTCTATATC AACATGCAAC TGATCAACTA
AATAATATTA ACTGTATATA CAGTAAATGC ACATCTATAT CAGGATTAGA ACATATAGAT
AAAGTAATTA AAATAGACCA GTCCCCTATT GGTAGAACCC CTTTATCAAA TTCAGCAACA
TACACAGGAT TATTTACCCA TATAAGGTCA TGGTTTGCTG AATTACCTTT ATCAAAAGAA
AGAGGATATA CTATAAGCAG GTTTTCATTT AACTCTAAAG GAGGAAGATG TGAAGCATGT
AAAGGGGATG GGCAAGTAAA AATAGAAATG CATTTTTTAC CTGATGTATA TGTCAAGTGT
GAACAATGCA AAGGATCGCG TTATAACAAA GAGACTCTGG AAGTAACACA TCAAGGAAAA
TCTATAGCTG ACATATTAGA TATGACAGTT GATCAAGCAT ACGAATTTTT TATCCATTTA
CCATTAATCA AAGAAAAATT AGATGCCTTA CGCAGCGTAG GCATGGGCTA TATTAAAATA
GGACAAACAT CGAATACATT ATCAGGAGGT GAAGCACAAA GAATAAAATT ATCAAAAGAA
CTATCAAAAC GTTCAACAGG AAAAACATTG TATATATTAG ATGAACCAAC TACAGGATTA
CATTTTGCAG ACATAGATAA TTTATTACAT GTTCTACATA AATTACGTGA TTTAGGTAAT
ACAATAGTAG TAATAGAACA TAATTTACAT GTCATCAAAA CAGCAGAATA CATAATAGAT
ATAGGTCCTA ACGGAGGAGA TGATGGAGGT CAAGTAGTAG CTACTGGGAC TCCTGAAGAA
ATAGTTCAAA ACCCTCATAG CATAACTGGA CAATACTTAA AGCCTTATTT AAATGCCCAA
TCATAA
 
Protein sequence
MTDLISIRNA KEHNLHNINI DLPKNKLIVI TGVSGSGKSS LAFDTIYAEG QRRYVESLSS 
YARQFLDSCT RPEVESITGL SPTIAISQKL LSKNSRSTVS TTTGIYDYLR IMYSKIGTPY
SPITGLPITK QSLSQIVETI LKLPIGTKIS ILGILIRGKR GEHNKEIYEI KKQHYKLLKI
DGITYDINNI PQLDRNKKHD IHVIINQISV LENSIDNITE NIKTALKIGN GITHIEILDL
PQEYKSDTYY KNQVLVFSEN FACPETEFSI EEVEPRLFSF NTSYGSCKIC SGIGKKYSVD
KNLVIKNDNL SISEGAIHPI GPIDIKTINY KINSQFSYHY SNIILSLAKH YSFDLTTPWK
KLDDNIKDII LFGSKEVQIP THYQHEGYKY SRNKPFEGII NILNNKEDIV IEKLAEQYSS
INICNECQGF RLRQEALSVK IDNKHIGELT RLSVIDAISW CKALPSRLND QQKHISHKIL
DEIIKRLIFL KNVGLGYLTL DRESGTLSGG ESQRIKLASQ IGSGLTGIIY VLDEPSIGLH
QRDNTLLINT LKTLRDIGNT VIVVEHDEET ITNADYVVDI GPGAGQNGGY VIASGTPKEI
MDNHNSITGQ YLSNKKIIPN FKKHRTFYKW IEIIGAHVNN LQNIDVKIPL HAFTCITGVS
GSGKSSLINN TLYQHATDQL NNINCIYSKC TSISGLEHID KVIKIDQSPI GRTPLSNSAT
YTGLFTHIRS WFAELPLSKE RGYTISRFSF NSKGGRCEAC KGDGQVKIEM HFLPDVYVKC
EQCKGSRYNK ETLEVTHQGK SIADILDMTV DQAYEFFIHL PLIKEKLDAL RSVGMGYIKI
GQTSNTLSGG EAQRIKLSKE LSKRSTGKTL YILDEPTTGL HFADIDNLLH VLHKLRDLGN
TIVVIEHNLH VIKTAEYIID IGPNGGDDGG QVVATGTPEE IVQNPHSITG QYLKPYLNAQ
S