Gene ECH_0056 details


Gene Information

Locus tag: ECH_0056
Symbol:
ID: 3927167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 49530
End bp: 50696
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 31%
IMG OID: 637901180
Product: putative exodeoxyribonuclease VII, large subunit
Protein accession: YP_506887
Protein GI: 88657931
COG category: [L] Replication, recombination and repair
COG ID: [COG1570] Exonuclease VII, large subunit
TIGRFAM ID: [TIGR00237] exodeoxyribonuclease VII, large subunit
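
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


print(gene_length(49530, 50696))  # 1167
print(protein_length(1167))       # 388
```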


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCAG AATTTACTGT TAGTGAAATC ACTAAAATTT TCCAAAATTT TGTACATGAA 
ACGTTTACTC ATATAAAAGT TAGAGGAGAA ATCAGCAATT TATCACAACC AAAATCTGGG
CATACATATT TCACATTAAA AGATGATGCT GCTGTACTCA ATGCAATATG CTGGAACAAT
ACCAAAGTTG AATTTGATTT AAAAAATGGA TTGGAAGTCA TATGCTCCGG GTTCCTAACA
ACCTACCAAT CAAAATATCA GCTAATAACA GAAAACATGT TGCTAGCCGG AATAGGCAAC
TTGAAAATAA TGCTTGAACA AAGAAAAGCA AAATTAGAAA AAGAAGGACT TTTTGATCAA
TCAAACAAAA AACCTTTGCC TTTACTACCT AAAATTACAG GTGTAATCAC ATCTACTACT
GGAGCAGTGA TTAACGACAT ACTAAACAGA GTGAAAAGCC GCTTTCCAAG TCACATAGTT
ATATCTCCAG TATCCGTACA AGGCAATGAA TCTATCAACC AAATTATAGA TGCAATATCA
AAACTAAACA ACGCCGATAC AAATAAACCA GACGTAATCA TTATCGCCAG AGGAGGAGGC
AGTATAGAAG ATTTATGGAT TTTTAATGAT GAATCAATAG TAAGGGCAGT AGCTAGATCT
AGCATTCCTA TAGTTTCTGC AATCGGTCAT GAAACTGACT TTACTTTAAT TGATTATGCA
GCAGATGTAC GTGCTCCTAC ACCTACAGCA GCAGTAGAAA TTGTTTTGCC AACAAAAACC
CAACTCATAG AACATATAAA CAGTAAATTC AACAAAATAA AGACAACTTT ACACTATAAA
ATAAATAAAA AAAAAGAGAG GCTGTTTTAT TTACACAACA ACTTAATCAA AACTAAACAT
CAAATTAAAG TACTAAAACT TCAACTATCT GAATACAAAA ACAAAATAGA AGTATTGCTA
AAAATACTGC TATTAAATAA GAAACAATCC CTAAACGCGC TATATAATAA AATCAATAAA
TTTAACAAAG AAAAAACTTT AGAAGCAGGA TATGCTGTAT TATACGATAC AAACCGTAAC
CACATCAGCA GTATAAAAAA ACTAAAATCA AATGATATTA TATCAATTGA ACTAAAAGAT
GGTATAATAG AAGCTATAAT AAAATAA
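
A GC content figure like the one in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene and that the blocks of ten bases are separated only by whitespace; `gc_percent` is a hypothetical helper, not a function of the source database.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence (all rows) as one string.
# Only the first row is shown here, so the result is not the whole-gene value.
first_row = "ATGACACCAG AATTTACTGT TAGTGAAATC ACTAAAATTT TCCAAAATTT TGTACATGAA"
print(round(gc_percent(first_row), 1))
```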
 
Protein sequence
MTPEFTVSEI TKIFQNFVHE TFTHIKVRGE ISNLSQPKSG HTYFTLKDDA AVLNAICWNN 
TKVEFDLKNG LEVICSGFLT TYQSKYQLIT ENMLLAGIGN LKIMLEQRKA KLEKEGLFDQ
SNKKPLPLLP KITGVITSTT GAVINDILNR VKSRFPSHIV ISPVSVQGNE SINQIIDAIS
KLNNADTNKP DVIIIARGGG SIEDLWIFND ESIVRAVARS SIPIVSAIGH ETDFTLIDYA
ADVRAPTPTA AVEIVLPTKT QLIEHINSKF NKIKTTLHYK INKKKERLFY LHNNLIKTKH
QIKVLKLQLS EYKNKIEVLL KILLLNKKQS LNALYNKINK FNKEKTLEAG YAVLYDTNRN
HISSIKKLKS NDIISIELKD GIIEAIIK
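
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce it, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it assumes the sequence is already in frame and starts with ATG, as this gene does. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last stretches of the gene sequence above.
print(translate("ATGACACCAG AATTTACT"))            # MTPEFT
print(translate("GGTATAATAG AAGCTATAAT AAAATAA"))  # GIIEAIIK
```

Both outputs match the corresponding ends of the protein sequence; a non-ATG start codon would need separate handling, since this sketch does not force the first residue to methionine.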