Gene ECH_0048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0048 
Symbol 
ID3928027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp43952 
End bp45022 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content27% 
IMG OID637901172 
ProductTPR domain-containing protein 
Protein accessionYP_506879 
Protein GI88658481 
COG category[R] General function prediction only 
COG ID[COG4976] Predicted methyltransferase (contains TPR repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.24301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTA ACTTCTTCAT AACAAAGCGT AATGTATCGT CTTTAATATC GAGTATTAAT 
AAAACTATAC CGAAACCACA AGATATTTCT CAATCTATAA AAAACAAAAT CACTTCCATA
AAAAAGGAAG CCATTCTATT ACAAAATAAG CTAAAAAACT TACTTGAAAC TAATATAGAT
CTTGGTTTAT ACCACTTTTA CAAAGGAAAT ATATCTGATG CAAAATTCAG ATTTCGCCTT
ATTAGTATCT TTAAACCTAA ATTACCAGTA GTATATTATA ATATTGGAAG ATGTTACTTT
ACACTACAAA ACTTCAATAA AGCACAACAA AATTTTACAC GTGCAATAGA ATTAGATAAT
AATTATGCAG ATGCTTTATA CTACTTAAAC AAAATAACAA ATCCAGAAAG TATTGTATAT
GTGCCTGAAA ACATTATAAA GCAATATTTC GATTACACTA GCGAACATTT TGTAGAACAT
TGGCTTATAG CAAAACAATA CAAAGCACAT GAATATGTTA AGTCTCTAAT AATAAACTTT
TTTGGAAATA AATCTTCATA CTTAAATATT TTAGACCTTG GATGTGGTAC TGGTATATGT
GGTCAATTTT TAAAAATGAA AAGTATAGGT AATCATATAA CAGGCATTGA CATATCAAAC
AAAATGATAA ATATAGCAAG AGGCTGTTTT GTAAACGGTA AACAAGCTTA TAATGAATTA
ATAAATATAA GCATTTTTGA TTTTCTTAAG AAAAACCAAA ACAAGAAAAA ATACAACGTT
ATCATTCTAA CTGAAGTACT ACAGTACACA GGCAGTTTAA ATCCTATTCT TAAATTACTA
AAAACAATGT TAGAGACAGA TGGTATTATT ATCGGACTTG CAAGAAGAAA GAAAGGATCA
GGTTTCCAAT TTATAAATGA AGGAGATTTC TTTTGCCATT CAGACAAGTA TATAAAATCA
TCTATTATAG AATCAGGATT ACAGTGTAGC TATTCTAGCT ACTGTAAAAT ATATGGATCA
CAAGTCGAAG GAATACTTTT TGTTGCACAA TCTAACAAAA TTGAAGTTTA A
 
Protein sequence
MKSNFFITKR NVSSLISSIN KTIPKPQDIS QSIKNKITSI KKEAILLQNK LKNLLETNID 
LGLYHFYKGN ISDAKFRFRL ISIFKPKLPV VYYNIGRCYF TLQNFNKAQQ NFTRAIELDN
NYADALYYLN KITNPESIVY VPENIIKQYF DYTSEHFVEH WLIAKQYKAH EYVKSLIINF
FGNKSSYLNI LDLGCGTGIC GQFLKMKSIG NHITGIDISN KMINIARGCF VNGKQAYNEL
INISIFDFLK KNQNKKKYNV IILTEVLQYT GSLNPILKLL KTMLETDGII IGLARRKKGS
GFQFINEGDF FCHSDKYIKS SIIESGLQCS YSSYCKIYGS QVEGILFVAQ SNKIEV