Gene ECH_0883 details


Gene Information

Locus tag: ECH_0883
Symbol:
ID: 3927519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 905243
End bp: 906370
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 31%
IMG OID: 637902000
Product: putative DNA processing protein DprA
Protein accession: YP_507678
Protein GI: 88657722
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
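
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(905243, 906370)
print(length)                  # 1128
print(protein_length(length))  # 375
```

Both values agree with the Gene Length and Protein Length fields reported for ECH_0883.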


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.430936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA ATAAAGAATT ATCCCACCAA GAACTGATAT CATGTCTTAG AGTAATAAGA 
ACACCAAACA TAGGTCCATC AACATTTCAT GCATTAATTA AGCTATATAA AACTTGTCAA
CATATACTGG AAGTTCTACC AAACTTAATA AAAAAATCTA AAATTAATAA TAAAATTCAC
AACATATGCT CTATTGAAGC AGCAGAACTA GAAATTGAAA ATACTACTAA GATTGGTGGA
AAAATAATTA CTGTATTTGA TGAAGACTAC CCAGAAATTT TACGTAATAT TCACGATTAT
CCACCAGTTA TAACAGTACT AGGAGACTCA TCACTGTTAA AAGAAAAAAC AATAGGAATA
GTAGGAAGCA GGAACCCTTC CATCAATGGA AAAAATTTTG CTTATAAGTT ATCATACGAA
TTAGCTAACT CTGGTTTTGT TATAGCGTCC GGATTAGCAA GAGGAATAGA TAAATCTGCA
CATAGTATAA TTTACCAACA ATTACCAACA ATTGCTGTCA TGGCCAGCGG AGTCAACATA
GTATACCCGC AGGAAAATAT ACATTTATAT AACACCATAG TAGATAAAGG AGGATTAATA
ATCACGGAAT TTCCTTTTTC TACATTACCA AGGGCTCAAT TATTTCCACA ACGTAATCGT
ATAATTTCTG GATTATCACT TGGAGTAGTA ATTGTCGAAG CATCTATACA ATCAGGATCA
CTTATTACAG CAAATTTCGC TTTAGAACAA AATAGAGAAG TATTTGCAGT TCCTGGGTCA
CCACTTGACC ATAGGTGTAG AGGAAGTAAC AGTCTAATAA AAAACGGAGC AAAATTAGTA
GAATTTACAC ACGATATCAC AGAAAGTTTA CAATTTAACA ATAATAAACC TTACATACAA
CAATCAATAT TCGATAATAC AACAAAAAGT GATAATAACC TTTTTGAAAT CAATAATGCA
AAAGATACTA TTCTGCAATA CATAACCCAT AGTCCAACCG AAATTGAAGA AATCATTGCG
TCTACTAATT TGAACATCAG TAGTATATTA ATAGCCTTAA TTGAGCTAGA AGCAGCACAA
AAAATAGAAA GATTTCCTAA CAATAAAGTA GCTTTAATGC ACTACTAG
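
The gene sequence is printed in groups of ten bases, so the spacing has to be stripped before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the example input is only the first two groups of the sequence, not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two groups of the gene sequence, used here only as a small example.
first_groups = "ATGAAAATCA ATAAAGAATT"
seq = clean_sequence(first_groups)
print(len(seq))         # 20
print(gc_percent(seq))  # 15.0
```

Applying the same two functions to the full sequence block would give the length and GC percentage of the whole gene, to compare with the 1128 bp and 31% reported in the record.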
 
Protein sequence
MKINKELSHQ ELISCLRVIR TPNIGPSTFH ALIKLYKTCQ HILEVLPNLI KKSKINNKIH 
NICSIEAAEL EIENTTKIGG KIITVFDEDY PEILRNIHDY PPVITVLGDS SLLKEKTIGI
VGSRNPSING KNFAYKLSYE LANSGFVIAS GLARGIDKSA HSIIYQQLPT IAVMASGVNI
VYPQENIHLY NTIVDKGGLI ITEFPFSTLP RAQLFPQRNR IISGLSLGVV IVEASIQSGS
LITANFALEQ NREVFAVPGS PLDHRCRGSN SLIKNGAKLV EFTHDITESL QFNNNKPYIQ
QSIFDNTTKS DNNLFEINNA KDTILQYITH SPTEIEEIIA STNLNISSIL IALIELEAAQ
KIERFPNNKV ALMHY
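
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch, the code below builds a codon table from the standard amino-acid assignments, which table 11 shares with the standard code (the two differ in which start codons are permitted, which does not matter for this ATG-initiated gene). The `translate` function is a hypothetical helper, not a library call. It stops at the first stop codon and raises `KeyError` on any base outside A, C, G, T.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence.
print(translate("ATGAAAATCA ATAAAGAATT ATCCCACCAA"))  # MKINKELSHQ
```

The output matches the first ten residues of the protein sequence, and the final six codons of the gene (GCT TTA ATG CAC TAC TAG) translate to ALMHY followed by a stop, matching its last five residues.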