Gene ECH_0558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0558 
Symbol 
ID3927967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp560279 
End bp562030 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content32% 
IMG OID637901680 
Productputative lipoprotein 
Protein accessionYP_507370 
Protein GI88657823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAGAC GTAATATCTT GAATGTACTG TTGGTATTGA TTTTCTGTTT TGTTATTTCG 
TGTTCAAATA AGAGTAGATA TCAGTTTAGT AAGAAATATT CTCCAGTTTA TAATCCTGAT
GGTGAAGCTT TTGATAGTGA TGTGGGGTTT TCTCGTGCTT ACTCAATTTA TAAAGAACGT
AGAAATGCAC TGGTTTCAGG TATTGATGAA AGCAGTAAGA AAGTTAGTGT AAGACGTAAA
GTAAAGCCAA AAGTTCAAGT TAGAGATGTT GACTTGTTAA AGGAATATGG GGATCTTTTA
AAAGAGGAAA ACTGTGATTT AATGGTAGAT GGTAATGGCG TTAATTTAGT TGATGTTGCA
GGTGCTAAGT TTATTGATCC TATAGAAAAT GATGATGATG TTATAGATCA CGATAATAGA
CATGTTGATG TGAAAAGTTC AGTGGTCAAA ACTAAAGATA AGAATAAGTT ACAAGATGTT
AAGGATAACA AACCTAGTGA TGTTAAGCTT CCAGTAATTA AAGCTGAAGA TAAAAATAAG
TTACAAGATG TTAAGGATAA CAAACCTAGT GATGTTAAGC TTCCGGTAAT TAAAGCTGAA
GATAAGAGTA AGCTGCGAGA TGTTAAGGAT AACAAATCTA CTGATGTTAA GCTTCCGGTA
GTTAAAGCTG AAGATAAGAA TAAGTTACAA GATGTTAAGG ATAATAAACC TAGTGATGTT
AAGCTTCCAG TAATTAAAGC TGAAGATAAG AGTAAGCTGC GAGATGTTAA GGATAACAAA
TCTACTGATG TTAAGCTTCC GGTAGTTAAA GCTGAAGATA AGAATAAGTT ACAAGATGTT
AAGGATAATA AACCTAGTGA TGTTAAGCTT CCGGTAGTTA AAGCTGAAGA TAAGAATAAG
TTACAAGATG TTAAGGATAA TAAACCTAGT GATGTTAAGC TTCCAGTAAT TAAAGCTGAA
GATAAGAATA AGTTACAAGA TGTTAAGGAT AACAAACCTA GTGATGTTAA GCTTCCAGTA
GTTAAAGCTG AAGATAAGAG TAAGCTGCGA GATGTTAAAG ATAACAAACC TAGTGATGTT
AAGCTTCCGG TAATTAAAGC TGAAGATAAG AGTAAGCTAC AAGATGTTAA AGATAACAAA
CCTAGTGATG TTAAGCTTCC GGTAATTAAA GCTGAAGATA AGAGTAAGCT GCGAGATGTT
AAAGATAACA AACCTAGTGA TGTTAAGCTT CCAGTAGTTG AGAATATGAT TATTGATACA
TCTAAGTTAA ATGATGATGG TGATCATAAG GCTGACAAGA AAGAAAAAGG GTTACGTTCA
TTATTAAAAT TTACAAAAAT AGAAAATGAT AGTAAGGGTT CTGATAATAA TGTCGCAGGT
AATATTGCTA ATGATAGTGA GGAGCCTACG TTCTCTCAAC CTAAAAGTGA TGTAGAATCT
CCTAAACAAG TATCTGAGAG TGATGAACAA AAAAATAGTC ATAAAGCTGT ACACTTTCTA
TCATTCTTAA ACGAGCCAAG TGAAGATCAA AATGATACGA CAGAAGAACT ACAAAATACA
GAAGATAAAA AAGATAATTT ACTAGAAGAG AATGTTAAAA TATCTGAAAA GGATGATCAA
CAAGTAGTAA TATCTATAGA AGAGGAAAAT CAGATGTTAT TACAAAGTAT TAAGAAGATG
AAAGAATATG ATGAAGATTA TAGTATTACG TATTATTATG ATGATGATGG TATGGCGTAC
TATGAAGACT AG
 
Protein sequence
MFRRNILNVL LVLIFCFVIS CSNKSRYQFS KKYSPVYNPD GEAFDSDVGF SRAYSIYKER 
RNALVSGIDE SSKKVSVRRK VKPKVQVRDV DLLKEYGDLL KEENCDLMVD GNGVNLVDVA
GAKFIDPIEN DDDVIDHDNR HVDVKSSVVK TKDKNKLQDV KDNKPSDVKL PVIKAEDKNK
LQDVKDNKPS DVKLPVIKAE DKSKLRDVKD NKSTDVKLPV VKAEDKNKLQ DVKDNKPSDV
KLPVIKAEDK SKLRDVKDNK STDVKLPVVK AEDKNKLQDV KDNKPSDVKL PVVKAEDKNK
LQDVKDNKPS DVKLPVIKAE DKNKLQDVKD NKPSDVKLPV VKAEDKSKLR DVKDNKPSDV
KLPVIKAEDK SKLQDVKDNK PSDVKLPVIK AEDKSKLRDV KDNKPSDVKL PVVENMIIDT
SKLNDDGDHK ADKKEKGLRS LLKFTKIEND SKGSDNNVAG NIANDSEEPT FSQPKSDVES
PKQVSESDEQ KNSHKAVHFL SFLNEPSEDQ NDTTEELQNT EDKKDNLLEE NVKISEKDDQ
QVVISIEEEN QMLLQSIKKM KEYDEDYSIT YYYDDDGMAY YED