Gene ECH_1079 details


Gene Information

Locus tag: ECH_1079
Symbol:
ID: 3927107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1106234
End bp: 1107532
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 30%
IMG OID: 637902193
Product: putative cytochrome c oxidase, subunit I
Protein accession: YP_507864
Protein GI: 88658468
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID:
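
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That reading is an assumption, since the record does not state its coordinate convention. The sketch below checks the arithmetic; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1106234, 1107532)
print(length)                  # 1299
print(protein_length(length))  # 432
```

Both results match the Gene Length (1299 bp) and Protein Length (432 aa) fields of the record.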


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.933646
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTATA TAAATAACTA TAACATACAA CTATGTCGTA AGTGGTTATT ATTAGCTGTA 
TCTGCTTTAG CAATATCTGG ACTTTTATCC ATATTTGTAA TTTTGCTCAG ATTGCCTATA
TCAAAATCTT TTATCACAGA TGTAGATAAA GTGTTTGATA TATCGCTAGT AATACATGTT
AACTTATCAG TACTAGTATG GGCATCCTCA ATCATTTCTA TAATATCAAG CTTAATAATT
TCCACTAACA AGTATTCAAA ATGTTTCAAA TATCTATGCT ACTCCGCATT TATTGGTACA
TTTTTAATGG TGCTATCTAT TTTTTTTCCT AATGCAGAAC CAATAAAAAA TAATTACGTT
CCTGTAATAA ATAACTTATG TTTTTTACTA GGACTATTTA TATTTACTGC AAGCATACTT
TTTTACTCAA TATTATCAAC TAAATGTCAT CACCTAAATA TTCTTCAAGG CATTTCATTG
GGAATACACG GCATTCCAAT AATACTCATA TGCGCTGTTC TATGTTTCAC TATGGCTTAT
TACACAATAC ACACAAACAA CTATTTCCAC ATAACATCTT TTTATGAAAA TATATTTTGG
GGAGGAGGAC ATATACTGCA ACTAGCTTTT TGTCAGGCAT TACTAATAGT ATACTTAATT
ATATTAGGAA AAAACACTAA ACTACTTAAT AAAAACGTAA TCCACGTCAT ATTTATAATA
AACACATTAT CAGTTATCGT AGCACCTATC ATATACATAT ATCATCCAGC CGATTCTCAA
TTTACCATAG ATTTCTTCAC ATGGCATATG CGTATTCTTG GAGGAGTAAT ACCCGTATTC
TTATTCATAC TTACACTATT CAATCTCAAA ATATTGCTAA AACATAGCTA TCATAGTCTA
TTATGTACAA CTTTACTATT TAGCTATGGT GGTATACTAG GAATATTAAG TATTCATGGT
AATGTAACAA TACCTGCACA TTATCATGGA TCTATTGTTG GAATGACTAT CGCATTCATG
GGATTCATTT ACTGGCTCCT ACCAAAATTA AACCTTGGCA ATACAAATAG CTTTTATACT
AATCTTCAAA TTTACATATA CAGTTTAGGA CAATTTCTAC ATATAACCGG ATTAGAATGG
TTAGGTGGAT ATGGAGCATT AAGAAAAGTA ACATATTTAC CTGATACAGC ATCAAAAATT
GCAAAACACT GTTTTACAAT CGGTGGACTT ATGGCAATTA TTGGTGGATG TTTATTTGTA
ATAATAGTAC TACTACAAAT CCGAAAAAAA GAAGCTTAA
 
Protein sequence
MSYINNYNIQ LCRKWLLLAV SALAISGLLS IFVILLRLPI SKSFITDVDK VFDISLVIHV 
NLSVLVWASS IISIISSLII STNKYSKCFK YLCYSAFIGT FLMVLSIFFP NAEPIKNNYV
PVINNLCFLL GLFIFTASIL FYSILSTKCH HLNILQGISL GIHGIPIILI CAVLCFTMAY
YTIHTNNYFH ITSFYENIFW GGGHILQLAF CQALLIVYLI ILGKNTKLLN KNVIHVIFII
NTLSVIVAPI IYIYHPADSQ FTIDFFTWHM RILGGVIPVF LFILTLFNLK ILLKHSYHSL
LCTTLLFSYG GILGILSIHG NVTIPAHYHG SIVGMTIAFM GFIYWLLPKL NLGNTNSFYT
NLQIYIYSLG QFLHITGLEW LGGYGALRKV TYLPDTASKI AKHCFTIGGL MAIIGGCLFV
IIVLLQIRKK EA
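
The gene and protein sequences above can be cross-checked against each other with a short script. This is a minimal sketch, not the method the database itself uses: it builds the standard codon assignment, which translation table 11 shares with table 1 for codons inside the reading frame (table 11 differs in its set of alternative start codons, which does not matter here because the gene starts with ATG). The names `clean`, `gc_content` and `translate` are hypothetical helpers. Only the first sequence line is used as input; pasting the whole gene sequence works the same way because whitespace is stripped.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid string
# ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGTCTTATA TAAATAACTA TAACATACAA CTATGTCGTA AGTGGTTATT ATTAGCTGTA"
print(translate(first_line))  # MSYINNYNIQLCRKWLLLAV
```

The 20 residues printed match the first 20 residues of the protein sequence. Calling `gc_content` on the full gene sequence gives a fraction that can be compared with the GC content field of the record.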