Gene ECH_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1003 
SymbolctaD 
ID3927515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1028969 
End bp1030525 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content36% 
IMG OID637902119 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_507790 
Protein GI88658568 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTG AACATACACC ACAGGGGATA AGACGTTGGT TGTTTTCAAC AAATCACAAA 
GATATAGGTA CATTATATAT CATTTTTTCC ATTATTGGTG GACTTGTAGG TGGTATAATG
TCTCTTGTAC TGAGATTACA ACTAGCACAT ATTAACGTAT TACATGATAA CTATCAATTA
TATAATGTTA TTGTAACAGG GCATGCATTA ATTATGGTGT TTTTTATGAT CATGCCAGCT
TTAACTGGAG GATTTGGTAA TTGGTTCGTA CCATTGCTTA TAGGAGCCCC AGACATGGCA
TTCCCACGTC TTAATAATGT AAGCTTCTGG CTGTTGGTTG CCTCACTAAT TCTATTATGT
ATATCTGTCT TGATAGGAGA AGGTGCAGGT ACAGGGTGGA CATTATACCC ACCATTATCA
TATATCGGGT CTCATCCAAG TGCTTCCGTA GATATAGCAA TATTCGCTAT ACATGTTGCT
GGAGCTTCAT CAATTGTAGG AGCAATAAAT TTCATTGTGA CAATATTCAA CATGCGAGCT
CATGGCATGA CATTGTTAAA AATGCCACTA TTCGTCTGGA CAATATTATT AACATCTTTT
ATGTTAATAG TGACAATCCC AGTTTTAGGG GGCGCAGTAA CAATGCTATT AACAGACAGA
AATTTTGGCA CAAGTTTTTT TGACCCTGCT GGTGGAGGTG ATCCTTTATT ATTCCAACAT
CTATTTTGGT TCTTTGGACA CCCAGAAGTG TATATAATCA TATTCCCAGC ATTTGGAATA
ATAAGCCAAA TTGTATCCAC TTTTTCTCAT AAAGCAGTTT TTGGATACTT AGGTATGGTA
TTAGCATTAG TTGGCATTGC AGCTGTAGGT GCAGTAGTAT GGGCACATCA CATGTTTACT
GTTGGATTAA GCGCAGAGAT TATGACATAT TTCAGTGTTA CTACCATGTT AATAGGTGTG
TTAACAGGAG TTAAAGTATT CAGCTGGATT GCAACTATGT GGGGAGGACA AATAGAATTT
AAAACTCCTA TGCTATTTTC AATTGGATTT ATCTTCGTAT TTGTAGTAGG AGGAGTTACT
GGTATTGTAA TTTCACACGG TGGTATAGAT AAAGCACTGC ACGACACATA CTATGTAGTT
GCACATTTCC ATTATGTAAT GTCAATCGCT GCATTGTTTG CTGCCTTTGC TGCTTTCTAT
TACTGGATAG GTAAAATATC AGGTAAGCAA TATAACGAAT GTTTAGGTAA AATACATTTC
TGGTTAACTT TTATTGGAAC TAATATTACA TTTTTACCTC AACACTTTTT AGGTGTAGCA
GGCATGCCAA GACGGATCCC AGATTACCCA GATGCTTTTA TTCCATGGAA TTATATATCA
TCAGTAGGAG CATTGATATC ATTCATATCA GCCTTATTTT TTGTTTACAT AATTATTTCA
ACATTAAGAA ATGGAAAAAA ATGTCCTAGC AATCCATGGG GAGGAGATAC ATTAGAATGG
ACAATACCAT CACCAGCACC TTTCCATACC TTTGAAGAAA TACCAAAGGT TGATTAA
 
Protein sequence
MSSEHTPQGI RRWLFSTNHK DIGTLYIIFS IIGGLVGGIM SLVLRLQLAH INVLHDNYQL 
YNVIVTGHAL IMVFFMIMPA LTGGFGNWFV PLLIGAPDMA FPRLNNVSFW LLVASLILLC
ISVLIGEGAG TGWTLYPPLS YIGSHPSASV DIAIFAIHVA GASSIVGAIN FIVTIFNMRA
HGMTLLKMPL FVWTILLTSF MLIVTIPVLG GAVTMLLTDR NFGTSFFDPA GGGDPLLFQH
LFWFFGHPEV YIIIFPAFGI ISQIVSTFSH KAVFGYLGMV LALVGIAAVG AVVWAHHMFT
VGLSAEIMTY FSVTTMLIGV LTGVKVFSWI ATMWGGQIEF KTPMLFSIGF IFVFVVGGVT
GIVISHGGID KALHDTYYVV AHFHYVMSIA ALFAAFAAFY YWIGKISGKQ YNECLGKIHF
WLTFIGTNIT FLPQHFLGVA GMPRRIPDYP DAFIPWNYIS SVGALISFIS ALFFVYIIIS
TLRNGKKCPS NPWGGDTLEW TIPSPAPFHT FEEIPKVD