Gene Information |
Locus tag | ECH_1003 |
Symbol | ctaD |
ID | 3927515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1028969 |
End bp | 1030525 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637902119 |
Product | cytochrome c oxidase, subunit I |
Protein accession | YP_507790 |
Protein GI | 88658568 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
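The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for ECH_1003 (ctaD).
length = gene_length(1028969, 1030525)
print(length, protein_length(length))  # prints "1557 518"
```

Under these assumptions the computed values agree with the Gene Length (1557 bp) and Protein Length (518 aa) fields.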
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTG AACATACACC ACAGGGGATA AGACGTTGGT TGTTTTCAAC AAATCACAAA GATATAGGTA CATTATATAT CATTTTTTCC ATTATTGGTG GACTTGTAGG TGGTATAATG TCTCTTGTAC TGAGATTACA ACTAGCACAT ATTAACGTAT TACATGATAA CTATCAATTA TATAATGTTA TTGTAACAGG GCATGCATTA ATTATGGTGT TTTTTATGAT CATGCCAGCT TTAACTGGAG GATTTGGTAA TTGGTTCGTA CCATTGCTTA TAGGAGCCCC AGACATGGCA TTCCCACGTC TTAATAATGT AAGCTTCTGG CTGTTGGTTG CCTCACTAAT TCTATTATGT ATATCTGTCT TGATAGGAGA AGGTGCAGGT ACAGGGTGGA CATTATACCC ACCATTATCA TATATCGGGT CTCATCCAAG TGCTTCCGTA GATATAGCAA TATTCGCTAT ACATGTTGCT GGAGCTTCAT CAATTGTAGG AGCAATAAAT TTCATTGTGA CAATATTCAA CATGCGAGCT CATGGCATGA CATTGTTAAA AATGCCACTA TTCGTCTGGA CAATATTATT AACATCTTTT ATGTTAATAG TGACAATCCC AGTTTTAGGG GGCGCAGTAA CAATGCTATT AACAGACAGA AATTTTGGCA CAAGTTTTTT TGACCCTGCT GGTGGAGGTG ATCCTTTATT ATTCCAACAT CTATTTTGGT TCTTTGGACA CCCAGAAGTG TATATAATCA TATTCCCAGC ATTTGGAATA ATAAGCCAAA TTGTATCCAC TTTTTCTCAT AAAGCAGTTT TTGGATACTT AGGTATGGTA TTAGCATTAG TTGGCATTGC AGCTGTAGGT GCAGTAGTAT GGGCACATCA CATGTTTACT GTTGGATTAA GCGCAGAGAT TATGACATAT TTCAGTGTTA CTACCATGTT AATAGGTGTG TTAACAGGAG TTAAAGTATT CAGCTGGATT GCAACTATGT GGGGAGGACA AATAGAATTT AAAACTCCTA TGCTATTTTC AATTGGATTT ATCTTCGTAT TTGTAGTAGG AGGAGTTACT GGTATTGTAA TTTCACACGG TGGTATAGAT AAAGCACTGC ACGACACATA CTATGTAGTT GCACATTTCC ATTATGTAAT GTCAATCGCT GCATTGTTTG CTGCCTTTGC TGCTTTCTAT TACTGGATAG GTAAAATATC AGGTAAGCAA TATAACGAAT GTTTAGGTAA AATACATTTC TGGTTAACTT TTATTGGAAC TAATATTACA TTTTTACCTC AACACTTTTT AGGTGTAGCA GGCATGCCAA GACGGATCCC AGATTACCCA GATGCTTTTA TTCCATGGAA TTATATATCA TCAGTAGGAG CATTGATATC ATTCATATCA GCCTTATTTT TTGTTTACAT AATTATTTCA ACATTAAGAA ATGGAAAAAA ATGTCCTAGC AATCCATGGG GAGGAGATAC ATTAGAATGG ACAATACCAT CACCAGCACC TTTCCATACC TTTGAAGAAA TACCAAAGGT TGATTAA
|
Protein sequence | MSSEHTPQGI RRWLFSTNHK DIGTLYIIFS IIGGLVGGIM SLVLRLQLAH INVLHDNYQL YNVIVTGHAL IMVFFMIMPA LTGGFGNWFV PLLIGAPDMA FPRLNNVSFW LLVASLILLC ISVLIGEGAG TGWTLYPPLS YIGSHPSASV DIAIFAIHVA GASSIVGAIN FIVTIFNMRA HGMTLLKMPL FVWTILLTSF MLIVTIPVLG GAVTMLLTDR NFGTSFFDPA GGGDPLLFQH LFWFFGHPEV YIIIFPAFGI ISQIVSTFSH KAVFGYLGMV LALVGIAAVG AVVWAHHMFT VGLSAEIMTY FSVTTMLIGV LTGVKVFSWI ATMWGGQIEF KTPMLFSIGF IFVFVVGGVT GIVISHGGID KALHDTYYVV AHFHYVMSIA ALFAAFAAFY YWIGKISGKQ YNECLGKIHF WLTFIGTNIT FLPQHFLGVA GMPRRIPDYP DAFIPWNYIS SVGALISFIS ALFFVYIIIS TLRNGKKCPS NPWGGDTLEW TIPSPAPFHT FEEIPKVD
|
| |
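The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal pure-Python sketch of both calculations, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the input contains only A, C, G and T plus the spacing used in the record. The helper names `clean`, `translate` and `gc_percent` are illustrative.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Amino-acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the 10-base spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence from the record.
prefix = "ATGAGTAGTG AACATACACC ACAGGGGATA"
print(translate(prefix))  # prints "MSSEHTPQGI"
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be checked the same way.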