Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0521 |
Symbol | petB |
ID | 3927875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 520405 |
End bp | 521631 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901644 |
Product | ubiquinol-cytochrome c reductase, cytochrome b |
Protein accession | YP_507336 |
Protein GI | 88658641 |
COG category | [C] Energy production and conversion |
COG ID | [COG1290] Cytochrome b subunit of the bc complex |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.518371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAC ATGACAATAT AAAAAAAACG GAAGGGCGTG GCATTAGGGC TTGGATAGAA TATAGAATGC CGATTGGTGC TTTTTTAAAA GAGTTAGCTT CATATCAGGT ACCTAAGAAC CTGAATTATG CTTGGAATTT TGGTTCTCTT GCTGGTATTG CACTAATGCT ACAGATTATC ACAGGGATAT TTTTAGCAAT GCATTACACA CCACATGTTG CACATGCATT TAGTAGTGTA GAGAGGATAA TGCGTGATGT TAATTATGGT TGGTTAATAA GATATACTCA TGCTGTAGGT GCTTCATTCT TTTTTATAGT TGTGTATATA CATATATTAC GTGGTTTATA TTATGGTTCT TATAAAAGTC CTAGAGAATT AGTTTGGTTT GTTGGTATTT TTATCTTTTT TGCAATGATG GCTACAGCAT TTATGGGATA TGTATTACCA TGGGGGCAAA TGAGTTTTTG GGGTGCAACT GTAATTACTA ACTTGTTTTC TGTTATACCT TTAATTGGTC AGGATGTAGT ACAATGGCTA TGGGGTGGTT TTTCTGTTGA TAATCCTACG TTGAATAGAT TTTTTGCGTT ACATTATTTG TTACCTTTTA TTATTGTGAT GCTTGCTTCA TTACATGTTA TAGCATTGCA CAGGTTTGGA TCAGGTAATC CGAGTGGAAT AGAAGTAAAA TCTAGTAAAG ACACTATTCC AATTTATCCT TACTTTATTG TTAAAGATTG TATAACATTT GGTATATTTT TTATTCTTTT ATTTTTGTTT GTATTTTATA TTCCAAATTA CTTAGGGCAT CCAGATAATT ATATTGAAGC TGATCCTATG GTGACACCTG CTCATATAGT TCCTGAATGG TACTTTTTGC CTTTTTATGC TATGTTGCGT TCTATTCCTA ATAAATTATT AGGGGTAGTT ACTATGATTG GCTCTATAGC AGTGTGGTTT TTGTTACCTG TATTAGATAA ATGTAAGGTC AAGAGTGGTA GTCATCGTCC GATTTTTAGA ATCTTTTATC TGTTCTTTGT AGTGAATTTT TGTTTTTTAG CTTGGCTTGG TGGACAAGAA GTAAGAGAAC CATTTGTAAC ACTTAGTAGA TTATCTACAT TATATTATTT CTCATATTTT TTTATTGTGT TGCCTATATT GTCTAAGTAT GAAAAGCCAG TTGTGCTTCC AAAAACGATA AGTGATGCAG TGCCGGAGAT GAAATAA
|
Protein sequence | MSEHDNIKKT EGRGIRAWIE YRMPIGAFLK ELASYQVPKN LNYAWNFGSL AGIALMLQII TGIFLAMHYT PHVAHAFSSV ERIMRDVNYG WLIRYTHAVG ASFFFIVVYI HILRGLYYGS YKSPRELVWF VGIFIFFAMM ATAFMGYVLP WGQMSFWGAT VITNLFSVIP LIGQDVVQWL WGGFSVDNPT LNRFFALHYL LPFIIVMLAS LHVIALHRFG SGNPSGIEVK SSKDTIPIYP YFIVKDCITF GIFFILLFLF VFYIPNYLGH PDNYIEADPM VTPAHIVPEW YFLPFYAMLR SIPNKLLGVV TMIGSIAVWF LLPVLDKCKV KSGSHRPIFR IFYLFFVVNF CFLAWLGGQE VREPFVTLSR LSTLYYFSYF FIVLPILSKY EKPVVLPKTI SDAVPEMK
|
| |