Gene ECH_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0521 
SymbolpetB 
ID3927875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp520405 
End bp521631 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content33% 
IMG OID637901644 
Productubiquinol-cytochrome c reductase, cytochrome b 
Protein accessionYP_507336 
Protein GI88658641 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.518371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAC ATGACAATAT AAAAAAAACG GAAGGGCGTG GCATTAGGGC TTGGATAGAA 
TATAGAATGC CGATTGGTGC TTTTTTAAAA GAGTTAGCTT CATATCAGGT ACCTAAGAAC
CTGAATTATG CTTGGAATTT TGGTTCTCTT GCTGGTATTG CACTAATGCT ACAGATTATC
ACAGGGATAT TTTTAGCAAT GCATTACACA CCACATGTTG CACATGCATT TAGTAGTGTA
GAGAGGATAA TGCGTGATGT TAATTATGGT TGGTTAATAA GATATACTCA TGCTGTAGGT
GCTTCATTCT TTTTTATAGT TGTGTATATA CATATATTAC GTGGTTTATA TTATGGTTCT
TATAAAAGTC CTAGAGAATT AGTTTGGTTT GTTGGTATTT TTATCTTTTT TGCAATGATG
GCTACAGCAT TTATGGGATA TGTATTACCA TGGGGGCAAA TGAGTTTTTG GGGTGCAACT
GTAATTACTA ACTTGTTTTC TGTTATACCT TTAATTGGTC AGGATGTAGT ACAATGGCTA
TGGGGTGGTT TTTCTGTTGA TAATCCTACG TTGAATAGAT TTTTTGCGTT ACATTATTTG
TTACCTTTTA TTATTGTGAT GCTTGCTTCA TTACATGTTA TAGCATTGCA CAGGTTTGGA
TCAGGTAATC CGAGTGGAAT AGAAGTAAAA TCTAGTAAAG ACACTATTCC AATTTATCCT
TACTTTATTG TTAAAGATTG TATAACATTT GGTATATTTT TTATTCTTTT ATTTTTGTTT
GTATTTTATA TTCCAAATTA CTTAGGGCAT CCAGATAATT ATATTGAAGC TGATCCTATG
GTGACACCTG CTCATATAGT TCCTGAATGG TACTTTTTGC CTTTTTATGC TATGTTGCGT
TCTATTCCTA ATAAATTATT AGGGGTAGTT ACTATGATTG GCTCTATAGC AGTGTGGTTT
TTGTTACCTG TATTAGATAA ATGTAAGGTC AAGAGTGGTA GTCATCGTCC GATTTTTAGA
ATCTTTTATC TGTTCTTTGT AGTGAATTTT TGTTTTTTAG CTTGGCTTGG TGGACAAGAA
GTAAGAGAAC CATTTGTAAC ACTTAGTAGA TTATCTACAT TATATTATTT CTCATATTTT
TTTATTGTGT TGCCTATATT GTCTAAGTAT GAAAAGCCAG TTGTGCTTCC AAAAACGATA
AGTGATGCAG TGCCGGAGAT GAAATAA
 
Protein sequence
MSEHDNIKKT EGRGIRAWIE YRMPIGAFLK ELASYQVPKN LNYAWNFGSL AGIALMLQII 
TGIFLAMHYT PHVAHAFSSV ERIMRDVNYG WLIRYTHAVG ASFFFIVVYI HILRGLYYGS
YKSPRELVWF VGIFIFFAMM ATAFMGYVLP WGQMSFWGAT VITNLFSVIP LIGQDVVQWL
WGGFSVDNPT LNRFFALHYL LPFIIVMLAS LHVIALHRFG SGNPSGIEVK SSKDTIPIYP
YFIVKDCITF GIFFILLFLF VFYIPNYLGH PDNYIEADPM VTPAHIVPEW YFLPFYAMLR
SIPNKLLGVV TMIGSIAVWF LLPVLDKCKV KSGSHRPIFR IFYLFFVVNF CFLAWLGGQE
VREPFVTLSR LSTLYYFSYF FIVLPILSKY EKPVVLPKTI SDAVPEMK