Gene ECH_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0149 
Symbol 
ID3927761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp139712 
End bp140710 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content37% 
IMG OID637901273 
Productpyruvate dehydrogenase subunit beta 
Protein accessionYP_506977 
Protein GI88657756 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACTT TAACTGTACG AGAAGCACTA TGCGAAGCAA TACGTGAAGA AATGGAACGC 
GACCATACAG TACTAATTAT GGGAGAAGAA GTAGGTGAAT ATCAAGGTGC ATACAAAGTG
ACCCAAGGAT TACTTGAACA ATTTGGCCCT GATAGAGTCA TAGATACTCC CATAACTGAA
CATGGATTTG CTGGGATAGG GGTAGGTGCT GCATTTGCGG GACTGAAACC TATTGTAGAA
TTCATGACTT TCAACTTTGC TATGCAGGCA ATAGATCAAA TTATTAACTC AGCAGCTAAA
ACTAGTTACA TGTCTGGAGG ACAATTGAAC TGTCCTATTG TATTTAGAGG CCCCAATGGT
GCAGCAGCAA GAGTAGGAGC ACAACATTCT CAATGTTATG CTTCATGGTA TGCACACATC
CCTGGATTAA AAGTAGTATC CCCATATTTT GCAGCAGATT GTAAAGGTCT ATTAAAGGCA
GCTATAAGGG ATTTAAATCC TGTTGTATTT CTTGAAAATG AGATCGCATA TGGACATAAG
CATGAAATAC CAAATGAAGT ATCAACATCA GACTATATAA CCGAAATTGG GAAAGCAGCT
ATAGTCAAGG AAGGAACTGA TATCACAATA ACAGCGTTTT CCCTACAAGT TAAATTCGCA
CTAGAAGCAG CAGAACTTTT AGCAAAAGAA GGTATAAATG CAGAGGTTAT AGACTTAAGA
ACGCTACGCC CTCTTGATAC AGAAACAATA TTACGTTCTA TTAAAAAAAC AAACAAAATT
ATTAGCATAG AAGAAGGATG GCCATATTCA GGCATAGGAT CTGAAATAGC AGCATTGATA
ATGGAATATG CATTTGATGA TTTAGATGCA CCAATGATAA GAATAACTGG AAAAGATGTA
CCATTACCTT ATGCTACAAA CCTTGAAAAG TTAGCATTAC CACAAATTGA AGATATACTA
GAAGCAGCAC GTGCTTTATG TATTCGCAAT TATAGATAA
 
Protein sequence
MRTLTVREAL CEAIREEMER DHTVLIMGEE VGEYQGAYKV TQGLLEQFGP DRVIDTPITE 
HGFAGIGVGA AFAGLKPIVE FMTFNFAMQA IDQIINSAAK TSYMSGGQLN CPIVFRGPNG
AAARVGAQHS QCYASWYAHI PGLKVVSPYF AADCKGLLKA AIRDLNPVVF LENEIAYGHK
HEIPNEVSTS DYITEIGKAA IVKEGTDITI TAFSLQVKFA LEAAELLAKE GINAEVIDLR
TLRPLDTETI LRSIKKTNKI ISIEEGWPYS GIGSEIAALI MEYAFDDLDA PMIRITGKDV
PLPYATNLEK LALPQIEDIL EAARALCIRN YR