Gene ECH_0030 details

Gene Information

Locus tag: ECH_0030
Symbol: hemE
ID: 3927972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 25934
End bp: 26938
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 38%
IMG OID: 637901155
Product: uroporphyrinogen decarboxylase
Protein accession: YP_506863
Protein GI: 88657828
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
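
The start, end, gene length and protein length above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon adds no residue


length = gene_length_bp(25934, 26938)
print(length)                     # 1005
print(protein_length_aa(length))  # 334
```

Under these assumptions, 1005 bp corresponds to 335 codons, of which 334 code for amino acids.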


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.210193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAGA CTATAACAAG CAGGGCCAAG CAAAAAGAGA TTCCAGTCTG GTTCATGCGC 
CAAGCCGGTA GATACTTACC GGAGTACCGC AAGGTGGCAG AGGAGGCAGG AAGCTTTCTA
GAACTGTGTT ATACACCAGA GCTGGTAAAG GAGGTTACAT TACAACCAGT AAGGAGGTTC
GGCTTGGACG CGGCGATAAT ATTTTCAGAC ATACTGGTAA TCCCTGACGC CTTAGGTTGC
AAAGTAGAAT TCACGAAAGA GAAAGGACCC GAGTTGCAGC TAATATCTAA CCACTCAGAA
ATAAGCGTTC CCGAAGAAGC TGCATTGGAT CATCTTAAAA ATGTTTTTAG GGGTATAAAA
GAAGTAAGAA AGTCCTTACA AAGAGACAAG CCATTGATAG GGTTTGCAGG TGCACCTTGG
ACTATAGCCT CTTATATGAT AGGAAGAGAT AAAAATTTCT CAAAAATAAG AGAGATGTGT
TATTCACAAA CTAAAAACCT AGAAAAAATA GTAGAAAAAA TTACAAAGGT GACAACCTTA
TACTTAATAA AACAAATAGA AAGCGGTGTA GACATAATAC AAATTTTTGA TAGCAATGCA
GGAATTGTAC CAGCCGGCGA ATTCAAAAAG TGGATAATAG ACCCAACGAA AGAAATAGTC
TCGTCTATAC GTAAACTTTA TCCAGAATTC CCCATCATAG GATTTCCTAA GGGTGCAGGA
GTGATGTACA AGCAGTTTTC AGAAGAAACG GAAGTTTCAG TCACAAGTGT CGACTATAAT
ACCCCAATGT CTTGGGCAAA AAGTAACATT CCGTCAGTAC TACAAGGAAA TATAGATCCA
TATCTAGTAG CGTATGACAA AAGTAAGGCA ATATCCCAAA CGAAAGAACT AATCAATATA
ATGAAGGACA AACCTTTCAT ATTTAACTTA GGTCATGGAG TAATACCAAG TACCCCTATA
GCTAATATTG CAGCACTTGT AGACACAATA AAATCTGTTG TTTAA
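
The GC content reported for this gene can be recomputed from the sequence. The sketch below assumes GC content means the percentage of G and C bases over all bases, and that the spaces between the 10-base blocks are only formatting; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence):
    """Percentage of G and C among all bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used only as a short example.
print(gc_content("ATGTTAAAGA CTATAACAAG"))  # 25.0
```

Passing the full gene sequence, with or without the block spacing, gives the value for the whole gene.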
 
Protein sequence
MLKTITSRAK QKEIPVWFMR QAGRYLPEYR KVAEEAGSFL ELCYTPELVK EVTLQPVRRF 
GLDAAIIFSD ILVIPDALGC KVEFTKEKGP ELQLISNHSE ISVPEEAALD HLKNVFRGIK
EVRKSLQRDK PLIGFAGAPW TIASYMIGRD KNFSKIREMC YSQTKNLEKI VEKITKVTTL
YLIKQIESGV DIIQIFDSNA GIVPAGEFKK WIIDPTKEIV SSIRKLYPEF PIIGFPKGAG
VMYKQFSEET EVSVTSVDYN TPMSWAKSNI PSVLQGNIDP YLVAYDKSKA ISQTKELINI
MKDKPFIFNL GHGVIPSTPI ANIAALVDTI KSVV
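
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch, assuming the forward-strand sequence as printed and the NCBI compact codon ordering (bases in the order T, C, A, G). `translate` is a hypothetical helper that stops at the first stop codon and does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11 in NCBI order: first base varies slowest, third fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGTTAAAGA CTATAACAAG CAGGGCCAAG"))  # MLKTITSRAK
```

The output matches the first block of the protein sequence listed above.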