Gene ECH_0237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0237 
Symbol 
ID3927133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp223716 
End bp225119 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content37% 
IMG OID637901361 
Producthypothetical protein 
Protein accessionYP_507058 
Protein GI88658494 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGTAAAA AAATTAAGAA GGTATTGATT TCCGGTAAAA GTGTTTGGCC AATTATAGAA 
GGTGGTAAAG GTATAGGTGC GAGTGATGGT AGAACTGCTG GTGCTTTTGC TGCTGCTTCT
GCTGTAGGTA CTTTTTCTGG AGCATGTGCT AGATTAGTTG ATGATAATGG GGAGCATGTT
CCTTTAATAT ATCGTGGAAA GACTAGGTTA GAACGCCATA ATGAGCTCAT AAACTATAGT
ATTGATGCAG CAGTGAGTCA GGCTAGAATG GCTCATGAGA TATCAAAGGG ATTGGGAAGA
ATACATATGA ATGTACTGTG GGAAATGGGA GGTGTTCAGC GGGTACTTCA TGGTGTGCTT
GATAAGGCAA AAGGATTAAT CCATGGTATT ACTTGTGGTG CTGGTATGCC ATATAAGTTA
GTTGAAATAG CTGCTCAATA TCAAGTTTAT TACTATCCTA TTGTTTCTTC AATGAGAGCA
TTTAAAATTT TGTGGCAACG TTCTTATCAG AAATTTTCAA AAACACTTCT TGGTGGAGTT
GTATATGAAG ATCCTTGGTT AGCTGGTGGA CATAATGGAC TTAGTAATAG TGAATCTCCT
GGTCATCCAC AAGATCCTTT TGAAAGAGTT GCAGCAATTC GCGCATACAT GAATGAAGTT
GGATTATCTG ATGTTGTATT AATTATGGCA GGTGGCGTTT GGCATTTGAA GGACTGGGAA
TCATGGTTAG ATAATGATTT AATTGGTCCA ATAGCATTTC AGTTCGGGAC CAGACCTTTA
TTAACTCAGG AAAGTCCAAT TTCTCCTGGG TGGAAAAAGA AGTTGATGTC TTTGAAACCT
GGTGATGTGT TTTTAAATAG ATTTAGTCCT ACAGGGTTTT ATTCTTCTGC GATTGAAAAT
GAGTTTATAA AAGAGTTACA AGCACGTAAT ACACGTCAGA TTGCATTTGA AAATGAGATG
ACTGAAAAAT GCAGTGCAGA GCTTTCTATT GGTAGTAGAG GTAGAAAAGT ATATGTAGAT
CCTAAGGATA AAAAATTGTC CGAGTCTTGG GTAGCAATGG GTTATACAGA TGCTTTAAAA
ACTCCCGATA ATACTTTAAT ATTTGTTAGT CAGAGTCAGT CAAGAAGTAT TAGGGAAGAT
CAAATAAATT GTATGGGATG TCTAAGTCAT TGCAAATTTA GTAATTGGAA AGATCACGGG
GATTATACTA CAGGTATTAA ACCAGATGTT CGTAGTTTTT GTATTCAGAA AACGTTACAA
AATATTATTG CTGGGGTAGA CCATGAACAT GAGCTTATGT TTTCTGGCCA TAATGCATAT
AAGTTTGTAC AAGATGAGTT TTATAGAGAT GGTTATATTC CTACAATCAA GGAGCTTGTT
GATAGAATTT TGACTGGATA TTGA
 
Protein sequence
MRKKIKKVLI SGKSVWPIIE GGKGIGASDG RTAGAFAAAS AVGTFSGACA RLVDDNGEHV 
PLIYRGKTRL ERHNELINYS IDAAVSQARM AHEISKGLGR IHMNVLWEMG GVQRVLHGVL
DKAKGLIHGI TCGAGMPYKL VEIAAQYQVY YYPIVSSMRA FKILWQRSYQ KFSKTLLGGV
VYEDPWLAGG HNGLSNSESP GHPQDPFERV AAIRAYMNEV GLSDVVLIMA GGVWHLKDWE
SWLDNDLIGP IAFQFGTRPL LTQESPISPG WKKKLMSLKP GDVFLNRFSP TGFYSSAIEN
EFIKELQARN TRQIAFENEM TEKCSAELSI GSRGRKVYVD PKDKKLSESW VAMGYTDALK
TPDNTLIFVS QSQSRSIRED QINCMGCLSH CKFSNWKDHG DYTTGIKPDV RSFCIQKTLQ
NIIAGVDHEH ELMFSGHNAY KFVQDEFYRD GYIPTIKELV DRILTGY