Gene ECH_1114 details

Gene Information

Locus tag: ECH_1114
Symbol:
ID: 3927888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1136453
End bp: 1137901
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 33%
IMG OID: 637902228
Product: isocitrate dehydrogenase
Protein accession: YP_507898
Protein GI: 88658075
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR02924] isocitrate dehydrogenase
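
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1136453, 1137901)
print(length_bp)                  # 1449
print(protein_length(length_bp))  # 482
```

Both results match the Gene Length and Protein Length fields.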


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAATTC CAATAACAGT TGCTTATGGA GATGGTATTG GCCCAGAAAT TATGGAAGCT 
GTACTATTGA TCTTGAGTGA AGCAGAATCA GGTCTAGTTG TAGAAACTAT AGAGGTGGGA
CACAATTTAT ATAAAAAAGA ATGGTCTTCT GGTATTGCGC CTTCATCTTG GGATTCTATT
TATAGAACGA AAGTACTGCT TAAGTCTCCT ACCATGACTC CACAAGGTCG TGGGCATAAA
AGTCTTAATG TTACGTTGAG AAAGAGATTA GGGTTGTATG CAAATATTAG GCCATGTATT
TCTTATCATC CTGTGATAAA GACAAGGTAT CCTAATCTCA ATGTAGTGAT AGTTCGAGAA
AATGAGGAAG ATACATATAC TGGTATAGAA CATAGGCTGA CTAATGATAC GTATCAATGC
TCAAAAGTTA TTACGAGATC AGGTTCAGAA AGAATATGTG ATTATGCATT TCACTATGCT
AAGGTTCATA ATAGGAAGAG AGTTACTTGC TTGATAAAAG ATAATATTAT GAAAATGACA
GATGGAATTT TTCATAAGTC TTTTTCAAAG ATTGCAGAGA ATTATCCTGA CATTGAATCA
GATCATTATA TTGTTGATAT TGGAATGGCA AAAGTGGCTT CTAACCCTGA AAATTTTGAT
GTTATAGTTA CTACTAATCT TTATGGTGAT ATAGTGTCTG ATATAGTTGC TGAATTATCA
GGGTCTATAG GCCTTGCAGG TAGTGCTAAT ATAGGGAATA ATTATGCAAT GTTTGAAGCT
GTTCATGGTT CAGCTCCAGA TATAGCTGGA AAGAACATAG CAAATCCTTC TGGATTACTT
AATGCAGCTA TACAAATGTT GATGTATTTA AAACAGTTTG ACAAGGCACA GCTAATTTAT
AATGCATTTC TTAAAACTTT AGAAGATGGT ATTCATACAG CAGATATTTA CCAAAGTCAG
GTTAGCAAGA AAAAAGTTTC TACAATGGAT TTTGCTAAGG CTGTTGTTGA AAATTTTGGA
CAATCTCCAT CTCAATTACC AAAATCAATG TTTCAGGATA ATGTAGATAG GACGGGTGTA
TCTTATACAT ATGAACCATC TTATGTTACT AGAGTTCTTG TAGGAGTGGA TATTACTATT
GGTTGTGATG GTGTAGGACT AGATTTTAAG CAGTTAATTA ATAATCTTCA AAATATTACA
CATGATAAAC TGGAGCTTGT GCTAATTCAT AATAAAGGAT TAGAAATTTG GCCTGATGAA
TCTGTGAGTG TAAATCTTTC TTATATGGAT CAGGTATGTT GTAGGTTTTA CATGAAAAGT
AAAGATGATA AAATCATGAA TGAGCATATT AATCAGTTGC TTTTTGACAT AGAGCAAAAG
AAAATAGATG TAGTAAAAAT GGAAAAATTA TACTTATATA ATGATCAACC TGGCTTTTTT
ACTATGTAA
 
Protein sequence
MSIPITVAYG DGIGPEIMEA VLLILSEAES GLVVETIEVG HNLYKKEWSS GIAPSSWDSI 
YRTKVLLKSP TMTPQGRGHK SLNVTLRKRL GLYANIRPCI SYHPVIKTRY PNLNVVIVRE
NEEDTYTGIE HRLTNDTYQC SKVITRSGSE RICDYAFHYA KVHNRKRVTC LIKDNIMKMT
DGIFHKSFSK IAENYPDIES DHYIVDIGMA KVASNPENFD VIVTTNLYGD IVSDIVAELS
GSIGLAGSAN IGNNYAMFEA VHGSAPDIAG KNIANPSGLL NAAIQMLMYL KQFDKAQLIY
NAFLKTLEDG IHTADIYQSQ VSKKKVSTMD FAKAVVENFG QSPSQLPKSM FQDNVDRTGV
SYTYEPSYVT RVLVGVDITI GCDGVGLDFK QLINNLQNIT HDKLELVLIH NKGLEIWPDE
SVSVNLSYMD QVCCRFYMKS KDDKIMNEHI NQLLFDIEQK KIDVVKMEKL YLYNDQPGFF
TM
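
To cross-check the protein sequence against the gene sequence, the nucleotides can be translated codon by codon. The following is a minimal sketch, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as starts, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGTCAATTC CAATAACAGT TGCTTATGGA"))  # MSIPITVAYG
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.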