Gene ECH_1084 details

Gene Information

Locus tag: ECH_1084
Symbol: araM
ID: 3927959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1111773
End bp: 1113032
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 30%
IMG OID: 637902198
Product: AraM protein
Protein accession: YP_507869
Protein GI: 88658280
COG category: [C] Energy production and conversion
COG ID: [COG0371] Glycerol dehydrogenase and related enzymes
TIGRFAM ID:
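
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1111773
END_BP = 1113032
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1260
    print(expected_protein_length(GENE_LENGTH_BP))  # 419
```

Under those assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.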


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATGATA AATTTTTAAA GCAGATTCTA GTTGATGAAA GTTTTTGTAA GTTGGAGTCA 
GTAGTTAATG CAATAAAGAG TATACGTATT AGTAAGAGAA TCAGTGACAA TATATGCGAT
ATAATTAAAC AGTATGGTAA TAGTGGTTTC ATAGTTACAG ATGTTAATCT TGCTCCTCTG
TTGAACAAGG TTGTATTTAA TAGTGTTGCA CATTTTATTA TTCCTCGTTG GTCTTGTGCT
TCTCAAAAAT TAGTGGAACT AATTAAAGAG AAATCTCGGG ATTCAGATGT TTTAGTATCT
TTTGGTAGTG GTACAATCAA TGATATTTGT AAATATGTAA GTTACATTAC AAATAAGCGC
TATATCTCAT TTCCAACAGC TCCTTCTATG AATGGATATG TTTCATCTAA TGCATCTATA
GTATTAAACA ATGGACATAA AAAATCATTA CAAGCTCATT TACCTGAAGC TATATATATA
GATGTTGATA TTATTGTAAA TGCTCCCCAA AGATTAATAA TAAGTGGTTT TGCTGATTTT
ATTTGTAGAT CTACAGTACA GGCAGATTGG TTATTATCAC ATTTGTTGTT AGGTAGTGAA
TATACTGAAT TGCCGTTTTT AATTAGCAAA AGAAGTGAAA ATGCATTAAT CAACGATTAT
CTAGGGTTAA TAAAACATGA TGAGTATAGT ATTATGCTTT TAATGCAAGC TTTGCTCTTA
TCAGGATTGG GTATGTTTAT TGTAGGCGGA AGTCAATCTG CTAGTCAGGG TGAACATATG
ATTGCTAGTA CCATAGAACT TTTACAAGAT GATATGCATT TCTTTCATGG TGAATTAATA
GGTGTAAGTA TGTCAACTAT GACATGTTTA CAACATAGGA TTTTGAAATC AGTACCAAGG
TTTTATCCTA CATTGATAAA TGATGAAGAT ATAAAACAGT GTTTTCATAT GCAGTATACT
CAAGAGTATT GTGATATACT TGCTCAAAAG TTTATTAATC AACAAAAGGC GGATTATTTG
AATAGCTTGA TTAAGGATAA GTGGTCTTTT ATTGTTGAAA AAATTACAGA GAAAACATTA
TCTGATATAT TATTAAAAGA CATGCTGGTT AACATAGGTT GTCCCAATAA ACCAGAGCAT
ATTGGATGGA ATATTAGTCA ATATAGTAAA GCAATAGAAT TTGCGTTTAT TACTAGATCA
AGATTTACGT TTCTTGATAT TGCGCATCAT GGTAGATTTG CGATAGTTGA AGATATGTAA
 
Protein sequence
MYDKFLKQIL VDESFCKLES VVNAIKSIRI SKRISDNICD IIKQYGNSGF IVTDVNLAPL 
LNKVVFNSVA HFIIPRWSCA SQKLVELIKE KSRDSDVLVS FGSGTINDIC KYVSYITNKR
YISFPTAPSM NGYVSSNASI VLNNGHKKSL QAHLPEAIYI DVDIIVNAPQ RLIISGFADF
ICRSTVQADW LLSHLLLGSE YTELPFLISK RSENALINDY LGLIKHDEYS IMLLMQALLL
SGLGMFIVGG SQSASQGEHM IASTIELLQD DMHFFHGELI GVSMSTMTCL QHRILKSVPR
FYPTLINDED IKQCFHMQYT QEYCDILAQK FINQQKADYL NSLIKDKWSF IVEKITEKTL
SDILLKDMLV NIGCPNKPEH IGWNISQYSK AIEFAFITRS RFTFLDIAHH GRFAIVEDM
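
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an assumption-laden example, not the method used to produce the record: it assumes the gene sequence is shown in the coding orientation (it begins with ATG and ends with TAA), and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (standard assignments,
# which translation table 11 also uses for non-start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First row of the gene sequence listed above.
    first_row = ("ATGTATGATA AATTTTTAAA GCAGATTCTA "
                 "GTTGATGAAA GTTTTTGTAA GTTGGAGTCA")
    print(translate(first_row))  # MYDKFLKQILVDESFCKLES
```

The output for the first row matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.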