Gene ECH_0193 details

Gene Information

Locus tag: ECH_0193
Symbol:
ID: 3927694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 184447
End bp: 185715
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 32%
IMG OID: 637901317
Product: major facilitator family transporter
Protein accession: YP_507017
Protein GI: 88657883
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
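
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene information fields of this record.
start_bp = 184447
end_bp = 185715
gene_length_bp = 1269
protein_length_aa = 422

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks print `True` for the values listed in this record.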


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTAA GAAGAGCTGT GTTATCGACA ATCATGTGTA ATACTTTAGT ATGGTATGAT 
TATGTGTTAT TTGGGAATTT GGTGAGTGTA ATCAGTAAAT TATTTTTTCC AGCAGAAGAT
AGATATTTTA GTCTTATTAT GACATTCAGT ATTTTTGCAG TTGGATTTTT AATGCGTCCT
TTTGGGGCAA GTATTTTTGG TTACATTGGC GATAAATATG GAAGAAAAGC TGCACTGACT
TTATCGATTA TAGCAATATC TGTCCCTATT ACTTTTATCT CAATATTGCC TACCTATGAA
AAAATAGGAA TATTGTCTCC TATATTACTT ATTATTTGTA GGTTGATGCA GGGGATATCT
CTAGGTGGAG AAGCTGGTAA TGCTACTTTC TTAATAGAGC ATTCTAAAAA GGGAAAAAAC
ATTGGTTTTT TTGGTAGTTT TGAGACCCTT AGTGCTGTGC TTGGTTCTAT TATTGCATTA
TTTATGATCT TGTTATCTCA GTACTTTACA GGAGAAAATT TTGAAGTATG GGGTTGGAGA
ATACCTTTTG TAATTGGTTT ATTATTGGGA TTAATTAGTG TTTATATTAG GCGTATTACT
GGTGAAAGTC CTGCGTATGA TACTCATAAA GAAAATAATA ATCTTTCTCA ATCTCCTTTC
TTAGAATTGT TAAAAAAGTA TAAGCGCCCT TTAGTCTTGG CAACATGTAT TGACTGTGTA
GAAAATTGTT CATTTCATAT TTTTATGGTG TTTTTTATTA CATTTATTAA GGAGTTCTCA
AATATTCACC TGAATTTAAA TGCTAATACT ATAAGCATTA TTGAAAGTTT TAATATAATG
ATTTGTGGTA TTTTGAATGT ATTTTTTGGA TATATTTCAG ATTATGTAGG GCGTAGAAAA
GTAATGTTAA TTGCATCTGT GTCATTGTTT TGTGTTGCAA TACCAGTATT TTGGTTATTA
AGTCAAGATA GCTATGTTTC TTTGATTGCT GCATATTTAA TATTTGTAAT TCCGTTTTCT
GCAACTTTAG GTCCAGCAAG TGGTGCAATG TCTGAATTGT TCCCTACAAA AGTTAGATAT
ACTGGTTTTG GATTATCGCG TAACATTGCT TCAGCTATAT CTGGTGGTAT GGCTCCTGTA
GTATGTACAT GGCTTATAAG GGCAACAGGG CTTTCGTTTA TTCCTGGAGT ATATGTTATG
TTTTGGGCAT TGGTTGGAGT TATTGCATTG TGTCAGATCA GAAAAAAAGA TGTTTATGCT
GATTGGTAA
 
Protein sequence
MNLRRAVLST IMCNTLVWYD YVLFGNLVSV ISKLFFPAED RYFSLIMTFS IFAVGFLMRP 
FGASIFGYIG DKYGRKAALT LSIIAISVPI TFISILPTYE KIGILSPILL IICRLMQGIS
LGGEAGNATF LIEHSKKGKN IGFFGSFETL SAVLGSIIAL FMILLSQYFT GENFEVWGWR
IPFVIGLLLG LISVYIRRIT GESPAYDTHK ENNNLSQSPF LELLKKYKRP LVLATCIDCV
ENCSFHIFMV FFITFIKEFS NIHLNLNANT ISIIESFNIM ICGILNVFFG YISDYVGRRK
VMLIASVSLF CVAIPVFWLL SQDSYVSLIA AYLIFVIPFS ATLGPASGAM SELFPTKVRY
TGFGLSRNIA SAISGGMAPV VCTWLIRATG LSFIPGVYVM FWALVGVIAL CQIRKKDVYA
DW
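
Both sequences are printed in blocks of 10 characters separated by spaces, so they need to be cleaned before any analysis. The following is a minimal Python sketch of that step, with a helper for computing GC content. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGAATTTAA GAAGAGCTGT GTTATCGACA ATCATGTGTA ATACTTTAGT ATGGTATGAT"
last_line = "GATTGGTAA"

dna_start = clean_sequence(first_line)
dna_end = clean_sequence(last_line)

print(len(dna_start))               # number of bases on the first line
print(dna_start.startswith("ATG"))  # does the gene begin with an ATG start codon?
print(dna_end[-3:])                 # final codon of the gene
print(gc_fraction(dna_start))       # GC fraction of the first line only
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare with the GC content field of this record.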