Gene ECH_0230 details


Gene Information

Locus tag: ECH_0230
Symbol:
ID: 3927462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 217847
End bp: 219082
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 32%
IMG OID: 637901354
Product: hypothetical protein
Protein accession: YP_507051
Protein GI: 88657781
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
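
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 217847
end_bp = 219082

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed 1236 bp and 411 aa.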


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0317148
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGATG TTTTGAATAG AAAATTTTTA GCATGGTTTT TAGTCTCAAT GTTTTATGCG 
TATCAGTATA TATTGAGAGT AATTCCTAAT GTTATTGTCT CTGTGTCTAT GGAGAAGTTC
AAAATTAGTG CTATGGCATT TGGTCAATTT TCTGGTTTAT ATTATATTGG ATATACGTTA
GCACATATAC CGCTAGGTAT TCTTTTAGAT AAATATGGGC CTAAGATAGT ATTACCAATT
TGTGCAGCTC TAACGTTTAT TGGATTAGTG CCATTGCTAA TATCAGATGT GTGGTTATTT
GCTCAAATTG GGAGAATAAT TACAGGTGTA GGTTCAGCTG GTTCTGCTCT TGGCCTTTTT
AAAGTTGCTA GTATGTACTA TGGCAACAGA TTTGCTAGAA TGTCTGGTAT TTCTGTTATA
ATAGGACTGC TAGGTGCAAT GTATGGAGGA CTGCCAATTT TATCTTTGTT AAATAAATTT
GGTTGGGAAA GTTTATTCTT AGTATTTATT ATTATAGGAG CTGTTATAGC GCTGTTGCTA
TATTTGTTTA TGTTGCCTTA TGATAAAGAC TCCAATGTTG AAGATAATAA AGGTCTCTGT
GATAAAATTA AGTTGATTGT ATTTAATAAG TACATTGTTA TGATCAGCTT ACTTGCAGGC
TTTATGATTG GACCTTTGGA AGGTTTTGCT GATGGATGGG TTACTTCATT TTTAAAGGCA
GTTAGTAATA TGGATAAAGA AGTAGCTGCT TTATTGCCTT CTACAATATT TATTGGTATG
TGCTTTGGAT TGTTTGTCTT ACCATATATG TTGGAGAAGA AATCATTTAA TAGTTGGAAT
ATACTTATCA CATCTGCTTT GGGGATGTTG TTTCCTTTCC TGATGTTATA TATCAGTAGT
TCGGTTATAT TGGTCACAAT ATCATTTTTT ATGATAGGGT TTTTTTCATC ATATCAGATT
ATAGCAACTT GTAAGGTACT AAGTTATGTT AGTAATAATG TAGTTGCATT AGCTACTGCT
GTAAATAACA TGATAGCTAT GGCTTTTGGT TATTTCTTTC ATACTGCTAT ATCTTGTGTA
ATAGATTTGT TATGGGATGG AAAAATTGTA GATTCAGAGC CAGTATATAC TAAGGCATTA
ATGCTAAAAT CTATGTTATT TATTCCTGGT GGATTGCTTA TAGGAGCAAT AGGGTTTATT
TACCTGAAAT ATTTGGATAA GAAAGAGGGT AAGTAA
 
Protein sequence
MGDVLNRKFL AWFLVSMFYA YQYILRVIPN VIVSVSMEKF KISAMAFGQF SGLYYIGYTL 
AHIPLGILLD KYGPKIVLPI CAALTFIGLV PLLISDVWLF AQIGRIITGV GSAGSALGLF
KVASMYYGNR FARMSGISVI IGLLGAMYGG LPILSLLNKF GWESLFLVFI IIGAVIALLL
YLFMLPYDKD SNVEDNKGLC DKIKLIVFNK YIVMISLLAG FMIGPLEGFA DGWVTSFLKA
VSNMDKEVAA LLPSTIFIGM CFGLFVLPYM LEKKSFNSWN ILITSALGML FPFLMLYISS
SVILVTISFF MIGFFSSYQI IATCKVLSYV SNNVVALATA VNNMIAMAFG YFFHTAISCV
IDLLWDGKIV DSEPVYTKAL MLKSMLFIPG GLLIGAIGFI YLKYLDKKEG K
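
The two sequence blocks can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino acid string, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino acid assignments).

```python
# Codon table in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGGGGATG TTTTGAATAG AAAATTTTTA"))
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) is a quick way to confirm that the two blocks belong together; `gc_fraction` on the same input can be compared with the GC content field.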