Gene Information |
Locus tag | ECH_0230 |
Symbol | |
ID | 3927462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 217847 |
End bp | 219082 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637901354 |
Product | hypothetical protein |
Protein accession | YP_507051 |
Protein GI | 88657781 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0317148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGATG TTTTGAATAG AAAATTTTTA GCATGGTTTT TAGTCTCAAT GTTTTATGCG TATCAGTATA TATTGAGAGT AATTCCTAAT GTTATTGTCT CTGTGTCTAT GGAGAAGTTC AAAATTAGTG CTATGGCATT TGGTCAATTT TCTGGTTTAT ATTATATTGG ATATACGTTA GCACATATAC CGCTAGGTAT TCTTTTAGAT AAATATGGGC CTAAGATAGT ATTACCAATT TGTGCAGCTC TAACGTTTAT TGGATTAGTG CCATTGCTAA TATCAGATGT GTGGTTATTT GCTCAAATTG GGAGAATAAT TACAGGTGTA GGTTCAGCTG GTTCTGCTCT TGGCCTTTTT AAAGTTGCTA GTATGTACTA TGGCAACAGA TTTGCTAGAA TGTCTGGTAT TTCTGTTATA ATAGGACTGC TAGGTGCAAT GTATGGAGGA CTGCCAATTT TATCTTTGTT AAATAAATTT GGTTGGGAAA GTTTATTCTT AGTATTTATT ATTATAGGAG CTGTTATAGC GCTGTTGCTA TATTTGTTTA TGTTGCCTTA TGATAAAGAC TCCAATGTTG AAGATAATAA AGGTCTCTGT GATAAAATTA AGTTGATTGT ATTTAATAAG TACATTGTTA TGATCAGCTT ACTTGCAGGC TTTATGATTG GACCTTTGGA AGGTTTTGCT GATGGATGGG TTACTTCATT TTTAAAGGCA GTTAGTAATA TGGATAAAGA AGTAGCTGCT TTATTGCCTT CTACAATATT TATTGGTATG TGCTTTGGAT TGTTTGTCTT ACCATATATG TTGGAGAAGA AATCATTTAA TAGTTGGAAT ATACTTATCA CATCTGCTTT GGGGATGTTG TTTCCTTTCC TGATGTTATA TATCAGTAGT TCGGTTATAT TGGTCACAAT ATCATTTTTT ATGATAGGGT TTTTTTCATC ATATCAGATT ATAGCAACTT GTAAGGTACT AAGTTATGTT AGTAATAATG TAGTTGCATT AGCTACTGCT GTAAATAACA TGATAGCTAT GGCTTTTGGT TATTTCTTTC ATACTGCTAT ATCTTGTGTA ATAGATTTGT TATGGGATGG AAAAATTGTA GATTCAGAGC CAGTATATAC TAAGGCATTA ATGCTAAAAT CTATGTTATT TATTCCTGGT GGATTGCTTA TAGGAGCAAT AGGGTTTATT TACCTGAAAT ATTTGGATAA GAAAGAGGGT AAGTAA
|
Protein sequence | MGDVLNRKFL AWFLVSMFYA YQYILRVIPN VIVSVSMEKF KISAMAFGQF SGLYYIGYTL AHIPLGILLD KYGPKIVLPI CAALTFIGLV PLLISDVWLF AQIGRIITGV GSAGSALGLF KVASMYYGNR FARMSGISVI IGLLGAMYGG LPILSLLNKF GWESLFLVFI IIGAVIALLL YLFMLPYDKD SNVEDNKGLC DKIKLIVFNK YIVMISLLAG FMIGPLEGFA DGWVTSFLKA VSNMDKEVAA LLPSTIFIGM CFGLFVLPYM LEKKSFNSWN ILITSALGML FPFLMLYISS SVILVTISFF MIGFFSSYQI IATCKVLSYV SNNVVALATA VNNMIAMAFG YFFHTAISCV IDLLWDGKIV DSEPVYTKAL MLKSMLFIPG GLLIGAIGFI YLKYLDKKEG K
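
The coordinate, length, and GC fields in this record are related by simple arithmetic, which the sketch below illustrates. It assumes 1-based inclusive coordinates (this reproduces the record's own numbers) and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record: Start bp 217847, End bp 219082.
print(gene_length(217847, 219082))  # 1236, matching "Gene Length"
print(protein_length(1236))         # 411, matching "Protein Length"
```

Passing the "Gene sequence" field to `gc_percent` should give a value close to the reported GC content, although the database's exact rounding is not stated in the record.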