Gene Information |
Locus tag | ECH_0977 |
Symbol | |
ID | 3927174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 999835 |
End bp | 1000854 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637902093 |
Product | putative phosphate ABC transporter, periplasmic phosphate-binding protein |
Protein accession | YP_507764 |
Protein GI | 88658150 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | |
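The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS coordinates include the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above (locus tag ECH_0977).
length_bp = gene_length(999835, 1000854)
print(length_bp)                  # 1020
print(protein_length(length_bp))  # 339
```

Under these assumptions the computed values agree with the listed Gene Length (1020 bp) and Protein Length (339 aa).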
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGTAGAT TTCTTTTGGT GTTTCTATCG CTAATTACAT TAGCATTTAA TTCCAGCGCA CAACAAATAC GAGTTGTTGG ATCATCTACA GCATTCCCAT TTATTTCAGC TATAGCAGAA GAATTTGGAA GATTTTCAGA TTACGGCACC CCTATTATAG AATCAGTAGG TAGCGGAATG GGATTCAGCA TGTTTTGTCA AGGTGTAGGG AAAAATACTC CAGATATAGT AATGTCATCA AGAAAAATTA AGGATGCAGA AGTTAAATTA TGTCAAAGTA ATAATATTAA CAATATTATA GAGATCATTA TAGGTTATGA TGGCATAGTT ATAGCAAATT CCAAAGATAG TGCCAAACTT GATTTTACAA AAAAAGACCT ATTCAAAGCT TTAAGTAAAT ACTCCACATC AAATGAGTAT GTTGAAACAA TACCGTCCAA CAACTTCAGA TATTGGTCTG AAATCAATAA TAGATTCCCT AATATTGATA TTGAGATCTA TGGACCATAT AAAAATACTG GAACCTATAA CATACTAGTA GAAGAGATCA TGCAAGATGC TTGTATGAAT AACCAAAATT TTATCGATGT ATACTCAGAT CCTACAAAAA GGCGTCGTAT ATGTAGTATT ATGCGTAACG ATGGCAAATA CATTGAAGTT GCTGCCAACG AAAACATTAT TATACAAAAA ATAGCAAAAA ATCAAGACGC TTTTGGCATA TTAAGTTTTA GTTTCTTAGT AAAAAATCAA GATAAAATAC ATGCAAATAA AATAGCAGGT ATAGAACCTA ATTATGAAAC TATATCTTCA GGAAAATACA TTCTGTCAAG ACCCATATAC CTATATATAA AACAAGAACA CATCAGTTTT TCTCCCAGCT TGAAAGAATT TATTAAGGTA ATTCTAAGAG AAGATTCCAT CGGCAAGAAC GGATACTTAA TTGGGCTAGG TTTTATACCT TTATCAGATA AAGACTTACA AGACACTAAA AATCGTATTA CCGGTATATT AGAAAAATAA
Protein sequence | MGRFLLVFLS LITLAFNSSA QQIRVVGSST AFPFISAIAE EFGRFSDYGT PIIESVGSGM GFSMFCQGVG KNTPDIVMSS RKIKDAEVKL CQSNNINNII EIIIGYDGIV IANSKDSAKL DFTKKDLFKA LSKYSTSNEY VETIPSNNFR YWSEINNRFP NIDIEIYGPY KNTGTYNILV EEIMQDACMN NQNFIDVYSD PTKRRRICSI MRNDGKYIEV AANENIIIQK IAKNQDAFGI LSFSFLVKNQ DKIHANKIAG IEPNYETISS GKYILSRPIY LYIKQEHISF SPSLKEFIKV ILREDSIGKN GYLIGLGFIP LSDKDLQDTK NRITGILEK
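The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. It assumes the sequence is given 5' to 3' on the coding strand and starts in frame, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted; this gene starts with ATG, so no special handling is needed). The names `translate`, `gc_percent`, and `clean` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record (blocks of 10) and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above, with the record's spacing kept.
print(translate("ATGGGTAGAT TTCTTTTGGT GTTTCTATCG"))  # MGRFLLVFLS
```

Passing the full gene sequence from the record to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked the same way.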