Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0581 |
Symbol | |
ID | 3927325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 589788 |
End bp | 590924 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901703 |
Product | sodium:dicarboxylate symporter family protein |
Protein accession | YP_507392 |
Protein GI | 88658620 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGAGATA TTTTTATTAA CTTACTCAAA CTAATTAGCT TACCAGTAGT ATTTTTTTCT ATTACATCGA CAATTTCTGG ATTGGCAAAT TTGACAGAAA TTAAAACTTT AATAAGAAAA ACCATATTTT ATACTATATC TACTACAGTG ATTGCAGCTA CTGTAGGTCT TATCACATAC CTATTAATTG ATCCATCAAA AAAAGAACTT ATATATAACA TATTGAGTAC CAATAAGCAT ATTAGCAATA CTCCAGATTA TTTATCGTAT TTAATGTCTA TATTACCTTA TAACTTCATT AAGGTATTTC TAGATAATAA TGTAATTGGG TGTGTCATAC TTGCATTCCT TATAGGAGGA GCACTGCTAT TATTGCCTGA TAAAAATAAA CGCGAATTGC TTCATAAAGT TTTTGATGCC CTCTTCGATA CATTTTTAGA AATAGCAAAG CTTATATTAA AGCTAATGCC CATAGCATTA TGGTCATTTA TTACTGTACT GTTATATAAC ATGAAGGAAG GATATAACAT ATCAAATGTT TTAAAATATC TGTTATGTAT CATGATAGCA AATTTCATAC AAGCATGTAT AATACTGCCT CTGTTACTAA AATTAAAGAA GATTCCTGTT ATAAAAACAT TTCGCGGTGT TTTGCCAGCT TTAACCATAG CTTTCTTTTC AAAATCTTCT ACAGCTACTC TACCAACAAC TCTTCGTTGC ACACAAGATT ACTTAAATAT TCCAAAAAAG ATATCATCTT TTATACTGCC AGTTTGTACA ACTATTAATA TGAATGCATG TGCTGCTTTC ATATTAATCA CAGTATTTTT TGTATCAGAA GTTAACGGAT ACACATTTTC TATTGGTGAG ATGTTTTTGT GGGTGTTCCT AGCTACTGGA GCAGCTATTG GTAATGCAGG GGTTCCAATG GGGTGCTACT TTATGGCTAT GAGTTATCTC ATGTCAATGA AAGTGCCTTT GAGTATCATG GGAGTTATAT TGCCTGTATA TACAATAATA GATATGTTTG AAACTGCAAT TAATGTATGG TCAGATGTAT GCATTACTCA GATAGTACAT AAAGAATATG ATGCGTTGAT AAAAAAAGGT AAGAAAATTG GAATAAATGA TCAATGA
|
Protein sequence | MGDIFINLLK LISLPVVFFS ITSTISGLAN LTEIKTLIRK TIFYTISTTV IAATVGLITY LLIDPSKKEL IYNILSTNKH ISNTPDYLSY LMSILPYNFI KVFLDNNVIG CVILAFLIGG ALLLLPDKNK RELLHKVFDA LFDTFLEIAK LILKLMPIAL WSFITVLLYN MKEGYNISNV LKYLLCIMIA NFIQACIILP LLLKLKKIPV IKTFRGVLPA LTIAFFSKSS TATLPTTLRC TQDYLNIPKK ISSFILPVCT TINMNACAAF ILITVFFVSE VNGYTFSIGE MFLWVFLATG AAIGNAGVPM GCYFMAMSYL MSMKVPLSIM GVILPVYTII DMFETAINVW SDVCITQIVH KEYDALIKKG KKIGINDQ
|
| |