Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1020 |
Symbol | |
ID | 3927228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1044549 |
End bp | 1045820 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637902135 |
Product | putative outer membrane protein TolC |
Protein accession | YP_507806 |
Protein GI | 88657721 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.808388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGCA AATTTAATAT ACGTAAGGTT TGTTATACAT TGATATTGAT ATCTATGTCA ATTATACCTA ATAACAGTTA TTGTACTAAC TTAGATGAAG CTTTACAGGC TGCATTATCA AATAACCCCA ACATAAAAGC AAAATTCTAT CATTCCTTAG GGAATAAACA AAAAATTAAA TTGAATAGCA TATCAAAGTT TTTACCATCA ATTGCATACT CCGTGCAGGT ACATCAGCCA GAATTATCTC TAACCAACAA TAGTAATAGA ACTATGAGCC TCATAGTTAC TCAACAGCTA TTCAATGGAG GAGCTGATGC CGCTGCTTTT CAACAATCAA AATACTTAAC AAATATAGAA GATATTGATT TTTCACTAGA GAAACAAAAT GTTATACTTA ATACAGTAAA AGCTTACATG AAGGTTTTAA CAACAGCTGA GGTATATAAG TTAACACAGC ATACTAAAAA AGTATTAGCA GAACATTTAA CAGCCACACA AAAACGTTTT TCTTTAGGAG AAGTTACTAA AACAGATGTC TCACTAGCTA CTGCTAGGTT ATCATCAGCT ACATCAGAAT TAATCAAAGC TCACGGAGAA ATGAAAGTTG CAGAAGCTAA CTACATTCAC ATAACAGGAG AAATACCAAC AGATTTACAA AATCCTGCTA TACCAGCAAT ACCATCATCT GTAGAAGAAG CTTTAGAAAT AGCTCAAAAA AATAACCTTT CTCTACAAGC ATCTCACAAC GGATATAAAG CAGCTAAGCA GGGTATCTTA ATGGCAATTG CACATTTACT TCCTTCTATT AGCATATCAT CAATAAATTC TTATACTTAC TCTAATATTC CTAACACAAA TCCTAAAAAA ATTGACAATC TATTTGAAAT AAAAATGTCA TTACCTATAT TCCAACAAGG ATTAAACATC GCTGCAATTG CACAATCAAA ACTTGCAGCA CAACACAAGA TGTATTCACA TTATGAAGTG TTAAACACGA TTAAAGAGTC TGTTATTTCA AATTGGGAAA ATATTTTCAC TACAAATTCC ATGCTACAAG CAGCTCAAGA TTCTGTGAGA TATTCAGAAG TAGCATTATT CGGAATAAAA CAGGAAGCAG AGTTAAATTT AAGAACAGTT CTAGATGTAT TAGATGCAGA GCAAGAATTG CTAAAAGCAA AAGTCAATCT TGTTAATGTA CAAAGTAATG TCGTGATAAG TATATACAAC CTACTTGCAT TAATAGGACA ACTAAACATT AATTATATTT AA
|
Protein sequence | MISKFNIRKV CYTLILISMS IIPNNSYCTN LDEALQAALS NNPNIKAKFY HSLGNKQKIK LNSISKFLPS IAYSVQVHQP ELSLTNNSNR TMSLIVTQQL FNGGADAAAF QQSKYLTNIE DIDFSLEKQN VILNTVKAYM KVLTTAEVYK LTQHTKKVLA EHLTATQKRF SLGEVTKTDV SLATARLSSA TSELIKAHGE MKVAEANYIH ITGEIPTDLQ NPAIPAIPSS VEEALEIAQK NNLSLQASHN GYKAAKQGIL MAIAHLLPSI SISSINSYTY SNIPNTNPKK IDNLFEIKMS LPIFQQGLNI AAIAQSKLAA QHKMYSHYEV LNTIKESVIS NWENIFTTNS MLQAAQDSVR YSEVALFGIK QEAELNLRTV LDVLDAEQEL LKAKVNLVNV QSNVVISIYN LLALIGQLNI NYI
|
| |