Gene ECH_1020 details


Gene Information

Locus tag: ECH_1020
Symbol:
ID: 3927228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1044549
End bp: 1045820
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 31%
IMG OID: 637902135
Product: putative outer membrane protein TolC
Protein accession: YP_507806
Protein GI: 88657721
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01844] type I secretion outer membrane protein, TolC family
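
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported numbers, not something the record states) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; function names are hypothetical helpers.
START_BP = 1044549
END_BP = 1045820


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1272 423
```

Both results agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields.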


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.808388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAGCA AATTTAATAT ACGTAAGGTT TGTTATACAT TGATATTGAT ATCTATGTCA 
ATTATACCTA ATAACAGTTA TTGTACTAAC TTAGATGAAG CTTTACAGGC TGCATTATCA
AATAACCCCA ACATAAAAGC AAAATTCTAT CATTCCTTAG GGAATAAACA AAAAATTAAA
TTGAATAGCA TATCAAAGTT TTTACCATCA ATTGCATACT CCGTGCAGGT ACATCAGCCA
GAATTATCTC TAACCAACAA TAGTAATAGA ACTATGAGCC TCATAGTTAC TCAACAGCTA
TTCAATGGAG GAGCTGATGC CGCTGCTTTT CAACAATCAA AATACTTAAC AAATATAGAA
GATATTGATT TTTCACTAGA GAAACAAAAT GTTATACTTA ATACAGTAAA AGCTTACATG
AAGGTTTTAA CAACAGCTGA GGTATATAAG TTAACACAGC ATACTAAAAA AGTATTAGCA
GAACATTTAA CAGCCACACA AAAACGTTTT TCTTTAGGAG AAGTTACTAA AACAGATGTC
TCACTAGCTA CTGCTAGGTT ATCATCAGCT ACATCAGAAT TAATCAAAGC TCACGGAGAA
ATGAAAGTTG CAGAAGCTAA CTACATTCAC ATAACAGGAG AAATACCAAC AGATTTACAA
AATCCTGCTA TACCAGCAAT ACCATCATCT GTAGAAGAAG CTTTAGAAAT AGCTCAAAAA
AATAACCTTT CTCTACAAGC ATCTCACAAC GGATATAAAG CAGCTAAGCA GGGTATCTTA
ATGGCAATTG CACATTTACT TCCTTCTATT AGCATATCAT CAATAAATTC TTATACTTAC
TCTAATATTC CTAACACAAA TCCTAAAAAA ATTGACAATC TATTTGAAAT AAAAATGTCA
TTACCTATAT TCCAACAAGG ATTAAACATC GCTGCAATTG CACAATCAAA ACTTGCAGCA
CAACACAAGA TGTATTCACA TTATGAAGTG TTAAACACGA TTAAAGAGTC TGTTATTTCA
AATTGGGAAA ATATTTTCAC TACAAATTCC ATGCTACAAG CAGCTCAAGA TTCTGTGAGA
TATTCAGAAG TAGCATTATT CGGAATAAAA CAGGAAGCAG AGTTAAATTT AAGAACAGTT
CTAGATGTAT TAGATGCAGA GCAAGAATTG CTAAAAGCAA AAGTCAATCT TGTTAATGTA
CAAAGTAATG TCGTGATAAG TATATACAAC CTACTTGCAT TAATAGGACA ACTAAACATT
AATTATATTT AA
 
Protein sequence
MISKFNIRKV CYTLILISMS IIPNNSYCTN LDEALQAALS NNPNIKAKFY HSLGNKQKIK 
LNSISKFLPS IAYSVQVHQP ELSLTNNSNR TMSLIVTQQL FNGGADAAAF QQSKYLTNIE
DIDFSLEKQN VILNTVKAYM KVLTTAEVYK LTQHTKKVLA EHLTATQKRF SLGEVTKTDV
SLATARLSSA TSELIKAHGE MKVAEANYIH ITGEIPTDLQ NPAIPAIPSS VEEALEIAQK
NNLSLQASHN GYKAAKQGIL MAIAHLLPSI SISSINSYTY SNIPNTNPKK IDNLFEIKMS
LPIFQQGLNI AAIAQSKLAA QHKMYSHYEV LNTIKESVIS NWENIFTTNS MLQAAQDSVR
YSEVALFGIK QEAELNLRTV LDVLDAEQEL LKAKVNLVNV QSNVVISIYN LLALIGQLNI
NYI
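
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch. It builds a codon table by hand rather than using a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the sketch is adequate here because this gene begins with ATG. The helper names (clean, translate, gc_percent) are illustrative, and the input is assumed to contain only A, C, G, and T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence above.
prefix = "ATGATAAGCA AATTTAATAT ACGTAAGGTT"
print(translate(prefix))  # prints: MISKFNIRKV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to translate and gc_percent would allow the complete protein and the GC content field to be compared in the same way.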