Gene Information |
Locus tag | COXBURSA331_A1811 |
Symbol | icmE |
ID | 5794161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Coxiella burnetii RSA 331 |
Kingdom | Bacteria |
Replicon accession | NC_010117 |
Strand | - |
Start bp | 1652302 |
End bp | 1655406 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641331163 |
Product | IcmE protein |
Protein accession | YP_001597450 |
Protein GI | 161829987 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
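The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1652302, 1655406)
print(length)                     # 3105
print(protein_length_aa(length))  # 1034
```

Under these assumptions the computed values agree with the Gene Length (3105 bp) and Protein Length (1034 aa) fields listed in the record.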
| |
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAGT TTTCAAAAAA GTTTCTACAA AGTGCAAAAT TCCGAGTAAT TGCCGCCGCG GTTGCCGCTG TGGCTTTAAT CGCGGTGGTG GGCGTTATTT GGCATCATAA GGCTACCGAA GACGCCTTTA AAAGTACGGC CGAAGTTTCT TCTCCGCCGA CGATTGAATC TCTACCAGGG GCTGGCAATC CCTCAGACGC TTACGTGAAA ACCCAGAATA TCCAGAATGC CCAACAAGCG AGTGAAGCGC GCAAAGGCGG CACCAGCTTT GTGCCTACCA TTACGCGTCC AAGCTTTCTC GGATCGGAAG ACCAATTTGA ACAAGATCAA CCGTCAGCCC CGACAACGGA CTTAAAAAAA CGAAATTGTC CCATTAAGAA AGTCGTTTAT ATGTATAAAC CAAATCCAGC GAGCTGTACG GTGGATAATT TAAAATTAGC GCGAAGCGCC GGCGTTACGG CTGAAGAATT AGTGTGTCAA TCCTGTTCTT GTCCATCTTT ACGTCTGGCT GGCTACACGG CCGGTGAGTT AAAAGAAGTG GGTTATTCCA CCGTGGAACT GCGCAAGTGT GGATTTAGCA TCGCCCAATT GCAGGCGGCG GGATTTAGCG CGAAAGATTT AAAAGCGGCT GGTTTTACCG CTGCCCAATT AAAAGCAGCA GGCTTCAGCG CAGGAGAATT GGCGGACGCC GGTTTCACGC CGGATCAAAT AAAAGCGGCC GGTTATTCAT CAGCAGAAAT GCAAGCCGCG GGAATCCAAA CAAACAATCC CGATTGCGAT CTCGCCGCCT TAAAAAAAGC CCGCGCCAGT GGTGTCACAG CGGCGGAATT ACGCCAAAAA GGCTGCGGTT TAGCCGCATT AAAAGCCGCC GGCTTCACTG CAGCTGAGCT TAAAGATGCC GGTTTTACGG CCGCGCAGCT AAAAGCGGCG GGCTTTAGTG CGAAAGATTT GAAAGCGACC GGTTTTACCG CTGCGCAATT AAAAGCCGCG GGTTTTAGTG CAAAAGATTT AAAAAGCGCT GGCTTTAGTG CAGTGGCATT AAAACAAGCG GGATTTAGCA ATGCGGATTT GAAGGATGCC GGATTTTCGC CGGAACAAAT TCAGGCAGCG GATAAAGTGG CAAAAGTTTG TGACGTGGAG GCATTAAAAG CTGCCCGCGC CCAAGGTATT TCCGCTAAAG AACTAAAAGA GAAGGGTTGC GGCTTGGCGG CTTTAAAAGC GGCCGGCTTT ACCGCCGCCG AACTTAAAGA CGCGGGCTTT ACGGCCGCGC AGCTTAAAGG GGCCGGATTC AGCGCCGCCG ATTTAAAAGC GGCCGGTTTT TCTGCGGCGC AATTAAAAGC GGCTGGGTTT AGTGCCAAAG CTTTAAAAGC GGCTGGGTTT TCAGCCCACG ATTTAGCGAC GGCTGGATTT AACGCTTCAC AATTGAAAGA CGCCGGTTTC ACCGCCGATG ATTTAAAAGC GGCTGGTTTC AGTGATCAGG CTTTGAGCGC AGCGGGATTC CCGCCGTCTT CTGGCGATTG CAGCGTCAAA GCGCTTAAAA AAGCGCGAAT GGCGGGTATT TCAGCCACTG AATTAAAAGA AAAAGGTTGT GGGCTGGCTG CTTTAAAAGC AGCCGGCTTC ACCGCAGCCG AGCTTAAAAA CGCTGGATTT ACAGCCGCGC AGCTCAAAGC GGCTGGCTTT AGCGCTAAGG ATTTGAAGGA CGCGGGGTTT TCGGCAGCGG AATTAAAAGC AGCCGGATTC GGGGCTAAGG ATTTAAAAGA TGCTGGCTAT TCGGCACAAG ACTTAAAAGC GGCTGGCTTT 
AGTGCCGCTC AATTGAAGGA TGCCGGTTTT GACGCTCAAG CCTTGAAAGA CGCAGGCTTC TCAGCGGCGG ACTTAAAAAA TGCTGGTTTT AGCGCTGAAG CACTGAAAAA CGCCGGTTTC AGCGCCGCGC AACTCAAAGC GGCTGGCTTT AGTGCAGGGG CTTTAAAAGC GGCTGGCTTT AGCGCTTCTC AATTAAAAGC CGCCGGCTTT GACGCAAAAG CGCTACGCGA TGCGGGTTTT TCCGCAGGCG AATTAAAAGC AGCGGGATTT TCGCCTGAAG AATTACGCCA TGCGGGTTAT TCAAAAGGTG ATTTATTGCG GGCGGGTTAT ACGGCTGAGC AAGCCGGTTA TCCTCCATCC TCCCCGCCGG GGACAGAGGT CTCGCAGTCC GCCCAGCGCC CGCCCTTATC TGCCGATAAT TCAGCAGCCA GTGTGTCGGG ATTAAATAAT TCACAGAGCA GTGCGATGCC TTCTATTAAT AGTGATTCGC CTGAGGCGCG CTTGCGGGCA TTGCAAAAAT TGCAGCAAGA ACAACTCAAC GAGCAGCAGC GCCGAGATGT GGAACAGCAA ATGCAAGGGC AAATGAGCTT GCAAGCGCAA AAGCTCATGG CTGGCTGGAG TAATGATTCG GGGCAAGCCT ATCAAGTGGC CTTGCAACAA CCGGCGACTA CACCGGTAGG GGGCAACGTT AGCAGCCAGC AAGGGGCGGG AGCCGCGGCT AAACCCACTG GACCCGTCAT TAAGGCAGGG ACCATTATGT TTGCTGTTTT GGATACCGGT ATTAACAGCG ATGAAAAAAG TCCAATTTTA GCCACCATTG TGACAGGCAA ACTGAAAGGA TCGAAACTCA TCGGTGATTT TAGCCGAGTG GATAAGAAGG TCTTATTAAA ATTCAATTTA CTGAATGTGC CTTCTTTCGA CCATACTTTT GGCATTAATG CGGTGGCGAT TGACCCTGAT ACCGCCCGAA CCGCGATCGC GAAATCCGTT AATAGCCATT ATTTATTGCG TTACGGATCG TTATTTGCCT CGGCATTTTT GTCGGGCCTG TCGCAAGGAA TTATCCAATC GGGTTCGACG GAGGAGTGTT TCTTCGGTAT CTGTCACAGA CAGTATTCGA AACTTAACAC AGCTCAATAT ATTGCTTTAG GCATGGGCAA CGTCGGTGAA CAATATGCCA CCGTGATGGG GAACAATTTC AATCGCGCGC CGACCATTCG AGTGCCTGGC GGTACGGGGA TTGGATTGTT GTTCATGAGT GATATCACAT TGCCGCAACC CTTACCGGCG CATCAAAACA CGTAA
|
Protein sequence | MAEFSKKFLQ SAKFRVIAAA VAAVALIAVV GVIWHHKATE DAFKSTAEVS SPPTIESLPG AGNPSDAYVK TQNIQNAQQA SEARKGGTSF VPTITRPSFL GSEDQFEQDQ PSAPTTDLKK RNCPIKKVVY MYKPNPASCT VDNLKLARSA GVTAEELVCQ SCSCPSLRLA GYTAGELKEV GYSTVELRKC GFSIAQLQAA GFSAKDLKAA GFTAAQLKAA GFSAGELADA GFTPDQIKAA GYSSAEMQAA GIQTNNPDCD LAALKKARAS GVTAAELRQK GCGLAALKAA GFTAAELKDA GFTAAQLKAA GFSAKDLKAT GFTAAQLKAA GFSAKDLKSA GFSAVALKQA GFSNADLKDA GFSPEQIQAA DKVAKVCDVE ALKAARAQGI SAKELKEKGC GLAALKAAGF TAAELKDAGF TAAQLKGAGF SAADLKAAGF SAAQLKAAGF SAKALKAAGF SAHDLATAGF NASQLKDAGF TADDLKAAGF SDQALSAAGF PPSSGDCSVK ALKKARMAGI SATELKEKGC GLAALKAAGF TAAELKNAGF TAAQLKAAGF SAKDLKDAGF SAAELKAAGF GAKDLKDAGY SAQDLKAAGF SAAQLKDAGF DAQALKDAGF SAADLKNAGF SAEALKNAGF SAAQLKAAGF SAGALKAAGF SASQLKAAGF DAKALRDAGF SAGELKAAGF SPEELRHAGY SKGDLLRAGY TAEQAGYPPS SPPGTEVSQS AQRPPLSADN SAASVSGLNN SQSSAMPSIN SDSPEARLRA LQKLQQEQLN EQQRRDVEQQ MQGQMSLQAQ KLMAGWSNDS GQAYQVALQQ PATTPVGGNV SSQQGAGAAA KPTGPVIKAG TIMFAVLDTG INSDEKSPIL ATIVTGKLKG SKLIGDFSRV DKKVLLKFNL LNVPSFDHTF GINAVAIDPD TARTAIAKSV NSHYLLRYGS LFASAFLSGL SQGIIQSGST EECFFGICHR QYSKLNTAQY IALGMGNVGE QYATVMGNNF NRAPTIRVPG GTGIGLLFMS DITLPQPLPA HQNT
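The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence field.
print(translate("ATGGCAGAGT TTTCAAAAAA GTTTCTACAA"))  # MAEFSKKFLQ
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.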