Gene Information |
Locus tag | COXBURSA331_A1806 |
Symbol | icmB |
ID | 5793405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Coxiella burnetii RSA 331 |
Kingdom | Bacteria |
Replicon accession | NC_010117 |
Strand | - |
Start bp | 1646724 |
End bp | 1649735 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641331158 |
Product | IcmB protein |
Protein accession | YP_001597445 |
Protein GI | 161830763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
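The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon (the trailing TAA in the gene sequence suggests it does). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1646724
end_bp = 1649735
gene_length_bp = 3012
protein_length_aa = 1003

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which does not encode a residue.
expected_aa = span // 3 - 1

print(span == gene_length_bp)            # coordinates agree with Gene Length
print(span % 3 == 0)                     # length is a whole number of codons
print(expected_aa == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions, all three checks agree with the values reported in the table.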
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA TTAAATCATT TGCAGGCCTG TTTGATAGTT TGTTCGCCTG GTTAAGTAAT ACTTTAAAAC AAAGCACCAG CGCTTATTGT GAATTGCAAA CAGCCGATAG TTCGACTGTG TTAGTTGCTC ACGACGGTTC GTTAATTTCC GTTTTACGTT TAGAAGGCGT TACCGCTCTC ATTGGCCGGG AAGAATTCGA TAAAATTCAA ACCGGGTTGC AGCATGCGTT ACAAACCGTC ATGTCCCAAC CAGGACACGT CATCCAAGTT TATTTTAGCT ATAACAAAGA TGAAGTGCGT GGTGAAATTA ATGAAATTTT GCAGCCGGCT GAACAAACGG CAAAGCGGCT AAGCTTGCAG TTAGGTGATT TGTTTAAGGA GAGAATGAAT TATTTAACAA AATATTGTGC GCATGAAGAA ATTTATATTG TTTTATGGAC CCGTTTGAAA TCCCTTACTA ACGAACAAAT TAAACGATCA ACGAAAGAAA AACGAAAACA GATTAAAAAA CAAAAAATAC CGCCATTTAA ATTAACCCAA AATCTAATTG CCGCTATTCC CGACCTTCGA GAAAATCACG ACTCTTTTGT TCGTTCAGTA GTGAACGAAT TTAATGGGTT GGGGTTGATA ACTGAACTAT TGGAAGTGCA CGATGCTGTT TATATTATGC GTCGCAGTGC GGACCCAGAG TTCACCGATC GAGAATGGCG ACCTTTATTG CCAGGGGATA AAATAACGAT AAAAGAGCCA AAAGCGGGCA CTTCGGAAGT TTCTGATATT TTATGGCCGG CCCTCGCCCG TCAAATATTA CCCCGCGATG CTGAAAATTT AGATTTGCGA ACGGCTCGGG TGGGCGATCG CATTTATGCG ACGGTGTTTA TTAATTTATT TCCGAAGGAC ATTCAAACCT TCGTCCGGCT TTTTACTCGA ACACTCCAAA CGCGGATTCC ATGGCGAATT TCTTTTTTAT TTGAAAGCGA TGGCCTAGCG GGAACGAGCA TTCGTAAAAT GCTCTCTTCG GTATTGAGTG TAACTTCCAC TCAGAATCGT TTAATTCACG ACTCCTTAAA TTTATTAAAT TACATCAATC TCAATACGGA TGATGCCGTC GTGAAATTGC GTGTTTCGGC CGCTACTTGG GCGCCGGAAG GCGATATTCG GCTTTTGCGC GCGCGGGCGG CTATGTTAGC GAAAGCGATT GAAGGATGGG GTTCTTGTGA CGTTTCCGAA ATTTCGGGGG ATGCTTACGA AGGTGTCGTT TCAACAATGA TCGGCATTTC GGGCACCAGT GTTGCTCCCG CCTCCATCGC TCCTCTTTCT AACGTTCTCT ATATGCTTCC GCTCTTTCGC CCCGCCTCGC CGTGGACGCA CGGCGCATTA TTATTTCGTT CTCCCGACGG TAAGCCGTGG CCTTATCAAC CCGGTTCTCA TCAACAAACC ACGTGGATCG ATTTATTTTA CGCGCGACCG GGTTCTGGTA AATCCGTGTT GTCCAACACC ATTAACTTAG CTGTTTGTTT GTCTTCTGGG ATTCAGCGGT TACCGCACAT TTCCATTATC GATATCGGTC CTTCTAGCAG CGGATTAATT TTATTATTAA AAGATGCGTT GCCCGCCGAT AAAAAATATT TGGTGGCTTA CCATCGGTTG CGAATGCGAT CGGATTACGC CATTAATCCT TTTGATACTC AGCTCGGGTG TCGCTATCCC ACTCCACAGG AACGCGCCTT TTTAGTGAAT TTTTTAAGTT TATTGGCAAC GCCTATTGGT TCTGAAAAAA CATACGACGG CGTTGCGGAT 
ATGGCGGGTT TAATCATCGA TGAGCTTTAT AAAAATAAAG CGGACGATGG AAATCCCAAC ACTTTTGCGC TTGGAATGGA AGAAAATATC GATGGGATTT TAGAAGAAAT TGGTTTTGTT CAAGATGATC AAACAACCTG GTGGGAAGTC ACCGATGCGC TATTTATGGC GGGCTTTACT CACGAAGCGA TGTTGGCGCA GCGCCATGCG ATGCCTGTTT TAGCGGATGT CGCTGCTATT TGTCGTTTAC CCGCTATTCA GGATTTGTAC GGGAAAATTG TTGCTCCAAC GGGAGAACCG CTGATTCATG CTTTTGCGCG GATGATTTCA AGCGCGGTGC GTGAATACCC CATCATTTCT CAAGTAACGC GTTTCGATTT AGGCGATGCT CGAGTAGTGG CGTTGGACTT AGATGAAGTG GCACGAAGCG GTGGGGATGC GGCTAATCGT CAAACGGCTG TGATGTATAT GCTAGCGCGA TATGTTTTGG GCCGTCATTA TTTTCTGACG GACGACAACG TGGCGGATAT GCCGGAAGGT TATCGCCATT ATCACCAAGC GCGCATTGCA GAAATTCGGG AAGATCCTAA ACGGATTGTT TTCGATGAAT TTCACCGTAC GGCTAAGGCG CAAGCGGTGC GCGATCAGGT CATTCAAGAC ATGCGTGAAG GCCGTAAATG GAAAGTTCAG GTTGCGTTAT TATCACAATC TTTGGATGAT TTCGATGAAA TTATGGTCGA ATTTGCCACC TCCATATTTA TTATGGACGC CGGGCCTGAA CAAACTGTGC GTAAAACCGC TGAAATTTTC GGATTAAGTC ACACCGCAGA AATTGCATTA AAAACACGGG TGCATGGTCC GCGTGAAGAC GGCGCGACCT TTCTAGCGCA ATTTGCAACT AAAAATGGGT TAAACACGCA GTTATTAACC GCCACATTAG GTCCTGTCGA ATTATGGTCG CTGAACACAA CGGCCGAGGA TGTCAATATT CGTAATCAAC TTTATAAACG CATTGGCCCA AAAGAAACTC GCCGTATTTT AGCCACTATG TTCCCCTCGG GGACGGCCAC CAAAGCGCTC GAAGATCGTT ACGCAGACTA TAAGGAAGAA GGCCGCCTTA TCGATGACAC GGCCAAACAC GGCATTATGC AGTCCTTGAT TAATGAAATT CTAGAAACTT ATTATTCTGA AAAAGAGAGT GCTGTTGTAT AA
|
Protein sequence | MKIIKSFAGL FDSLFAWLSN TLKQSTSAYC ELQTADSSTV LVAHDGSLIS VLRLEGVTAL IGREEFDKIQ TGLQHALQTV MSQPGHVIQV YFSYNKDEVR GEINEILQPA EQTAKRLSLQ LGDLFKERMN YLTKYCAHEE IYIVLWTRLK SLTNEQIKRS TKEKRKQIKK QKIPPFKLTQ NLIAAIPDLR ENHDSFVRSV VNEFNGLGLI TELLEVHDAV YIMRRSADPE FTDREWRPLL PGDKITIKEP KAGTSEVSDI LWPALARQIL PRDAENLDLR TARVGDRIYA TVFINLFPKD IQTFVRLFTR TLQTRIPWRI SFLFESDGLA GTSIRKMLSS VLSVTSTQNR LIHDSLNLLN YINLNTDDAV VKLRVSAATW APEGDIRLLR ARAAMLAKAI EGWGSCDVSE ISGDAYEGVV STMIGISGTS VAPASIAPLS NVLYMLPLFR PASPWTHGAL LFRSPDGKPW PYQPGSHQQT TWIDLFYARP GSGKSVLSNT INLAVCLSSG IQRLPHISII DIGPSSSGLI LLLKDALPAD KKYLVAYHRL RMRSDYAINP FDTQLGCRYP TPQERAFLVN FLSLLATPIG SEKTYDGVAD MAGLIIDELY KNKADDGNPN TFALGMEENI DGILEEIGFV QDDQTTWWEV TDALFMAGFT HEAMLAQRHA MPVLADVAAI CRLPAIQDLY GKIVAPTGEP LIHAFARMIS SAVREYPIIS QVTRFDLGDA RVVALDLDEV ARSGGDAANR QTAVMYMLAR YVLGRHYFLT DDNVADMPEG YRHYHQARIA EIREDPKRIV FDEFHRTAKA QAVRDQVIQD MREGRKWKVQ VALLSQSLDD FDEIMVEFAT SIFIMDAGPE QTVRKTAEIF GLSHTAEIAL KTRVHGPRED GATFLAQFAT KNGLNTQLLT ATLGPVELWS LNTTAEDVNI RNQLYKRIGP KETRRILATM FPSGTATKAL EDRYADYKEE GRLIDDTAKH GIMQSLINEI LETYYSEKES AVV
|
| |
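The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, estimate GC content, and translate a fragment. It is an illustration rather than the pipeline used to produce this record: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in which start codons they allow, which this sketch does not model).

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = "ATGAAAATCA TTAAATCATT TGCAGGCCTG"
print(translate(fragment))  # MKIIKSFAGL
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.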