Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | COXBURSA331_A1196 |
Symbol | |
ID | 5794189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Coxiella burnetii RSA 331 |
Kingdom | Bacteria |
Replicon accession | NC_010117 |
Strand | + |
Start bp | 1081486 |
End bp | 1082841 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641330634 |
Product | protease Do |
Protein accession | YP_001596934 |
Protein GI | 161831404 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TATCCAAAAT TATTCTCAGC AGTATTTTTG CGGGGCTCCC ATTACTCCTG CCCGTCAGTA GTTACGCTCA CTTACCCTCC GCTGTCGAAG GAAAAACAAT ACCCAGCCTT GCACCAATGT TGAATAAGAC CACACCGAGC GTCGTCAACA TTGCCGTGGA AAAGCTGATT CCTCAAACGC CAAACCCCTT ACAACCCGAA ATGGATCAAA ACACAGCACC AACGAAAGTC TTAGGCGTAG GCTCGGGTGT AATCATAGAC GCCAAAAAAG GCTATATCGT AACAAATGCT CATGTCGTCA AAGACCAAAA AATCATGGTG GTGACGCTTA AAGATGGTCG CCGTTATCGA GCGAAAGTCA TCGGAAAAGA TGAAGGGTTT GATCTGGCTG TGATTCAAAT TCACGCGAAC CATTTGACCG CACTTCCCAT CGGAAATTCA GATCAATTAA AAGTGGGTGA TTTCGTCGTC GCCGTGGGAA GCCCTTTTGG CTTAACTCAA ACAGTCACTT CCGGCGTCAT TAGCGCCTTG AATCGCCAAG AACCGCGTAT CGATAATTTT CAAAGCTTTA TTCAAACCGA CGCGCCGATT AATCCCGGCA ATTCCGGCGG GGCTTTAATC GATTTAGAGG GCAAATTAAT TGGTATTAAT ACAGCGATTG TCACCCCGTC CGCGGGAAAT ATCGGCATCG GCTTTGCCAT TCCCAGCGAC ATGGTCAAAA GCGTGGCCGA ACAATTAATT AAATATGGAA AAGTCGAACG CGGCATGCTC GGCGTAACGG CTCAAAATAT TACCCCGGAA TTAGCGGACG CCCTAAATTT AAAACATAAC AAAGGAGCGC TGGTAACCAA AGTGGTTGCT GAAAGTCCAG CGGCTAAAGC CGGGGTTGAG GTGCAGGATA TTATTGAATC TGTCAACGGT ATTCGGATTC ATAGTTCAGC ACAACTCCAC AACATGCTCG GGCTGGTGCG TCCAGGAACT AAGATTGAAC TAACCGTATT GCGCGACCAT AAGGTTCTGC CTATAAAAAC GGAAGTAGCC GATCCTAAAA AAGTGCTATT GCAACGCGAA CTGCCCTTCC TCGGCGGCAT GCGTATGCAG AAATTCAACG ACCTAGAGCC CGATGGCACT ATTTTGCAAG GTGTTTTAGT TACCGGCGTG GACGATAGCA GCGATGGAGC GCTCGGCGGG TTAGAGCCCG GCGATATCAT TATCAGTGCT AATGGCCAAT TAACGCCCAC GGTCGATGAG CTAATGAAAA TCGCTGAAGG CAAGCCAAAG GAGTTGTTAC TGAAAGTGGC GCGGGGCGCG GGACAATTAT TTTTAGTTAT CCAACAATCA CAATAA
|
Protein sequence | MKKLSKIILS SIFAGLPLLL PVSSYAHLPS AVEGKTIPSL APMLNKTTPS VVNIAVEKLI PQTPNPLQPE MDQNTAPTKV LGVGSGVIID AKKGYIVTNA HVVKDQKIMV VTLKDGRRYR AKVIGKDEGF DLAVIQIHAN HLTALPIGNS DQLKVGDFVV AVGSPFGLTQ TVTSGVISAL NRQEPRIDNF QSFIQTDAPI NPGNSGGALI DLEGKLIGIN TAIVTPSAGN IGIGFAIPSD MVKSVAEQLI KYGKVERGML GVTAQNITPE LADALNLKHN KGALVTKVVA ESPAAKAGVE VQDIIESVNG IRIHSSAQLH NMLGLVRPGT KIELTVLRDH KVLPIKTEVA DPKKVLLQRE LPFLGGMRMQ KFNDLEPDGT ILQGVLVTGV DDSSDGALGG LEPGDIIISA NGQLTPTVDE LMKIAEGKPK ELLLKVARGA GQLFLVIQQS Q
|
| |