Gene Information |
Locus tag | Bind_2235 |
Symbol | |
ID | 6201155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 2562957 |
End bp | 2564546 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641706224 |
Product | protease Do |
Protein accession | YP_001833342 |
Protein GI | 182679196 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
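The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2562957, 2564546)
print(length, protein_length(length))  # prints: 1590 529
```

Under these assumptions the computed values match the Gene Length (1590 bp) and Protein Length (529 aa) fields of the record.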
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.355037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAAT TGGAATTTCC CCCGCAGGAT AAAGGCGCCG CACGGAACGC CGCCACGCCA CGCTCGCAGC GTCGGCCGGG GATGCGCGCC ATCCTGCTCG GCGCCACAGC GGCCGTGGCC CTGACCGGTG CCTTCACGCA TTCCGTCCTG CTGCCGCAAG CGGCCAATGC CGAAACACCC ACCCTGAACG TGCCGGTGAA TGCGCCGAAT AGCAGCCCAG TCGTCGGACC GGTCTCTTTT GCCGATGTGG TCGACCATGT GCGGGGGGCT GTCGTTTCGG TCAAGGTCAA GATTACCGAA ACCGCCGATA ATGAAGAGGC CAATACCGGA AACGACATGC CTCAATTCGC CCCGGGCGAT CCCCTGGAGC GTTTCTTCCG CCGCTTTGGC GAACAAGGAG GCGTCCCCTT CAACAAACAT AGCGGCAAGC CACGGACCGG CCAGGCGCAG GGTTCGGGAT TCATCATTTC GAGCGATGGC TATGTCGTCA CCAATAATCA TGTCGTCGAA AACGCGACAG AGGTCAGCCT GACGACCGAT GGCGGTCAGA CCTTGACCGC GAGCGTGGTT GGCACCGACA AGAAGACTGA TCTCGCTCTC TTGAAGATCA ATGGCTCGGG CACCTATCCC TTCGTCAAAT TCTCCAACGA GACACCCCGT GTCGGCGAAT GGGTCATCGC TGTCGGCAAT CCTTTCGGTC TCGGCGGCAC GGTGACGGCA GGCATTATTT CAGCGCGCGG CCGCGATATC GGCGCCGGCC CCTATGACGA CTTCCTGCAG GTCGACGCCC CGGTCAATCG CGGCAATTCC GGTGGCCCGA CCTTCAACGC CAAGGGCGAC GTGGTCGGCG TCAATACGGC GATCTTCTCA CCGTCCGGCG GCAGCGTCGG CATCGGCTTC GCCATTCCCG CGGAGGTTGC GCAAAACGTC ATCACCTCCT TGCGGGAGAA AGGCACGGTC GCGCGCGGTT GGATCGGCGT CCAGATTCAG CCTGTGACAG CGGAAATCGC CGATAGTCTC GGCCTGAAAA CCAGCAAGGG CGCCCTGGTT GCCGAGGCAC AGCCGAATTC TCCCGCGCTC TCGGCCGGTA TCCGCTCCGG TGACGTGATC CTCGGCGTCG ATGGCGAACG CATCGATGGT CCGCGCGAAC TGGCCCGCAA GATAGCGGCG CTCGGCCCTG GCAAGAGCAC CAATCTCATG TATTGGCACG ATGGCTCGGA AAAGACCGTC GCGGTGAAAC TCGGCAATCT GCCAAATGAC AAGGAAGCCA AGGCGGACAT CACGACACGC CCCGATAAAA ACGTCCTCGG CGATCTCGGT CTGACGCTCG CCCCGGCGGC GCAGGTCCCC GGCGCCGGCG ATGAAGGTGT AGTCGTCTCC GACATCGATC CCGATGGCGT TGCCGCACAA AAGGGTTTGC GTGTCGGTGA TGTCATTCTC GAAGCCGGTG GGCACGCTGT CAGCCGTCCG GCCGAAATCG GCGCGACCTT GAGCACCGCC AAGAAAGATG GCCGCAAGGC CGTGCTCATG CGTGTCAAGA ATCGGGAAGG CACCCGCTAC GTCGCGCTTG CGACCACTCC GGCTTCCTGA
|
Protein sequence | MPQLEFPPQD KGAARNAATP RSQRRPGMRA ILLGATAAVA LTGAFTHSVL LPQAANAETP TLNVPVNAPN SSPVVGPVSF ADVVDHVRGA VVSVKVKITE TADNEEANTG NDMPQFAPGD PLERFFRRFG EQGGVPFNKH SGKPRTGQAQ GSGFIISSDG YVVTNNHVVE NATEVSLTTD GGQTLTASVV GTDKKTDLAL LKINGSGTYP FVKFSNETPR VGEWVIAVGN PFGLGGTVTA GIISARGRDI GAGPYDDFLQ VDAPVNRGNS GGPTFNAKGD VVGVNTAIFS PSGGSVGIGF AIPAEVAQNV ITSLREKGTV ARGWIGVQIQ PVTAEIADSL GLKTSKGALV AEAQPNSPAL SAGIRSGDVI LGVDGERIDG PRELARKIAA LGPGKSTNLM YWHDGSEKTV AVKLGNLPND KEAKADITTR PDKNVLGDLG LTLAPAAQVP GAGDEGVVVS DIDPDGVAAQ KGLRVGDVIL EAGGHAVSRP AEIGATLSTA KKDGRKAVLM RVKNREGTRY VALATTPAS
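The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before any calculation. The sketch below is one way to recompute the GC content and run a basic sanity check on the gene sequence. It assumes the standard stop codons of translation table 11 (TAA, TAG, TGA); the helper names are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def ends_like_complete_cds(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    s = clean(seq)
    return len(s) >= 6 and len(s) % 3 == 0 and s[-3:] in stop_codons


# First two blocks of the gene sequence in this record, used only as a demo input.
print(gc_percent("ATGCCGCAAT TGGAATTTCC"))
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare with the reported GC content, and `ends_like_complete_cds` checks that the sequence ends in a stop codon in frame. The start codon is deliberately not checked, because translation table 11 allows alternative start codons.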