Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_1733 |
Symbol | |
ID | 6199643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 1960109 |
End bp | 1961677 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641705724 |
Product | protease Do |
Protein accession | YP_001832852 |
Protein GI | 182678706 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.613226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00274873 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTTTGG CCCGCAGGTT CCTTCCCGTT TCCTTCCCCC TACCGGTACG ACCCGCGCGA GGCCGGGCGC TGCGCTTCGG CTCAAGCCTC GCACTGAGCC TCACTTCGGG ACTCATGCTG GCCACGACCT CTTTGCCGCC TTTTACCTTG GTCGAGGCAC AAGCGCGCGG ACCAGAATCC CTGGCCGATC TCTCGGCTGC CGTCAGCGAT GCGGTCGTCA ATATTTCCGC GACGCAGACC GTGGACGAAA AGCGCCCCGG CAATGGGCCA CAGCTTGAGC CAGGCACGCC GTTCGACGAT CTGTTCGAGG AATTTTTCCG TCGCCGGCAG CAGCGTGGCG GTGGTGGGGG TGGCGGTGCC GTTCCGCGCC CGCATGAGCG CCGCTCGAAT TCGCTGGGCT CTGGCTTCGT CATCGATCCG GCCGGCATCA TCATCACCAA TAATCACGTC ATCGCCGATG CCAATGAGGT GACGGTCATC TTTACCGATG GTCAAAAGCT CAAGGCGCAG GTCCTCGGCA AGGATGCGAA GGTCGATGTG GCTGTCCTCA AGGTCAAGCC GGAGAAGCCC CTGAAGGCCG TTAAATTCGG CGACAGTGAA GTCATGCGGG TCGGTGATTG GGTCATTGCC GTCGGCAATC CCTTCGGGCT CGGCGGCACG GTGACGGCGG GCATTGTTTC GGCGCGCAAC CGCAACATCG ATTCCGGCCC TTACGACAAT TATATCCAGA CCGATGCAGC CATCAACAAG GGGAATTCCG GTGGCCCCTT GTTCAATATG GCCGGTGAGG TGATCGGTAT TAATACAGCG ATCCTGTCGC CCTCCGGCGG CTCGATCGGC ATCGGTTTCG CGACGCCTGC GTCCATGGTC GTTCCGATTG TGGAGCAATT GCAGAAATTT GGCGAAACAC GCCGGGGCTG GCTTGGCGTG CGCATCCAGA ATGTCGATGA CACCATCGCC GAAAGCCTCA ATCTCGGCAC GGCGCGCGGC GCGTTGATCG CCGGCGTGGA CGACAAGGGA CCGGCCAAGT CGGCTGGCCT CTCCGCTGGC GACGTCATCG TCAAATTCGA TGGCACTGCA ATCAAGGAAT CGCGTGATCT GCCCAAGCTC GTCGCCATGG CCGCCGTCGG CAAGGACGTG CCGGTCGTCC TGATCCGTCA GGGCAAGGAG ATCACCAAAA ACGTGAAGCT CGGCCGTCTC GAGGACAATG AGAAACAGGC CTCGCTCAAT ACGCCCGATC GCGACGAAAG CGCGCTCGGC TCAGCGAGCC AGCGTGTCCT GGGGATGAGC CTGTCTGCGC TGACCGACGA AGCTCGGCAG AAATTCGCGA TCCGCGACAC TGTCGCATCG GGCGTCGTCA TCACCGATGT CGATCCTGAA TCGGCGGCGG CGGAAAAACA TATTCAGCCG GGTGAACTGG TCGTCGAAAT CAATCAGGAC GCGGTGAAGA CACCGGCCGA GATCAACAAG AAATTGCAGA GCCTGAAGGA GCAAGGCAAG AAATCAGCTT TGCTGCTCGT CTCCAACGGA CAGGGTGAAG TGCGTTTCGT CGCTCTTGCC ATTCCTTGA
|
Protein sequence | MGLARRFLPV SFPLPVRPAR GRALRFGSSL ALSLTSGLML ATTSLPPFTL VEAQARGPES LADLSAAVSD AVVNISATQT VDEKRPGNGP QLEPGTPFDD LFEEFFRRRQ QRGGGGGGGA VPRPHERRSN SLGSGFVIDP AGIIITNNHV IADANEVTVI FTDGQKLKAQ VLGKDAKVDV AVLKVKPEKP LKAVKFGDSE VMRVGDWVIA VGNPFGLGGT VTAGIVSARN RNIDSGPYDN YIQTDAAINK GNSGGPLFNM AGEVIGINTA ILSPSGGSIG IGFATPASMV VPIVEQLQKF GETRRGWLGV RIQNVDDTIA ESLNLGTARG ALIAGVDDKG PAKSAGLSAG DVIVKFDGTA IKESRDLPKL VAMAAVGKDV PVVLIRQGKE ITKNVKLGRL EDNEKQASLN TPDRDESALG SASQRVLGMS LSALTDEARQ KFAIRDTVAS GVVITDVDPE SAAAEKHIQP GELVVEINQD AVKTPAEINK KLQSLKEQGK KSALLLVSNG QGEVRFVALA IP
|
| |