Gene Information |
Locus tag | Meso_0901 |
Symbol | |
ID | 4182732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chelativorans sp. BNC1 |
Kingdom | Bacteria |
Replicon accession | NC_008254 |
Strand | + |
Start bp | 991362 |
End bp | 992891 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638066780 |
Product | protease Do |
Protein accession | YP_673463 |
Protein GI | 110633255 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
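The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 991362
end_bp = 992891
protein_length_aa = 509

# Assumption: coordinates are 1-based and inclusive,
# so the gene spans end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)

print(gene_length_bp)  # 1530
print(remainder)       # 0
print(codons - 1)      # 509
```

Under those assumptions the computed values agree with the listed Gene Length (1530 bp) and Protein Length (509 aa).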
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.24208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAG AGCAGAACGT AACCTCCAAA AAAATAACTT CCAAGACCCG CAATCGCCTC GTGGCGGCCG TGGCCTCTGC GGCTATAGCC GGTGCGGTCG GTTTTAGCGC AATAAGTTCG GGCACCGTCC CGGTGTTCGC TGAGCCGGTG AAGCTGGACA GGCCCGTTCA GGCGCCCAGT TTTGCCGATG TGGTCGATGT CGTCACACCC GCGGTGGTGA GCGTGCGCAT TTCGGGTGAA GCGCCGGAGA CGGGCACGGT GGCGACGCCG TTCTTCGACA TGCCGGGCTT TGACAATCTT CCGCCGAACC ATCCGTTGCG CCGTTTCTTC GATGAGTTCC AGAGGCGTGA CGAGCAGCCG CGCGGCCCGC AGCAGCGCCG GCGCACCCGG CCGGTTGCCC AAGGTTCCGG GTTCTTCATC TCCGGTGACG GCTATATCGT CACCAACAAC CATGTGGTCG AAGGTGGAAA TCAGTACACG GTCGTTCTGG AGGACGGCAC CGAGCTGCAG GCCGATCTGA TCGGCAAGGA CCCCCGCACC GATCTGGCGG TACTCAAGGT TAAAGCCGAC CGCGAGTTCA CCTATGTGAC CTGGGCCAAT GACGATGCGG CACGCGTGGG CGACTGGGTG GTTGCCGTGG GCAACCCGTT CGGGCTCGGC GGTACGGTGA CGGCGGGCAT CGTTTCGGCC CGAGGCCGTG AGATCGGCGC CGGGAATTAC GATGACTTCC TGCAGATCGA CGCGGCGGTG AATCAGGGCA ACTCCGGCGG GCCGACCTTC AACCTCGCCG GTGAGGTGGT TGGCGTCAAC ACCGCGATTT TCTCGCCCTC CGGCGGCAAT GTCGGCATTG CCTTCGCAAT TCCGGCTTCC CTTGCCAAGA CGGTCGTAGA GCAGCTTATC GAGGATGGTT CCGTCGAGCG CGGCTATATG GGAGTCAGCA TCCAGGAGGT GACACCCCAG ATCGCGGAGT CGCTTGGCCT GCAGGACGCA AGGGGCGCGC TGATCAACAG CGCGAACAGC GGCGATCCCG CAGCCAGGGC AGGTATTCAA GCCGGTGACG TCGTTATCGC AGTGAACGGC AAGACCATCG CCACCCCCCG CGAGTTGGCT CGCACGGTGG CGGCTATCCA GCCTGGCACC GAGATCGATG TTACGGTGTG GCGCAACGGT AAGTCGCAGG ATTTTTCCGT GACGTTGACG GAGATGCCTG ACACTGACCA GGTCGCTTCG GCGGCGCCCA GCGCGCCCTC GCAGGATGAG GGCGGTCAGC CGGCAAGCTT TGGATTGACG GTTGCACCGG CCGATGACGG CAATGGCCTC GTCGTCACCG ATGTCGAGCC GGGCAGCGTG GCGGAGGAGA ACGGAATCCA GCCGGGAGAC GTGATCCGCG CCATCAATTC TCAGCCCGTA ACAACGGCGG GCGACCTCAA GAAGGCTGCT GATGCGGCAT CGAGTGCCGG CCGTGGCGCA GTGCTCCTGC AGGTCGTGCG TGATGACGCT AACCGCTTCG TCGCGCTGCC AATTGGCTGA
|
Protein sequence | MSEEQNVTSK KITSKTRNRL VAAVASAAIA GAVGFSAISS GTVPVFAEPV KLDRPVQAPS FADVVDVVTP AVVSVRISGE APETGTVATP FFDMPGFDNL PPNHPLRRFF DEFQRRDEQP RGPQQRRRTR PVAQGSGFFI SGDGYIVTNN HVVEGGNQYT VVLEDGTELQ ADLIGKDPRT DLAVLKVKAD REFTYVTWAN DDAARVGDWV VAVGNPFGLG GTVTAGIVSA RGREIGAGNY DDFLQIDAAV NQGNSGGPTF NLAGEVVGVN TAIFSPSGGN VGIAFAIPAS LAKTVVEQLI EDGSVERGYM GVSIQEVTPQ IAESLGLQDA RGALINSANS GDPAARAGIQ AGDVVIAVNG KTIATPRELA RTVAAIQPGT EIDVTVWRNG KSQDFSVTLT EMPDTDQVAS AAPSAPSQDE GGQPASFGLT VAPADDGNGL VVTDVEPGSV AEENGIQPGD VIRAINSQPV TTAGDLKKAA DAASSAGRGA VLLQVVRDDA NRFVALPIG
|
| |
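The gene and protein sequences above can be checked against each other, and against the listed GC content, with a few lines of code. The following is a minimal sketch, not the tool used to build this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as starts), and it stops at the first stop codon.

```python
# Codon table built in NCBI order: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGCGAAG AGCAGAACGT AACCTCCAAA"
print(translate(first_blocks))  # MSEEQNVTSK
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the listed GC content.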