Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2341 |
Symbol | |
ID | 5135501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2494801 |
End bp | 2496408 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640533793 |
Product | alkaline serine protease |
Protein accession | YP_001218241 |
Protein GI | 147674600 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00532226 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAA AGTTTTTAAG CTTATGCATT GTTTCGACGT TTTCCGTCGC AGCAACCTCA GCACTTGCCC AACCCAATCA GCTTGTTGGC AAATCATCTC CTCAACAATT AGCACCATTG ATGAAAGCCG CTTCAGGGAA AGGCATTAAA AATCAATACA TTGTTGTACT CAAGCAACCA ACGACAATCA TGAGTAATGA TTTGCAAGCT TTCCAACAAT TTACTCAACG ATCTGTCAAT GCATTAGCCA ATAAGCATGC ACTCGAAATC AAAAATGTTT TTGATAGTGC GTTAAGCGGT TTTTCTGCAG AGCTAACGGC TGAGCAACTA CAAGCTCTTC GTGCCGATCC TAACGTCGAC TATATCGAGC AAAACCAAAT TATTACGGTT AATCCGATCA TCTCTGCATC AGCAAATGCG GCTCAAGATA ACGTAACCTG GGGAATCGAC CGCATCGATC AGCGCGATCT ACCGCTCAAT CGTAGTTATA ACTACAACTA TGATGGCAGT GGCGTTACCG CTTATGTGAT CGATACCGGT ATTGCTTTCA ACCATCCTGA ATTTGGTGGA CGAGCAAAAT CCGGCTATGA CTTTATCGAT AACGATAATG ATGCCAGTGA CTGCCAAGGC CACGGTACAC ATGTGGCAGG CACTATCGGG GGCGCTCAAT ACGGTGTAGC TAAAAACGTT AACTTAGTTG GTGTGCGAGT GCTGGGATGT GATGGTAGTG GTTCAACTGA AGCGATCGCC CGTGGTATTG ACTGGGTGGC TCAAAACGCG TCCGGCCCTT CTGTTGCCAA TTTGAGTTTA GGGGGAGGGA TATCTCAGGC GATGGATCAA GCAGTGGCTC GACTCGTCCA AAGAGGAGTC ACGGCGGTCA TTGCTGCTGG TAACGATAAT AAAGATGCTT GCCAAGTATC CCCTGCACGT GAACCAAGTG GTATCACTGT AGGTTCAACG ACCAATAATG ATGGTCGCTC TAACTTCTCG AACTGGGGTA ATTGTGTACA GATCTTCGCA CCAGGATCTG ACGTCACTTC GGCGTCGCAT AAAGGTGGCA CGACGACTAT GAGTGGTACG TCTATGGCCT CACCCCATGT AGCTGGTGTC GCAGCCTTGT ACTTACAAGA GAATAAGAAC CTCTCTCCGA ATCAAATCAA AACGCTGCTT AGCGACCGAT CAACCAAAGG CAAAGTCAGC GACACTCAAG GAACACCAAA TAAGCTGCTG TATAGTTTGA CCGATAACAA TACCACTCCG AATCCTGAGC CAAATCCGCA ACCAGAGCCA CAACCGCAGC CTGACAGCCA ATTGACTAAT GGTAAAGTGG TTACAGGCAT CAGTGGCAAG CAAGGTGAAT TGAAAAAATT CTATATTGAT GTGCCTGCAG GTCGTCGCTT GAGTATCGAG ACCAACGGAG GTACGGGTAA TCTTGATCTC TATGTTCGTC TAGGTATTGA GCCAGAACCG TTTGCTTGGG ATTGCGCATC TTATCGCAAC GGCAATAATG AAGTTTGTAC CTTCCCGAAT ACCCGAGAAG GTCGCCACTT CATTACACTC TATGGCACCA CTGAGTTTAA CAATGTCAGT TTGGTGGCTC GCTACTAA
|
Protein sequence | MFKKFLSLCI VSTFSVAATS ALAQPNQLVG KSSPQQLAPL MKAASGKGIK NQYIVVLKQP TTIMSNDLQA FQQFTQRSVN ALANKHALEI KNVFDSALSG FSAELTAEQL QALRADPNVD YIEQNQIITV NPIISASANA AQDNVTWGID RIDQRDLPLN RSYNYNYDGS GVTAYVIDTG IAFNHPEFGG RAKSGYDFID NDNDASDCQG HGTHVAGTIG GAQYGVAKNV NLVGVRVLGC DGSGSTEAIA RGIDWVAQNA SGPSVANLSL GGGISQAMDQ AVARLVQRGV TAVIAAGNDN KDACQVSPAR EPSGITVGST TNNDGRSNFS NWGNCVQIFA PGSDVTSASH KGGTTTMSGT SMASPHVAGV AALYLQENKN LSPNQIKTLL SDRSTKGKVS DTQGTPNKLL YSLTDNNTTP NPEPNPQPEP QPQPDSQLTN GKVVTGISGK QGELKKFYID VPAGRRLSIE TNGGTGNLDL YVRLGIEPEP FAWDCASYRN GNNEVCTFPN TREGRHFITL YGTTEFNNVS LVARY
|
| |