Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0820 |
Symbol | |
ID | 5135919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 832601 |
End bp | 833812 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640532278 |
Product | putative trypsin |
Protein accession | YP_001216770 |
Protein GI | 147673040 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5640] Secreted trypsin-like serine protease |
TIGRFAM ID | [TIGR03501] gammaproteobacterial enzyme C-terminal transmembrane domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000000898339 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCATCAGG TTTCTAAATT GCTGTCGTGT TTTATCGGTT TTTCTCTTTT CTCTACACTG CTCTATGCGG AATCCACAGC AGATATTTCA TCTCGTATTA TTAATGGTTC GAATGCAAAC TCTGCCGAAT GGCCATCGAT TGTGGCTCTG GTTAAACGCG GTGCAGATGC TTATCAAGGG CAATTTTGTG GCGGTAGTTT TTTAGGTGGA CGATATGTCT TAACCGCTGC TCACTGTTTT GATAGTCGTA GTGCTGCTAG TGTGGATGTG ATTATTGGTG CTTATGATCT GAATAACTCT TCTCAAGGAG AAAGGATTGC CGCGCAAAAA ATTTACCGTC ATTTGAGTTA TAGTCCCAGT AACTTATTGA ATGACATTGC GATTGTCGAG TTAGCACAAA CCAGTAGTTT GCCTGCCATC ACTTTAGCGG GTCCTGCAAC ACGCACGTCT TTACCAGCAT TAACCCCTTT AACAGTGGCT GGATGGGGGA TTACCGTGCA ATCAAAACCG CCACAATTTA CTCCGATACT ACAAGAAGTG GATGTTGACC TGGTTTCGCA ATCGCTTTGC CAAATAGTCA TGCAGCATGG AATTTCCTCT GATCCTAATT CGACCAACTT TTGTGCCGCT CGTCTTACGA AAGATTCATG TCAGGGAGAT TCTGGTGGCC CAATTGTAGT TAAAACGGGT CGAGAGCAGC TGGGTATTGT GAGTTGGGGG GATGAACAAT GCGCTAAAAC AGGCACTTAT GGTGTGTATA CCAACGTCAG TTATTTTCGT GATTGGATCA CAAAGCACAC TAACCAGCTT AGCTATGATC AAGTCGCCAA TCTCGGTATT CGTCCATTAG GTAAAGTGAG TCAATCATTT ACATATACCA ACCTTGATGC TAATGCGCTG ACCTATACAG GAAACACATT CTCAAGTCTG CCCGCTGATT TTAGTGTGCT ATCCGATGGG TGCAGTACTA AAGTGACGTT AGCCACAGGT GAAAGTTGTA GTGTAGAAGT GGCTGTTGAT GCGCAACACT ACCGTCAATA TCAGTATGAC TTTGAGTTGA TCTTTAGCTA TGCGGGAGGT TCTAAACGTG CAACGTCTCG TATTCAACTG GATACTTCGC CTTTTGCCCC AAGTGCCTCT TCTGGCGGTT CCATCGGCTG GTTTGGTTTG CTGCTTTTGG CTCCATTGTG GATGAGACGG AAAACCGCTT GA
|
Protein sequence | MHQVSKLLSC FIGFSLFSTL LYAESTADIS SRIINGSNAN SAEWPSIVAL VKRGADAYQG QFCGGSFLGG RYVLTAAHCF DSRSAASVDV IIGAYDLNNS SQGERIAAQK IYRHLSYSPS NLLNDIAIVE LAQTSSLPAI TLAGPATRTS LPALTPLTVA GWGITVQSKP PQFTPILQEV DVDLVSQSLC QIVMQHGISS DPNSTNFCAA RLTKDSCQGD SGGPIVVKTG REQLGIVSWG DEQCAKTGTY GVYTNVSYFR DWITKHTNQL SYDQVANLGI RPLGKVSQSF TYTNLDANAL TYTGNTFSSL PADFSVLSDG CSTKVTLATG ESCSVEVAVD AQHYRQYQYD FELIFSYAGG SKRATSRIQL DTSPFAPSAS SGGSIGWFGL LLLAPLWMRR KTA
|
| |