Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1256 |
Symbol | |
ID | 5137601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1330081 |
End bp | 1331700 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640532714 |
Product | collagenase |
Protein accession | YP_001217200 |
Protein GI | 147675061 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000029967 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACAAG TGATTGCTCG CTATCCTTTA GGGAGTAAAC ACGACAAATT GTGGTTAGCT GCCGTTGAGA TGTTGCATTA TTACGCACCT GAGGTGCTGC AACAGCTCGG TATCGATCTT GATGCAGCGA AGCGCGATTT AGCCGCGCGC ATTTTGCCAA ACCGCTTTGA ATGCCAAGGC CCTGCCATTA TTCGCTCGCA AGATCTTTCC GATGCGCAAG CCGCGCAAGC CTGTGACGTA TTGGATAAAA AAGAGCAAGA TTTTCACCAA GTGGCAAATA CGGGATTGGC ACCGGTCGCT GATGATTACA ACACTCGCGT TGAGGTGGTG GTGTTTGCCA ATAACAGCAG CTATGTCAAT TACTCCTCGT TTTTATTTGG CAATACCACC GACAATGGCG GGCAGTATTT AGAAGGCAAC CCTGCGGATC AAAACAATCA AGCCCGTTTT GTCGCCTATC GTTATGCCAA TGATGCGGAT TTGTCGATTC TCAATTTAGA GCATGAGTAC ACTCATTATC TGGACGCGCG CTTCAACCAG TATGGTTCGT TCAGTGATAA CTTAGCCCAT GGTCATATTG TGTGGTGGTT AGAAGGTTTC GCGGAATATA TGCATTATAA GCAGGGCTAC CAAGCGGCGG TTAAGCTGAT TTCTCAAGGT AAATTGAGCC TTTCTGATGT GTTTGCTACC ACCTACTCGA ATGATACGAA CCGCATTTAT CGCTGGGGCT ACTTAGCGGT ACGCTTTATG TTGGAAAAGC ATCCACAAGA CGTGGAAAGC TTGCTGGCAC TCTCTCGTAC TGGGCAGTTT GATCAGTGGG CACAGAGTGT CAAATTACTG GGTGAACGCT ATAACACCGA GTTTTCGGCG TGGCTGGATA CATTGCAACG CGATAATCCA GATAATCCAG ATAATCCAGA ACAGCCTAAC CCAGAACCCA ATGCGGTCAC TCAGTTAGCG GCCAATTCGT CCTTAACACT GACAGGAAAG GCGTACAGCG AGCATCTGTT CTATGTGGAT GTGCCAGAGT ACAGCCGAGA GTTCCACGTT CAGATCTCCG GGGAGGGAGA TGCCGATCTG TACATGAGCT ACCAGCAGGT TGCCCATTAC TATGATTATC AGGTAACGGA ATTTACCTAT GGCAGCAACG AGCAAATTAC CTTCAAACCT GAGCAGAATG GTTACATCAA ACCCGGACGC TACTATTTAA GTGTGACAGG ACGTGCGGAT TACTCAGCAG TCATCCTCAA CACACACCTA GTCACTGAAC AGCCTAATGA GCAACCGACT ATCAAAGATG ATCTTGCTCC TGTGCTTTTA GAAGCGGGTA ACAGTCAAAG CCTTACCGTG CATCGACAAC GTTACGTCGC CATCTACGTG CCAAAAGGCG TCAGTGAAGT ACAAGTATGG CTCACCGCTT CAGAACAAAA CCGAGGCAAT GTCGATCTGT TTGCAGCAAA AGCGTATTGG CCAACTCGAG AGCAGTTTGA GCACGCCTCA ACTGGTGCGG GCAGTCATGA GTATTTGCGT ATTCCTGTGA CACAAGAAGG CTATGTGCAC TTTTCGTTAA ACGCACAACA ACTCGGCGAC ACGGTCGAAA TGGTGGCCTA TTTCGACTGA
|
Protein sequence | MRQVIARYPL GSKHDKLWLA AVEMLHYYAP EVLQQLGIDL DAAKRDLAAR ILPNRFECQG PAIIRSQDLS DAQAAQACDV LDKKEQDFHQ VANTGLAPVA DDYNTRVEVV VFANNSSYVN YSSFLFGNTT DNGGQYLEGN PADQNNQARF VAYRYANDAD LSILNLEHEY THYLDARFNQ YGSFSDNLAH GHIVWWLEGF AEYMHYKQGY QAAVKLISQG KLSLSDVFAT TYSNDTNRIY RWGYLAVRFM LEKHPQDVES LLALSRTGQF DQWAQSVKLL GERYNTEFSA WLDTLQRDNP DNPDNPEQPN PEPNAVTQLA ANSSLTLTGK AYSEHLFYVD VPEYSREFHV QISGEGDADL YMSYQQVAHY YDYQVTEFTY GSNEQITFKP EQNGYIKPGR YYLSVTGRAD YSAVILNTHL VTEQPNEQPT IKDDLAPVLL EAGNSQSLTV HRQRYVAIYV PKGVSEVQVW LTASEQNRGN VDLFAAKAYW PTREQFEHAS TGAGSHEYLR IPVTQEGYVH FSLNAQQLGD TVEMVAYFD
|
| |