Gene Information |
Locus tag | VC0395_0371 |
Symbol | hap |
ID | 5134128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 405930 |
End bp | 407759 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640530694 |
Product | hemagglutinin/protease |
Protein accession | YP_001215212 |
Protein GI | 147671703 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGA TACAACGTCC TCTGAATTGG TTAGTTCTGG CCGGAGCGGC AACTGGCTTC CCTCTCTATG CGGCACAAAT GGTCATGATT GATGATGCAT CAATGGTTGA ACAAGCGTTG GCGCAGCAAC AGTACAGTAT GATGCCTGCC GCCAGCGGTT TTAAAGCCGT CAATACGGTA CAGTTGCCGA ATGGTAAGGT GAAAGTGCGT TACCAGCAGA TGTACAACGG GGTTCCTGTC TATGGCACCG CTGTGGTGGC AACCGAATCC AGTAAAGGGA TTTCGCAAGT GTATGGTCAA ATGGCTCAGC AGTTGGAAGC CGATCTCTCA ACCGTGACCC CTGACATTGA AAGCCAGCAG GCCATCGCTT TAGCGGTTAG CCATTTTGGT GAACAACACG CTGGAGAATC GCTCCCGGTG GAAAACGAAA GTGTGCAACT GATGGTACGT TTGGATGATA ACCAACAGGC TCAGTTAGTG TACTTGGTCG ACTTTTTTGT CGCTTCAGAA ACACCTTCGC GTCCGTTCTA CTTTATCAGT GCAGCAACGG GAGAAGTGCT AGACCAATGG GATGGCATTA ACCACGCACA GGCAACAGGA ACCGGCCCCG GCGGTAACCA AAAAACGGGA CGTTATGAAT ACGGCAGTAA CGGTTTACCC GGTTTCACGA TTGATAAGAC CGGAACCACC TGTACGATGA ATAACAGTGC GGTAAAAACC GTTAACCTCA ATGGCGGCAC CTCGGGTAGC ACGGCGTTCA GTTATGCTTG TAACAACAGC ACTAACTACA ACAGTGTTAA AACAGTGAAT GGTGCTTACT CACCGCTAAA CGACGCGCAC TTCTTCGGAA AAGTGGTGTT TGATATGTAT CAGCAGTGGT TGAATACTTC GCCGCTGACT TTCCAATTAA CCATGCGTGT GCACTATGGC AATAACTATG AAAATGCCTT CTGGGATGGC CGCGCCATGA CTTTTGGTGA TGGCTATACC CGTTTCTATC CTTTGGTGGA TATCAACGTT AGTGCCCATG AGGTCAGCCA CGGTTTTACT GAGCAGAATT CAGGCCTCGT TTACCGAGAT ATGTCCGGTG GTATTAACGA AGCATTCTCG GATATCGCAG GGGAAGCGGC AGAGTACTTT ATGCGTGGCA ATGTTGACTG GATTGTCGGC GCGGATATTT TTAAATCCTC CGGTGGCCTA CGTTATTTCG ATCAGCCGTC ACGTGATGGC CGATCGATAG ATCATGCTTC ACAGTATTAC AGCGGTATTG ATGTTCACCA TTCGAGTGGC GTGTTTAACC GCGCGTTTTA CCTACTCGCC AATAAATCGG GTTGGAACGT ACGTAAAGGT TTTGAAGTGT TTGCCGTGGC TAACCAGTTG TACTGGACAC CGAACAGCAC TTTTGATCAA GGTGGCTGTG GGGTAGTGAA AGCGGCGCAG GATCTCAACT ACAACACCGC AGACGTCGTG GCAGCCTTTA ATACCGTGGG TGTCAATGCT TCTTGTGGCA CCACGCCACC ACCTGTCGGC AAAGTGCTTG AGAAAGGTAA ACCGTTCACA GGACTGAGCG GCTCACGTGG AGGAGAAGAT TTCTATACCT TCACTGTGAC CAATTCAGGC AGTGTTGTTG TGTCCATCAG TGGTGGAACG GGCGATGCGG ATCTGTATGT CAAAGCAGGC AGCAAACCCA CCACTTCTTC TTGGGATTGT CGTCCATACC GTTCAGGCAA TGCCGAGCAG TGTTCCATCT CTGCGGTCGT GGGTACGACA TACCATGTCA TGTTACGCGG TTACAGTAAC 
TATTCTGGTG TGACGTTACG CTTGGACTAA
|
Protein sequence | MKMIQRPLNW LVLAGAATGF PLYAAQMVMI DDASMVEQAL AQQQYSMMPA ASGFKAVNTV QLPNGKVKVR YQQMYNGVPV YGTAVVATES SKGISQVYGQ MAQQLEADLS TVTPDIESQQ AIALAVSHFG EQHAGESLPV ENESVQLMVR LDDNQQAQLV YLVDFFVASE TPSRPFYFIS AATGEVLDQW DGINHAQATG TGPGGNQKTG RYEYGSNGLP GFTIDKTGTT CTMNNSAVKT VNLNGGTSGS TAFSYACNNS TNYNSVKTVN GAYSPLNDAH FFGKVVFDMY QQWLNTSPLT FQLTMRVHYG NNYENAFWDG RAMTFGDGYT RFYPLVDINV SAHEVSHGFT EQNSGLVYRD MSGGINEAFS DIAGEAAEYF MRGNVDWIVG ADIFKSSGGL RYFDQPSRDG RSIDHASQYY SGIDVHHSSG VFNRAFYLLA NKSGWNVRKG FEVFAVANQL YWTPNSTFDQ GGCGVVKAAQ DLNYNTADVV AAFNTVGVNA SCGTTPPPVG KVLEKGKPFT GLSGSRGGED FYTFTVTNSG SVVVSISGGT GDADLYVKAG SKPTTSSWDC RPYRSGNAEQ CSISAVVGTT YHVMLRGYSN YSGVTLRLD
|
| |
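The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the source record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon (the gene sequence above ends in TAA). The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table of this record.
START_BP = 405930
END_BP = 407759
GENE_LENGTH_BP = 1830
PROTEIN_LENGTH_AA = 609


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("gene length consistent:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length consistent:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: the interval 405930 to 407759 spans 1830 bp, and 1830 bp corresponds to 610 codons, that is 609 residues plus the stop codon.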