Gene VC0395_0371 details

Gene Information

Locus tag: VC0395_0371
Symbol: hap
ID: 5134128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 405930
End bp: 407759
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 50%
IMG OID: 640530694
Product: hemagglutinin/protease
Protein accession: YP_001215212
Protein GI: 147671703
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
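
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(405930, 407759)
print(length, protein_length(length))  # prints: 1830 609
```

Both values agree with the Gene Length and Protein Length fields listed in the record.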


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGA TACAACGTCC TCTGAATTGG TTAGTTCTGG CCGGAGCGGC AACTGGCTTC 
CCTCTCTATG CGGCACAAAT GGTCATGATT GATGATGCAT CAATGGTTGA ACAAGCGTTG
GCGCAGCAAC AGTACAGTAT GATGCCTGCC GCCAGCGGTT TTAAAGCCGT CAATACGGTA
CAGTTGCCGA ATGGTAAGGT GAAAGTGCGT TACCAGCAGA TGTACAACGG GGTTCCTGTC
TATGGCACCG CTGTGGTGGC AACCGAATCC AGTAAAGGGA TTTCGCAAGT GTATGGTCAA
ATGGCTCAGC AGTTGGAAGC CGATCTCTCA ACCGTGACCC CTGACATTGA AAGCCAGCAG
GCCATCGCTT TAGCGGTTAG CCATTTTGGT GAACAACACG CTGGAGAATC GCTCCCGGTG
GAAAACGAAA GTGTGCAACT GATGGTACGT TTGGATGATA ACCAACAGGC TCAGTTAGTG
TACTTGGTCG ACTTTTTTGT CGCTTCAGAA ACACCTTCGC GTCCGTTCTA CTTTATCAGT
GCAGCAACGG GAGAAGTGCT AGACCAATGG GATGGCATTA ACCACGCACA GGCAACAGGA
ACCGGCCCCG GCGGTAACCA AAAAACGGGA CGTTATGAAT ACGGCAGTAA CGGTTTACCC
GGTTTCACGA TTGATAAGAC CGGAACCACC TGTACGATGA ATAACAGTGC GGTAAAAACC
GTTAACCTCA ATGGCGGCAC CTCGGGTAGC ACGGCGTTCA GTTATGCTTG TAACAACAGC
ACTAACTACA ACAGTGTTAA AACAGTGAAT GGTGCTTACT CACCGCTAAA CGACGCGCAC
TTCTTCGGAA AAGTGGTGTT TGATATGTAT CAGCAGTGGT TGAATACTTC GCCGCTGACT
TTCCAATTAA CCATGCGTGT GCACTATGGC AATAACTATG AAAATGCCTT CTGGGATGGC
CGCGCCATGA CTTTTGGTGA TGGCTATACC CGTTTCTATC CTTTGGTGGA TATCAACGTT
AGTGCCCATG AGGTCAGCCA CGGTTTTACT GAGCAGAATT CAGGCCTCGT TTACCGAGAT
ATGTCCGGTG GTATTAACGA AGCATTCTCG GATATCGCAG GGGAAGCGGC AGAGTACTTT
ATGCGTGGCA ATGTTGACTG GATTGTCGGC GCGGATATTT TTAAATCCTC CGGTGGCCTA
CGTTATTTCG ATCAGCCGTC ACGTGATGGC CGATCGATAG ATCATGCTTC ACAGTATTAC
AGCGGTATTG ATGTTCACCA TTCGAGTGGC GTGTTTAACC GCGCGTTTTA CCTACTCGCC
AATAAATCGG GTTGGAACGT ACGTAAAGGT TTTGAAGTGT TTGCCGTGGC TAACCAGTTG
TACTGGACAC CGAACAGCAC TTTTGATCAA GGTGGCTGTG GGGTAGTGAA AGCGGCGCAG
GATCTCAACT ACAACACCGC AGACGTCGTG GCAGCCTTTA ATACCGTGGG TGTCAATGCT
TCTTGTGGCA CCACGCCACC ACCTGTCGGC AAAGTGCTTG AGAAAGGTAA ACCGTTCACA
GGACTGAGCG GCTCACGTGG AGGAGAAGAT TTCTATACCT TCACTGTGAC CAATTCAGGC
AGTGTTGTTG TGTCCATCAG TGGTGGAACG GGCGATGCGG ATCTGTATGT CAAAGCAGGC
AGCAAACCCA CCACTTCTTC TTGGGATTGT CGTCCATACC GTTCAGGCAA TGCCGAGCAG
TGTTCCATCT CTGCGGTCGT GGGTACGACA TACCATGTCA TGTTACGCGG TTACAGTAAC
TATTCTGGTG TGACGTTACG CTTGGACTAA
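
The gene sequence is printed in space-separated blocks of ten bases, so any downstream calculation has to strip the whitespace first. As a minimal sketch, the hypothetical helpers below (not part of the record) join the blocks and compute the GC fraction, which is the quantity the GC content field reports. How the database rounds that value is not stated, so no rounding is applied here.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, pasted as displayed.
first_line = clean_sequence(
    "ATGAAAATGA TACAACGTCC TCTGAATTGG TTAGTTCTGG CCGGAGCGGC AACTGGCTTC"
)
print(len(first_line))
print(gc_fraction(first_line))
```

Passing the full 1830 bp sequence through the same two functions gives the gene-wide GC fraction to compare against the reported value.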
 
Protein sequence
MKMIQRPLNW LVLAGAATGF PLYAAQMVMI DDASMVEQAL AQQQYSMMPA ASGFKAVNTV 
QLPNGKVKVR YQQMYNGVPV YGTAVVATES SKGISQVYGQ MAQQLEADLS TVTPDIESQQ
AIALAVSHFG EQHAGESLPV ENESVQLMVR LDDNQQAQLV YLVDFFVASE TPSRPFYFIS
AATGEVLDQW DGINHAQATG TGPGGNQKTG RYEYGSNGLP GFTIDKTGTT CTMNNSAVKT
VNLNGGTSGS TAFSYACNNS TNYNSVKTVN GAYSPLNDAH FFGKVVFDMY QQWLNTSPLT
FQLTMRVHYG NNYENAFWDG RAMTFGDGYT RFYPLVDINV SAHEVSHGFT EQNSGLVYRD
MSGGINEAFS DIAGEAAEYF MRGNVDWIVG ADIFKSSGGL RYFDQPSRDG RSIDHASQYY
SGIDVHHSSG VFNRAFYLLA NKSGWNVRKG FEVFAVANQL YWTPNSTFDQ GGCGVVKAAQ
DLNYNTADVV AAFNTVGVNA SCGTTPPPVG KVLEKGKPFT GLSGSRGGED FYTFTVTNSG
SVVVSISGGT GDADLYVKAG SKPTTSSWDC RPYRSGNAEQ CSISAVVGTT YHVMLRGYSN
YSGVTLRLD