Gene VC0395_A0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0100 
SymbolhtrA 
ID5135611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp90423 
End bp91793 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content48% 
IMG OID640531560 
Productprotease DO 
Protein accessionYP_001216065 
Protein GI147673156 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000901293 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA AACCTTTACT TGTTTTAACT GCTCTGTCTC TTAGTTTGAG CGCGATTCTC 
TCGCCTTTGC CTGCAACTGC AGCGCTTCCT CTCTCAGTCA ATGGAGAGCA GATTCCTAGC
CTAGCCCCCA TGCTTGAAAA AGTCACACCC GCCGTGGTGA GCATTGCTGT GGAAGGGACT
CAAGTTTCAA GACAGCGTCT GCCGGATCAG TTTCGTTTTT TCTTCGGACC GGATTTTCCG
ACCGAACAAC TCCAAGAGCG ACCTTTCCGT GGCTTAGGTT CTGGGGTCAT CATTAACGCT
GATAAAGGGT ATGTCGTCAC TAACTACCAT GTCATTAATG GTGCTGAAAA AATTCGCGTC
AAACTGTATG ACGGTCGCGA GTTTGATGCA GAACTTGTCG GTGGTGATGA GATGTCTGAT
GTCGCCTTGC TCAAGCTAAA CAAAGCGAAA AACCTCACTG AGATCCGTAT CGCGGACTCC
GATAAACTGC GAGTCGGTGA TTTTGCAGTG GCCATCGGTA ACCCATTTGG CTTAGGGCAA
ACTGTGACCT CTGGCATTGT CTCAGCCTTA GGGCGTAGTG GTTTGAATAT CGAAAACTTT
GAAAACTTCA TCCAGACCGA TGCCGCCATC AACAGCGGCA ACTCAGGAGG AGCTCTGGTT
AACCTTAATG GTGAACTCAT CGGTATCAAC ACCGCGATCC TTGGTCCAAA CGGTGGCAAC
GTCGGTATAG GTTTTGCCAT CCCATCGAAT ATGATGAAAA ATCTGACCGA TCAAATTCTT
GAGTTTGGTG AAGTGAAACG CGGCATGCTG GGTGTACAAG GCGGTGAAAT CACTTCCGAA
CTGGCTGATG CGCTCGGCTA TGAATCCTCA AAAGGTGCTT TTGTCAGCCA AGTGGTTCCT
GACAGTGCTG CGGACAAAGC GGGCATCAAA GCGGGTGACA TCATTACGTC GCTGAATGGT
AAAAAAATCG ATACCTTCTC TGAGCTACGC GCGAAAGTCG CGACCCTAGG CGCAGGAAAA
ACCATTACCC TTGGAGTGCT GCGTGATGGT AAGAATCAAA ATATTGATGT AACGCTTGGG
GAGCAGCAAA ATGCCAAGAC CAAAGCAGAA TCACTGCATC AAGGTTTGAG CGGCGCGGAG
TTAAGCAACA CCACTGACAG CGATCCTATT CAGGGCGTTA AGGTTACTGA GGTTCAAAAA
GGCTCTGCCG CTGAATCTTA CCAGCTACAA AAAGACGACA TTATCATTGG CGTTAACCGT
AAGCGGGTGA AAAATATCGC CGAGTTGCGT GCGATTATGG AAAAATCACC GAATATTTTG
GCATTAAATA TCCAACGTGG AGAGAGAACG CTTTACTTGG TTGTTCGTTA A
 
Protein sequence
MMKKPLLVLT ALSLSLSAIL SPLPATAALP LSVNGEQIPS LAPMLEKVTP AVVSIAVEGT 
QVSRQRLPDQ FRFFFGPDFP TEQLQERPFR GLGSGVIINA DKGYVVTNYH VINGAEKIRV
KLYDGREFDA ELVGGDEMSD VALLKLNKAK NLTEIRIADS DKLRVGDFAV AIGNPFGLGQ
TVTSGIVSAL GRSGLNIENF ENFIQTDAAI NSGNSGGALV NLNGELIGIN TAILGPNGGN
VGIGFAIPSN MMKNLTDQIL EFGEVKRGML GVQGGEITSE LADALGYESS KGAFVSQVVP
DSAADKAGIK AGDIITSLNG KKIDTFSELR AKVATLGAGK TITLGVLRDG KNQNIDVTLG
EQQNAKTKAE SLHQGLSGAE LSNTTDSDPI QGVKVTEVQK GSAAESYQLQ KDDIIIGVNR
KRVKNIAELR AIMEKSPNIL ALNIQRGERT LYLVVR