Gene VC0395_A1254 details


Gene Information

Locus tag: VC0395_A1254
Symbol:
ID: 5136098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1327307
End bp: 1328953
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 43%
IMG OID: 640532712
Product: putative trypsin
Protein accession: YP_001217198
Protein GI: 147673259
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5640] Secreted trypsin-like serine protease
TIGRFAM ID: [TIGR03501] gammaproteobacterial enzyme C-terminal transmembrane domain
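
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The record does not state either assumption. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1327307
end_bp = 1328953
gene_length_bp = 1647
protein_length_aa = 548

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1647 0 548
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields.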


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000000770222
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA CATTTTTATC AGGAGTGGTT GGAACACTTC TTTTTACATC TTTTCAGTTC 
AGTGCTTCTG GAACGGAATC CGGAGTCTCT AGCCGTATTA TTGGTGGAGA ACAAGCGACT
GCAGGCGAAT GGCCTTATAT GGTTGCGCTA ACCGCTCGAA ATAGTAGCCA CGTTTTTTGT
GGTGGTAGTT ATCTTGGTGG TCGTTATGTT TTGACTGCTG CGCACTGTGT GGATAAAGAA
GATCCTGCTA AGGGGGATGT ACTGCTCGGT GCTTTTGACA TGAATGATGT TAATACTGCA
GAGAGAATTC ATGTCAGACA AATTTATGTG CATAATAGCT ATATTACTGC TTCAATGGGC
AATGATATTG CTGTTCTTGA ATTGGAACGG GATCCCTTAC CTCGAAGATC TGTGCAAATT
TCAGATTCAT CGGATTTCAA TGAGTTGACA AAAGATTCGC CAATGACGGT GATCGGTTTT
GGTAATCGTA AAGAAGTCGA TGGTGAGAAA TCCGATCCTG CAACCATATT ACACCAAGTA
CAGGTCCCCT TTGTTCCACT CCCGGAATGT AAAACAAAAG GCAGTGATCA GGACGCTAAA
AACAATTACT CTCAACTAAC CAATAACGCG TTTTGTGCCG GTTCGTTTGG CAAAGACGCT
TGTTCTGGTG ACAGCGGTGG CCCTATCTTT TTTGATTCCA ACAATGGTCG AAAGCAGATG
GGCGTTGTTA GCTGGGGGGA TGGCTGTGGT CGAGCCAATA GTCCTGGTGT TTATACTAAC
CTCAGCGTTT TTAATGATTG GTTAGATGAT CAGCAACTAG GTCTATCCTA TCGCCAAAAA
CGAGACTTAG GAGTAGTAAG ACCAGGTTCG TATACTCATA ATTTAACTTT TACGAATAAC
GGTAATGCTG ACATCAACTT GGGCAATACC TTTGTTTTTG TTGTTGGCAT TTCTAGAACA
GACGCTGCTG CGATTGTGAA TAATAGCTGT ACAGGAGTGT TAGCTTCAGG AGCGAGTTGT
GACGTTGAGT TTAGCTATAA TATTACTGAA CATAAACAGA GTTATGTAAA ATTGATAATA
GGATCAAGTA CTTATAAGAC GGGTGCTGTT CATGCCTACC TCTATTTTGA TGCTTTGGAC
GCTGCGCCTT CAGAAACAGT AAGCTTTTTA GCGAATTTAC CTGTCCATAA TACCCACGTG
AATGATCACC CTTGGACGGT TGTCGGTAAT GGTCTGCAGA CTTCAGCTTT ACCAGCTGGA
GAAGAGTCTG TAATTTTGTT GGAAAACCTA CCTCAAGGTA GACTCAAATT CCACTATAAA
CTGTCGTCAA GTGAAGTTCT CGATCAGCTA TTTGTCTACG TGAATGATAA GTTTAAGGGT
AAATACTTTA ACAACACGGA AAATCTAGCC ACACTTGATA TGTATGGTAC GAATAACAAG
GTTCGATTTG TATACAGAAG ACATAGCGGT AGTACTGATG ACCAAAGTCG AGCCATATTG
AGCCAAATAA GCTACGATCC AAAATTCTTT GATCTACCTC CACCGCTTGA TATTCGTATC
GGTGATAGTG GCGGCGGTAG CCTAGGTGGG GCAGCCTTAG CTCTTCTCTT TGGTTGTGGC
TGGTTACGTC GTCGTCAACG CGTCTGA
 
Protein sequence
MNKTFLSGVV GTLLFTSFQF SASGTESGVS SRIIGGEQAT AGEWPYMVAL TARNSSHVFC 
GGSYLGGRYV LTAAHCVDKE DPAKGDVLLG AFDMNDVNTA ERIHVRQIYV HNSYITASMG
NDIAVLELER DPLPRRSVQI SDSSDFNELT KDSPMTVIGF GNRKEVDGEK SDPATILHQV
QVPFVPLPEC KTKGSDQDAK NNYSQLTNNA FCAGSFGKDA CSGDSGGPIF FDSNNGRKQM
GVVSWGDGCG RANSPGVYTN LSVFNDWLDD QQLGLSYRQK RDLGVVRPGS YTHNLTFTNN
GNADINLGNT FVFVVGISRT DAAAIVNNSC TGVLASGASC DVEFSYNITE HKQSYVKLII
GSSTYKTGAV HAYLYFDALD AAPSETVSFL ANLPVHNTHV NDHPWTVVGN GLQTSALPAG
EESVILLENL PQGRLKFHYK LSSSEVLDQL FVYVNDKFKG KYFNNTENLA TLDMYGTNNK
VRFVYRRHSG STDDQSRAIL SQISYDPKFF DLPPPLDIRI GDSGGGSLGG AALALLFGCG
WLRRRQRV
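
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before further use. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names `join_blocks` and `gc_percent` are hypothetical and are not part of any database tool. The demo uses only the first and last lines of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last lines of the gene sequence above, used only as a short demo.
first_line = "ATGAACAAAA CATTTTTATC AGGAGTGGTT GGAACACTTC TTTTTACATC TTTTCAGTTC"
last_line = "TGGTTACGTC GTCGTCAACG CGTCTGA"

demo = join_blocks(first_line + "\n" + last_line)
print(len(demo), demo[:3], demo[-3:])  # 87 ATG TGA
```

Passing the complete gene sequence to `join_blocks` and then to `gc_percent` would give a value to compare with the GC content field in the record.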