Gene VC0395_A0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0820 
Symbol 
ID5135919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp832601 
End bp833812 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content45% 
IMG OID640532278 
Productputative trypsin 
Protein accessionYP_001216770 
Protein GI147673040 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5640] Secreted trypsin-like serine protease 
TIGRFAM ID[TIGR03501] gammaproteobacterial enzyme C-terminal transmembrane domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000000898339 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATCAGG TTTCTAAATT GCTGTCGTGT TTTATCGGTT TTTCTCTTTT CTCTACACTG 
CTCTATGCGG AATCCACAGC AGATATTTCA TCTCGTATTA TTAATGGTTC GAATGCAAAC
TCTGCCGAAT GGCCATCGAT TGTGGCTCTG GTTAAACGCG GTGCAGATGC TTATCAAGGG
CAATTTTGTG GCGGTAGTTT TTTAGGTGGA CGATATGTCT TAACCGCTGC TCACTGTTTT
GATAGTCGTA GTGCTGCTAG TGTGGATGTG ATTATTGGTG CTTATGATCT GAATAACTCT
TCTCAAGGAG AAAGGATTGC CGCGCAAAAA ATTTACCGTC ATTTGAGTTA TAGTCCCAGT
AACTTATTGA ATGACATTGC GATTGTCGAG TTAGCACAAA CCAGTAGTTT GCCTGCCATC
ACTTTAGCGG GTCCTGCAAC ACGCACGTCT TTACCAGCAT TAACCCCTTT AACAGTGGCT
GGATGGGGGA TTACCGTGCA ATCAAAACCG CCACAATTTA CTCCGATACT ACAAGAAGTG
GATGTTGACC TGGTTTCGCA ATCGCTTTGC CAAATAGTCA TGCAGCATGG AATTTCCTCT
GATCCTAATT CGACCAACTT TTGTGCCGCT CGTCTTACGA AAGATTCATG TCAGGGAGAT
TCTGGTGGCC CAATTGTAGT TAAAACGGGT CGAGAGCAGC TGGGTATTGT GAGTTGGGGG
GATGAACAAT GCGCTAAAAC AGGCACTTAT GGTGTGTATA CCAACGTCAG TTATTTTCGT
GATTGGATCA CAAAGCACAC TAACCAGCTT AGCTATGATC AAGTCGCCAA TCTCGGTATT
CGTCCATTAG GTAAAGTGAG TCAATCATTT ACATATACCA ACCTTGATGC TAATGCGCTG
ACCTATACAG GAAACACATT CTCAAGTCTG CCCGCTGATT TTAGTGTGCT ATCCGATGGG
TGCAGTACTA AAGTGACGTT AGCCACAGGT GAAAGTTGTA GTGTAGAAGT GGCTGTTGAT
GCGCAACACT ACCGTCAATA TCAGTATGAC TTTGAGTTGA TCTTTAGCTA TGCGGGAGGT
TCTAAACGTG CAACGTCTCG TATTCAACTG GATACTTCGC CTTTTGCCCC AAGTGCCTCT
TCTGGCGGTT CCATCGGCTG GTTTGGTTTG CTGCTTTTGG CTCCATTGTG GATGAGACGG
AAAACCGCTT GA
 
Protein sequence
MHQVSKLLSC FIGFSLFSTL LYAESTADIS SRIINGSNAN SAEWPSIVAL VKRGADAYQG 
QFCGGSFLGG RYVLTAAHCF DSRSAASVDV IIGAYDLNNS SQGERIAAQK IYRHLSYSPS
NLLNDIAIVE LAQTSSLPAI TLAGPATRTS LPALTPLTVA GWGITVQSKP PQFTPILQEV
DVDLVSQSLC QIVMQHGISS DPNSTNFCAA RLTKDSCQGD SGGPIVVKTG REQLGIVSWG
DEQCAKTGTY GVYTNVSYFR DWITKHTNQL SYDQVANLGI RPLGKVSQSF TYTNLDANAL
TYTGNTFSSL PADFSVLSDG CSTKVTLATG ESCSVEVAVD AQHYRQYQYD FELIFSYAGG
SKRATSRIQL DTSPFAPSAS SGGSIGWFGL LLLAPLWMRR KTA