Gene VC0395_A0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0251 
Symbol 
ID5136233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp260275 
End bp261675 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content50% 
IMG OID640531709 
Productputative protease 
Protein accessionYP_001216207 
Protein GI147674148 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACGC CAAAAACCTT TGTTCCAGAA CTCCTTTCAC CTGCGGGTAG CCTGAAGAAT 
ATGCGCTACG CTTTTGCCTA CGGGGCAGAT GCGGTATATG CCGGCCAACC TCGTTACAGC
CTGCGCGTAC GTAACAATGA GTTCAACCAC GAAAACCTAC AAATCGGCAT TAATGAAGCC
CACGCACTGG GTAAAAAATT CTATGTGGTG TGTAACATTC AGCCGCACAA CTCTAAGCTG
AAAACCTTCA TCCGTGACCT TAAGCCGGTG ATTGATATGG GGCCGGATGC GCTCATCATG
TCTGACCCTG GCCTTATCAT GATGGTTCGT GAAGAATTCC CGCACATGCC GATCCACCTG
TCGGTACAAG CGAATGCGGT GAACTGGGCA ACCGTGAAAT TCTGGGCTTC ACAAGGCGTT
GAGCGTGTGA TTGTTTCTCG TGAGCTCTCT TTAGAAGAAA TCGAAGAGAT CCGCGAAAAA
TGCCCGAATA CCGAAATTGA AGTGTTCGTG CATGGCGCTC TATGTATGGC TTATTCCGGT
CGTTGCTTGC TGTCTGGTTA CATCAACAAG CGCGATCCAA ACCAAGGTAC TTGCACTAAC
GCATGCCGTT GGGAATACAA AGTTGAAGCA GCAAAAGAAG ATGAAGCAGG TCAGATCGTT
GAACAGTTTG ACCCTAATGC AGCACAAGCC ATCGAAGTTC AAAATGAACG TCCAGACACC
ACCATCGGGG CCGGCAAACC GATTGATGAT GTCGTACTGC TTTCTGAGAG CCATCGTCCT
GATGAGAAAA TGGCCGCCTT TGAAGATGAG CACGGCACCT ACATCATGAA CTCCAAAGAT
CTGCGTGCAG TACAGCATGT TGAGCGCCTA ACTCAAATGG GTGTGCACTC ACTGAAAATC
GAAGGCCGTA CCAAATCTTT CTACTACTGC GCACGTACCG CGCAAGTGTA CCGTAAAGCG
ATTGATGATG CGGTAGCGGG CAAGCCATTC GATGATAGCC TGATGACTAC CCTAGAAAGC
TTGGCGCACC GTGGCTATAC CGAAGGTTTC TTACGTCGCC ATACGCACGA TGCTTACCAA
AACTACGACT ACGGCTACTC GGTTTCCGAC ACTCAACAGT TTGTCGGTGA ATTTACCGGT
AAACGCCGCG GCGCAATGGC CGAAGTGGAA GTAAAGAACA AATTTGTGCT CGGCGATAGC
CTTGAGCTGA TGACGCCAAA AGGCAATGTC ATCTTCACTT TAGAAGCGAT GGAAAACCGC
AAAGGTGAAG CAACAGATGA TGCCAAAGGC AACGGTCACT TTGTTTACAT TCCAGTTCCG
GAAGAGTTGG ATCTCAGCTA CGCACTGCTG ATGCGTAACC TAGTGCAAGG GCAGGATACC
CGTAACCCAA CAGGCAAGTA A
 
Protein sequence
MTTPKTFVPE LLSPAGSLKN MRYAFAYGAD AVYAGQPRYS LRVRNNEFNH ENLQIGINEA 
HALGKKFYVV CNIQPHNSKL KTFIRDLKPV IDMGPDALIM SDPGLIMMVR EEFPHMPIHL
SVQANAVNWA TVKFWASQGV ERVIVSRELS LEEIEEIREK CPNTEIEVFV HGALCMAYSG
RCLLSGYINK RDPNQGTCTN ACRWEYKVEA AKEDEAGQIV EQFDPNAAQA IEVQNERPDT
TIGAGKPIDD VVLLSESHRP DEKMAAFEDE HGTYIMNSKD LRAVQHVERL TQMGVHSLKI
EGRTKSFYYC ARTAQVYRKA IDDAVAGKPF DDSLMTTLES LAHRGYTEGF LRRHTHDAYQ
NYDYGYSVSD TQQFVGEFTG KRRGAMAEVE VKNKFVLGDS LELMTPKGNV IFTLEAMENR
KGEATDDAKG NGHFVYIPVP EELDLSYALL MRNLVQGQDT RNPTGK