Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1312 |
Symbol | |
ID | 5137667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1392951 |
End bp | 1395719 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640532770 |
Product | zinc protease |
Protein accession | YP_001217256 |
Protein GI | 147675340 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.792091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC GATTATTAGT CAGCATTTTA TGCATACTGT TCGCCGGTTG CGCCTTACAA CAAACCAACC GCGCACTCCA ACCCGATCAA CGCTGGGTTA CTCAAACACT GCCCAATGGC CTTACTTACC ATCTCTACCC AGACAGTGAA CAAGAAGTCT CTATTCGTTT GTACGTTCAC GCAGGTTCAA TGCAAGAAAC CACGCAGCAA GCGGGTTACG CTCACTTCAT CGAACATATG GCATTTAACG GAACCCGCCA TTATCAGCAC AATGATGTGA TTCGTATGTT TGAGCAAAGT GGTGCCCAGT TTGGTGCGGA CTTTAACGCT TTAACCGGCT ACGACCGCAC GGTTTATCAG CTCGACCTAC CTAATGCGCA AAATATCGAT AAAGCGCTAT TGTGGTTTGC TGATATTGCT GACGGTTTAG CGTTTGACGC TGATGAAGTC GAAAAAGAGA AAGGGGTGAT TTTGGGGGAG TTTCGCGCCT CACGCACAGA AAACATGAGC TTAGAGCAGC AGTTTTATCT GCACCAGATC CAAGGAACCT CCTACGCGGA TCGCGATCCC CTCGGTTCAC GCGAGTTGGT GCAAGCCGCC ACTCCAGACA GTTTGAAAGC CTTCTACCAA CAGTGGTATC AACCTCAATT AGCGGAACTG GTCATTACTG GTAACTTTAC CCTTGAACAA GGCCAACAGT GGGTTGAGAA CTATTTCTCC TCATGGAAGA AAGGCACCAC CGAAAAGCCA GCTTCGATTT ATCACCAAGC CCTGAATAAC CAAGACCTAG TCGCTCCGGT GACTGCAGGT GAATCCCCTA GCCTAACCTT GATCTTCCCC CAAGGCTCAG CAGCGATTAA AGACTATGCC AGCCAGCAAG AGTTTTGGCG CGACGATGTC GGCGAACAAC TCCTGCACAC ACGCCTAGTA GCGGCATTCA ATGATGCAGC GCAGGCCATA ACCGGCATTT ACGCAACCCA TTATGAAATC GAAGGTCAAC GTTACACTCT CATTAGCGTC GGGTTTGCGG CCGAACAACG AGAGAAAGTG CAAGCCTTGT TGCTGGAAAC CCTCGCTTCA ATGCGTGATT ACGGGGTGAC CAAAAATGAG TTGGATATCA TTTTGCGCGG TTATCGTGAG CACTTAACCT TTTTGCAAGA AGATCGCGAA GCCATAACCC CAGCGAGCCA TGCCAATCAA AAAGTCTACT CGATTGTGTT TGATACACCG ATCCAAGCGA CGCTCGATTA CCAAGCGAGC TTGAGCGAGT TTATTGCTTC CGCGACACCA GAAATGATCA ATCGTCATAT CCAACAGCAA TTGAGCCAAA ACCCTGTCTG GGTTGTTGGC GTTGCGGCCA CTGAAGATGC GCAAGCGTTG AACAAAGCCT TGCCACAATG GCGTAACGAC CTAGCGCAAC CTGGCAATCA GCCTATCGAT CAACAAATCG ATAGCCCATT CACCCAGCAA TTTACTGCGG GCGAAGTGGT GAAACAACTC GATATCAATG ACGATCCACA GGTGACTTAT TGGCAGCTTG ATAACGGTAT TGATGTCTAT TATCTGCGCA ATATTGAAGC CAAAGATCGC GTCTTTGTTC AATACGCCAG TTCGGGTGGC CAATTCGCGT TACCTGCAGA CCTACTCCCT GCAGCCGAGA TTGCTACTGC AGTACAGACA CGAAGTGGCC TCGATACGTT GAACGGCTCT CAGTTTGATC GCTACCTTCG TCAGAAAGAC ATTGGTTTCT ATAGCTATAT TGCATCGACC AGTCATGGTT TCGAAGCCAA TAGTAAAGCG CAAGAGTTAC CTGAGCTGTT GGAAATACTC CATTTATTGT CAACTCAAGT CAAAGTCAGC CCTGATCAGC TCAATTCAGT GAAAACCGAA TTTACTCAAA ATCGTAGTGC GTACTTTGAT TCCCCTATTG GCGCTTTCTT CCGTACGGTG ACCAACCAAA GCTTTATCGA AACAAGTCCC TACAGAATTC GTACTCCAGA GCAGATTGCT CAAGTCACGG CACAGCAAAT TGAGCAGGTT CATCAGCGTC TCTTCAGTGA AGGACGCAAT AATACGCTGG TCATCGTCGG TGATATTGAG CGGTCTCAAA TAACCCCGAT GTTGCGTCAG TACGTAGCAA GTATACCTTT GAGCAAGGGA ACGCTTTCCC CTATGACCAG CCAGTTGATC AAACCGGTTG CGCCACGCTT AGAGTTAGCA CTTAACAATG AAAACTCGAC CCAGTATTCC CTGCGCCTGA TATCTGAAAC GCAGCCGCGT ACCGCAAAAA CCGTCTTTAT CGATGATATG TTGCAGCGTA TTGCGACTCA GCGTTTACTG GCTGAAGTTC GCGAGCATCA AGGTTTAGAC TACACGCCAC AAGTGATCCC ATACGTGGTC GATGGAGACA TTCTCAATGA TTGGGTATTG TCAGCTTTAG TCGACCCCAA AAGTGAACCA CAAGTGGCCA AAGTGATGCA TGAGGTCGCG CGTGAGCTGG CTCAAGGTGT CACACAGCAA GAGTTGGATG TGGTGAAACA GAAGTTCTTG ATTGATATGA AACCGCTCAA CAAATCACCG GAGCAGCAAG CTTACTTCAT GTTGCGTTAT GCGATTCATC ACTATGGCGT TGAGACCATC TACAAGGTTG AGGAGCTGAC GAAATCCATC ACTCTGGATG ACATCAACCA ACGTGCTCAA ACGTTATTTG GCAAAGATAC TATGTCACAA GAGCTGATCA TGACACCTAA GGCTAATCCT AAAGGCTAA
|
Protein sequence | MKIRLLVSIL CILFAGCALQ QTNRALQPDQ RWVTQTLPNG LTYHLYPDSE QEVSIRLYVH AGSMQETTQQ AGYAHFIEHM AFNGTRHYQH NDVIRMFEQS GAQFGADFNA LTGYDRTVYQ LDLPNAQNID KALLWFADIA DGLAFDADEV EKEKGVILGE FRASRTENMS LEQQFYLHQI QGTSYADRDP LGSRELVQAA TPDSLKAFYQ QWYQPQLAEL VITGNFTLEQ GQQWVENYFS SWKKGTTEKP ASIYHQALNN QDLVAPVTAG ESPSLTLIFP QGSAAIKDYA SQQEFWRDDV GEQLLHTRLV AAFNDAAQAI TGIYATHYEI EGQRYTLISV GFAAEQREKV QALLLETLAS MRDYGVTKNE LDIILRGYRE HLTFLQEDRE AITPASHANQ KVYSIVFDTP IQATLDYQAS LSEFIASATP EMINRHIQQQ LSQNPVWVVG VAATEDAQAL NKALPQWRND LAQPGNQPID QQIDSPFTQQ FTAGEVVKQL DINDDPQVTY WQLDNGIDVY YLRNIEAKDR VFVQYASSGG QFALPADLLP AAEIATAVQT RSGLDTLNGS QFDRYLRQKD IGFYSYIAST SHGFEANSKA QELPELLEIL HLLSTQVKVS PDQLNSVKTE FTQNRSAYFD SPIGAFFRTV TNQSFIETSP YRIRTPEQIA QVTAQQIEQV HQRLFSEGRN NTLVIVGDIE RSQITPMLRQ YVASIPLSKG TLSPMTSQLI KPVAPRLELA LNNENSTQYS LRLISETQPR TAKTVFIDDM LQRIATQRLL AEVREHQGLD YTPQVIPYVV DGDILNDWVL SALVDPKSEP QVAKVMHEVA RELAQGVTQQ ELDVVKQKFL IDMKPLNKSP EQQAYFMLRY AIHHYGVETI YKVEELTKSI TLDDINQRAQ TLFGKDTMSQ ELIMTPKANP KG
|
| |