Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1363 |
Symbol | |
ID | 5137389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1458893 |
End bp | 1461961 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640532821 |
Product | putative type I restriction enzyme HsdR |
Protein accession | YP_001217306 |
Protein GI | 147674712 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTATTTA TGGTTAGTAA AACAAATGAA CAGGCGCTAG AGGCCGCAAT CGAAAAAGGT TTAGCAGGTA TTTGTAAAGA AGAGTTAGCG CTGGGTGAAG CGCCTCTTAA TTACAATAAT GATCTTTATC TTATTGGCTC TCCAAGTGAC TTTGACAAGC AGTACGCTTT AGATACCCGT TTGTTTTGGC AATTCCTCGA AGATACTCAA GCGAGTGAGT TAGAAAAACT TAAGCGCACC AGCCCTCACG ATTGGCAGAG AAAAATCCTT GAGCGTTTCG ATCGTATGAT CAAGCGCCAT GGTGTTTTGC GACTATTGAA AAAAGGGCTA GACGTTGACG ATGCCTTTCT ATCGTTGATG TACCCGGCAC CACTCGCCAG TAGTTCTGAG AAGGTAAAAA AAGATTTTTC TGCCAACCTG TTTAGTGTGA CTCGCCAGGT TTGTTATTCC AATGCAAATC CATTGGAAGA AATTGACATG GTGCTTTTTA TCAATGGTAT CCCTCTGATT ACCCTTGAGC TTAAAAACCC ATGGACGGGC CAGAACGCTG TTTATCATGG TCAAAAGCAG TACCGCGATG ATAGAGATGC AAATCAGCCA TTGTTGAACT TTGCTCGTTG CTTGGTTCAC ATGGCGGTCG ATACCGATGA AGTCTATATG ACGACTAAGC TGGCAGGCAA AAATACCTTC TTCCTGCCGT TCAACAAAGG CTTCAACTTT GGCAAAGGTA ACCCGATTAA TCCACATGGG CACAAGACGG CCTACTTGTG GCAAGAGGTA TTCCGCAAAG AAAGCATCGC CAATATTATT CAGCACTTTA TTCGTCTTGA TGGCAGCAGT AAAAAGCAGT TGGACAAACG AACTTTGTTC TTCCCTCGAT ATCACCAAAT GGATGTCGTA CGTCGCTTGG TTGATCACTG CTCAGTTAAT GGTGTTGGGC AAACGTATTT GATACAGCAC TCAGCGGGGT CAGGTAAATC TAACTCAATT ACATGGGCGG CGTATCAGCT CATCGAAACT TACCCTATTA GCGATGATCT ACCAGGAAGT AGAGGAAAAG AAATGCCTCT ATTCGATTCG GTCATCGTTG TTACTGACCG CCGATTGCTC GATAAGCAGT TACGCGACAA TATAAAAGAG TTCTCTGAAG TGAAGAACAT TGTTGCGCCT GCGTTTAAAT CGTCAGAACT AAAGTCAGCA TTAGAGAATG GTAAGAAAAT CATCATTACC ACCATTCAAA AGTTTCCCTA TATTGTCGAT GGTATTGCAG ACTTAAGTGA TAGGCGCTTT GCTGTCATCA TTGATGAAGC ACACAGTTCG CAGGATGGGC ATAACCAAGA TAAGTTAAAT GAAGCAATGG GGTTTGTTTC GGAGGATGTT TTAGACAAAG CATTACAAAG TGCGAAAAAT CGTAAGATGC GCTCAAATGC ATCTTACTTT GCATTCACCG CAACGCCCAA AAACACCACT CTAGAAAAGT TTGGTCAGCG ACAAGCAGAC GGAACTTATG TTCCTTTTCA TTTGTACTCG ATGAAACAAG CCATCGAAGA AGGGTTTATC CTCGATGTTA TTGCCAATTA CACGACCTAT AAGAGCTACT ACGAGATTGA AAAATCAATC CAAGATAACC CTGAGTTTGA CAGCAAAAAA GCACAGAAGC GCTTACGAGC GTATGTAGAG GCAAGCCAAG AGACGATAGA CACTAAAGCC GAGATCATGC TTGAGCATTT CATCAAGCAT GTCGTTAACG GCAAGAAGCT AAAAGGTAAA GGCAAAGGTA TGGTGGTGAC TCAAAACATT GAGTCAGCCA TACGTTACTA TCGGGCACTA ACCAGACAGC TCAATAAAAT GGGCAATCCG TTTAAAGTTG CCATTGCATT CTCTGGCTCA AAAGAAGTCG ATGGGATTGA GTACACAGAA GCTGACATTA ATGGTTTCCC AGAAGGTGAT ACCAAAGATT ACTTTGATGT GAACTACAAG CGTAAGGAGC CGGACTCTCC AATCCCTAAG CACGTAGACC AAGATGCTTA CCGGTTGCTG GTAGTGGCGA ATAAGTATTT AACAGGCTTT GATCAGCCGA AACTTTGTGC CATGTACGTG GATAAAAAGT TGGCTAGCGT TTTGTGTGTC CAAGCTCTGT CACGCTTAAA CCGTTCAGCG CCAAAGTACG GTAAGAAAAC GGAAGATCTG TTTGTGTTGG ATTTCTTCAA CTCAGTGGAT GATATCAAAA CCGCATTCGA CCCTTTCTAT ACATCCACAA CGCTTTCTGA AGCGACAGAT GTCAATGTTC TTCATGAGCT AAAAGATGAT ATGGACGACA CAGATGTTTA TGAGTGGTTT GAAGTTGAGG AGTTCAACAA GCGTTTCTTT GAAGGACGAG AGGCTCAGGA CCTAAGCCCA ATCATTGACA TCGCAGCTGC GCGTTTTAAC CACGAGTTAG AACTAGAGAA CGAGTTTAAA GTCGATTTCA AAGTGAAAGC CAAGCAGTTT GTGAAAATCT ACGGCCAGAT TGCATCTATC ATGCCTTATG AGGTCGTTCA GTGGGAAAAA CTGTTCTGGT TCTTGAAATT TTTGATTCCG AAGCTTTCCG TCGAAGACCC AGATAAAGAA GCACTAGACT CATTACTAGA TTCAGTTGAT TTGAGCTCTT ATGGATTACA AAGGGTTAAG CTTAACCATT CCATCGAGCT AGATGACTCT GAAACTGAGT TAGATCCTCA AAACCCGAAC CCGCGCGGGG CTTATGGTCC TGAAGCTGAG AAAGATCCGT TAGATGAGAT CATCAAAATC TTTAACGAAC GCTGGTTCCA AGGCTGGAGC GCAACACCAG AAGAGCAGAG AGTTAAGTTT GTGAACATTG CTGAGAGCAT CCGAAATCAC CCAGATTTCG AAGCTAAATA CCAAAACAAC GCAGACCCTC ATACGCGAGA GTTAGCGTTT GAAAAGATGT TGAAAGAAAT CATGCTTCAA CGTCGTAAAG ACGAGTTAGA GCTCTACAAG CTGTTTGCCC AAGACCCAGC TTTTAAAGCG TCTTGGACGC AGAGCATGCA GCGTATGGTT GGGATGTAA
|
Protein sequence | MVFMVSKTNE QALEAAIEKG LAGICKEELA LGEAPLNYNN DLYLIGSPSD FDKQYALDTR LFWQFLEDTQ ASELEKLKRT SPHDWQRKIL ERFDRMIKRH GVLRLLKKGL DVDDAFLSLM YPAPLASSSE KVKKDFSANL FSVTRQVCYS NANPLEEIDM VLFINGIPLI TLELKNPWTG QNAVYHGQKQ YRDDRDANQP LLNFARCLVH MAVDTDEVYM TTKLAGKNTF FLPFNKGFNF GKGNPINPHG HKTAYLWQEV FRKESIANII QHFIRLDGSS KKQLDKRTLF FPRYHQMDVV RRLVDHCSVN GVGQTYLIQH SAGSGKSNSI TWAAYQLIET YPISDDLPGS RGKEMPLFDS VIVVTDRRLL DKQLRDNIKE FSEVKNIVAP AFKSSELKSA LENGKKIIIT TIQKFPYIVD GIADLSDRRF AVIIDEAHSS QDGHNQDKLN EAMGFVSEDV LDKALQSAKN RKMRSNASYF AFTATPKNTT LEKFGQRQAD GTYVPFHLYS MKQAIEEGFI LDVIANYTTY KSYYEIEKSI QDNPEFDSKK AQKRLRAYVE ASQETIDTKA EIMLEHFIKH VVNGKKLKGK GKGMVVTQNI ESAIRYYRAL TRQLNKMGNP FKVAIAFSGS KEVDGIEYTE ADINGFPEGD TKDYFDVNYK RKEPDSPIPK HVDQDAYRLL VVANKYLTGF DQPKLCAMYV DKKLASVLCV QALSRLNRSA PKYGKKTEDL FVLDFFNSVD DIKTAFDPFY TSTTLSEATD VNVLHELKDD MDDTDVYEWF EVEEFNKRFF EGREAQDLSP IIDIAAARFN HELELENEFK VDFKVKAKQF VKIYGQIASI MPYEVVQWEK LFWFLKFLIP KLSVEDPDKE ALDSLLDSVD LSSYGLQRVK LNHSIELDDS ETELDPQNPN PRGAYGPEAE KDPLDEIIKI FNERWFQGWS ATPEEQRVKF VNIAESIRNH PDFEAKYQNN ADPHTRELAF EKMLKEIMLQ RRKDELELYK LFAQDPAFKA SWTQSMQRMV GM
|
| |