Gene VC0395_A1363 details


Gene Information

Locus tag: VC0395_A1363
Symbol:
ID: 5137389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1458893
End bp: 1461961
Gene Length: 3069 bp
Protein Length: 1022 aa
Translation table: 11
GC content: 43%
IMG OID: 640532821
Product: putative type I restriction enzyme HsdR
Protein accession: YP_001217306
Protein GI: 147674712
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
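
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers listed here but are not stated by the record itself. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1458893
end_bp = 1461961

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp. Assumption: the last codon is the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 3069 0 1022
```

The result agrees with the listed gene length (3069 bp) and protein length (1022 aa).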


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTATTTA TGGTTAGTAA AACAAATGAA CAGGCGCTAG AGGCCGCAAT CGAAAAAGGT 
TTAGCAGGTA TTTGTAAAGA AGAGTTAGCG CTGGGTGAAG CGCCTCTTAA TTACAATAAT
GATCTTTATC TTATTGGCTC TCCAAGTGAC TTTGACAAGC AGTACGCTTT AGATACCCGT
TTGTTTTGGC AATTCCTCGA AGATACTCAA GCGAGTGAGT TAGAAAAACT TAAGCGCACC
AGCCCTCACG ATTGGCAGAG AAAAATCCTT GAGCGTTTCG ATCGTATGAT CAAGCGCCAT
GGTGTTTTGC GACTATTGAA AAAAGGGCTA GACGTTGACG ATGCCTTTCT ATCGTTGATG
TACCCGGCAC CACTCGCCAG TAGTTCTGAG AAGGTAAAAA AAGATTTTTC TGCCAACCTG
TTTAGTGTGA CTCGCCAGGT TTGTTATTCC AATGCAAATC CATTGGAAGA AATTGACATG
GTGCTTTTTA TCAATGGTAT CCCTCTGATT ACCCTTGAGC TTAAAAACCC ATGGACGGGC
CAGAACGCTG TTTATCATGG TCAAAAGCAG TACCGCGATG ATAGAGATGC AAATCAGCCA
TTGTTGAACT TTGCTCGTTG CTTGGTTCAC ATGGCGGTCG ATACCGATGA AGTCTATATG
ACGACTAAGC TGGCAGGCAA AAATACCTTC TTCCTGCCGT TCAACAAAGG CTTCAACTTT
GGCAAAGGTA ACCCGATTAA TCCACATGGG CACAAGACGG CCTACTTGTG GCAAGAGGTA
TTCCGCAAAG AAAGCATCGC CAATATTATT CAGCACTTTA TTCGTCTTGA TGGCAGCAGT
AAAAAGCAGT TGGACAAACG AACTTTGTTC TTCCCTCGAT ATCACCAAAT GGATGTCGTA
CGTCGCTTGG TTGATCACTG CTCAGTTAAT GGTGTTGGGC AAACGTATTT GATACAGCAC
TCAGCGGGGT CAGGTAAATC TAACTCAATT ACATGGGCGG CGTATCAGCT CATCGAAACT
TACCCTATTA GCGATGATCT ACCAGGAAGT AGAGGAAAAG AAATGCCTCT ATTCGATTCG
GTCATCGTTG TTACTGACCG CCGATTGCTC GATAAGCAGT TACGCGACAA TATAAAAGAG
TTCTCTGAAG TGAAGAACAT TGTTGCGCCT GCGTTTAAAT CGTCAGAACT AAAGTCAGCA
TTAGAGAATG GTAAGAAAAT CATCATTACC ACCATTCAAA AGTTTCCCTA TATTGTCGAT
GGTATTGCAG ACTTAAGTGA TAGGCGCTTT GCTGTCATCA TTGATGAAGC ACACAGTTCG
CAGGATGGGC ATAACCAAGA TAAGTTAAAT GAAGCAATGG GGTTTGTTTC GGAGGATGTT
TTAGACAAAG CATTACAAAG TGCGAAAAAT CGTAAGATGC GCTCAAATGC ATCTTACTTT
GCATTCACCG CAACGCCCAA AAACACCACT CTAGAAAAGT TTGGTCAGCG ACAAGCAGAC
GGAACTTATG TTCCTTTTCA TTTGTACTCG ATGAAACAAG CCATCGAAGA AGGGTTTATC
CTCGATGTTA TTGCCAATTA CACGACCTAT AAGAGCTACT ACGAGATTGA AAAATCAATC
CAAGATAACC CTGAGTTTGA CAGCAAAAAA GCACAGAAGC GCTTACGAGC GTATGTAGAG
GCAAGCCAAG AGACGATAGA CACTAAAGCC GAGATCATGC TTGAGCATTT CATCAAGCAT
GTCGTTAACG GCAAGAAGCT AAAAGGTAAA GGCAAAGGTA TGGTGGTGAC TCAAAACATT
GAGTCAGCCA TACGTTACTA TCGGGCACTA ACCAGACAGC TCAATAAAAT GGGCAATCCG
TTTAAAGTTG CCATTGCATT CTCTGGCTCA AAAGAAGTCG ATGGGATTGA GTACACAGAA
GCTGACATTA ATGGTTTCCC AGAAGGTGAT ACCAAAGATT ACTTTGATGT GAACTACAAG
CGTAAGGAGC CGGACTCTCC AATCCCTAAG CACGTAGACC AAGATGCTTA CCGGTTGCTG
GTAGTGGCGA ATAAGTATTT AACAGGCTTT GATCAGCCGA AACTTTGTGC CATGTACGTG
GATAAAAAGT TGGCTAGCGT TTTGTGTGTC CAAGCTCTGT CACGCTTAAA CCGTTCAGCG
CCAAAGTACG GTAAGAAAAC GGAAGATCTG TTTGTGTTGG ATTTCTTCAA CTCAGTGGAT
GATATCAAAA CCGCATTCGA CCCTTTCTAT ACATCCACAA CGCTTTCTGA AGCGACAGAT
GTCAATGTTC TTCATGAGCT AAAAGATGAT ATGGACGACA CAGATGTTTA TGAGTGGTTT
GAAGTTGAGG AGTTCAACAA GCGTTTCTTT GAAGGACGAG AGGCTCAGGA CCTAAGCCCA
ATCATTGACA TCGCAGCTGC GCGTTTTAAC CACGAGTTAG AACTAGAGAA CGAGTTTAAA
GTCGATTTCA AAGTGAAAGC CAAGCAGTTT GTGAAAATCT ACGGCCAGAT TGCATCTATC
ATGCCTTATG AGGTCGTTCA GTGGGAAAAA CTGTTCTGGT TCTTGAAATT TTTGATTCCG
AAGCTTTCCG TCGAAGACCC AGATAAAGAA GCACTAGACT CATTACTAGA TTCAGTTGAT
TTGAGCTCTT ATGGATTACA AAGGGTTAAG CTTAACCATT CCATCGAGCT AGATGACTCT
GAAACTGAGT TAGATCCTCA AAACCCGAAC CCGCGCGGGG CTTATGGTCC TGAAGCTGAG
AAAGATCCGT TAGATGAGAT CATCAAAATC TTTAACGAAC GCTGGTTCCA AGGCTGGAGC
GCAACACCAG AAGAGCAGAG AGTTAAGTTT GTGAACATTG CTGAGAGCAT CCGAAATCAC
CCAGATTTCG AAGCTAAATA CCAAAACAAC GCAGACCCTC ATACGCGAGA GTTAGCGTTT
GAAAAGATGT TGAAAGAAAT CATGCTTCAA CGTCGTAAAG ACGAGTTAGA GCTCTACAAG
CTGTTTGCCC AAGACCCAGC TTTTAAAGCG TCTTGGACGC AGAGCATGCA GCGTATGGTT
GGGATGTAA
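
The link between this nucleotide sequence and the protein sequence in this record is translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch of that translation, not the database's own pipeline. The function name `translate_cds` is hypothetical, and the code assumes the input starts at a valid start codon and is in the coding orientation. Under table 11, an alternative start codon such as GTG is read as methionine, which is why the protein begins with M although the gene begins with GTG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper. Whitespace (such as the 10-base column gaps
    used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a valid start codon, which is
        # read as methionine even when it is GTG or TTG.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGTATTTA TGGTTAGTAA AACAAATGAA"))  # prints: MVFMVSKTNE
```

The ten residues produced match the first block of the protein sequence listed in this record.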
 
Protein sequence
MVFMVSKTNE QALEAAIEKG LAGICKEELA LGEAPLNYNN DLYLIGSPSD FDKQYALDTR 
LFWQFLEDTQ ASELEKLKRT SPHDWQRKIL ERFDRMIKRH GVLRLLKKGL DVDDAFLSLM
YPAPLASSSE KVKKDFSANL FSVTRQVCYS NANPLEEIDM VLFINGIPLI TLELKNPWTG
QNAVYHGQKQ YRDDRDANQP LLNFARCLVH MAVDTDEVYM TTKLAGKNTF FLPFNKGFNF
GKGNPINPHG HKTAYLWQEV FRKESIANII QHFIRLDGSS KKQLDKRTLF FPRYHQMDVV
RRLVDHCSVN GVGQTYLIQH SAGSGKSNSI TWAAYQLIET YPISDDLPGS RGKEMPLFDS
VIVVTDRRLL DKQLRDNIKE FSEVKNIVAP AFKSSELKSA LENGKKIIIT TIQKFPYIVD
GIADLSDRRF AVIIDEAHSS QDGHNQDKLN EAMGFVSEDV LDKALQSAKN RKMRSNASYF
AFTATPKNTT LEKFGQRQAD GTYVPFHLYS MKQAIEEGFI LDVIANYTTY KSYYEIEKSI
QDNPEFDSKK AQKRLRAYVE ASQETIDTKA EIMLEHFIKH VVNGKKLKGK GKGMVVTQNI
ESAIRYYRAL TRQLNKMGNP FKVAIAFSGS KEVDGIEYTE ADINGFPEGD TKDYFDVNYK
RKEPDSPIPK HVDQDAYRLL VVANKYLTGF DQPKLCAMYV DKKLASVLCV QALSRLNRSA
PKYGKKTEDL FVLDFFNSVD DIKTAFDPFY TSTTLSEATD VNVLHELKDD MDDTDVYEWF
EVEEFNKRFF EGREAQDLSP IIDIAAARFN HELELENEFK VDFKVKAKQF VKIYGQIASI
MPYEVVQWEK LFWFLKFLIP KLSVEDPDKE ALDSLLDSVD LSSYGLQRVK LNHSIELDDS
ETELDPQNPN PRGAYGPEAE KDPLDEIIKI FNERWFQGWS ATPEEQRVKF VNIAESIRNH
PDFEAKYQNN ADPHTRELAF EKMLKEIMLQ RRKDELELYK LFAQDPAFKA SWTQSMQRMV
GM