Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_1057 |
Symbol | |
ID | 5134189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 1030342 |
End bp | 1032159 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640531379 |
Product | terminase |
Protein accession | YP_001215893 |
Protein GI | 147671611 |
COG category | [S] Function unknown |
COG ID | [COG5484] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATACT CTCCCGAAAT CCGACAAGCC GCCCGAGCCC TCTATTTGAA GGCATGGACG CCACGCGAAA TCGCCGACGA ATTGAATCTG AACAGTGACC GAATTATTTA CTACTGGGCG GATAAGTTTG GTTGGCGCGA TATGTTGCGT GAACAAACGA TTGATGAAGC TATCGCGAAT CGTATTCAAA CGCTGCTTGA GGTAGAGAAC CCAAGTAAAC CGCAGTTGGA TATGCTCGAT CGGCTGATTA ATCATCACGT CAAACTTAAG AAGCTGCGCG CTACTGAGCA ACCGACTCAG CCCAATGAAG CTGGTACAGT TTCGGCGCAA AGTGGTGCAC ATAGTAGCAA AAGTGGTTCA CCTAAGGCCG AATCTGGCAC ACAAACGGGC GATTCTGGTA AACAATCTGC GCCCAGTGGT AAACGTAAGA AAGTTAAGAA TGATGTTAGT GAGATCACCG AGGCCGATTT TAAGCTGTGG CATGACTCGC TGTTTGCCTA TCAGCACACG ATGCGTAACA ACCTGCACCA GCGGACACGT AACATTCTCA AGTCTCGCCA AATTGGCGCA ACCTATTACT TTGCAGGTGA AGCGTTAGAA CAGGCGATTC TCACGGGCGA TAACCAGATA TTTCTCTCAG CCTCTCGCGC TCAAGCTGAT GTTTTCCGTC GCTATATTGT GGCGATTGCA AAAGAGTTTT TAGGCATTGA GATCACGGGT AACCCTTCTA CTTTGTCGAA TGGTGCAGAG TTGCACTATC TCTCTACCAA CGGCAAAACT GCACAGAGTT ACCACGGCCA CGTTTATATT GATGAGTATT TCTGGATCGG CAAGTTTGAC GAGCTGAACA AAGTCGCCTC GGCGATGGCT ACGCATAAAA AGTGGCGTAA GACTTACTTT TCCACCCCTT CTTCTAAGAT GCACCCTGCT TACCCGTTCT GGACGGGTGA AAAATGGCGC GGCGATAAAA CCACTCGGAA AAATATTGAG TTCCCGACCT TTGATGAACT GCGCGATGGC GGTCGCTTGT GCCCTGATCG CCAGTGGCGT TATGTGGTTA CGATTGAGGA TGCCGCTAAG GGTGGCTGTG ACCTCTTTGA TATTGAGGAA CTGCGCGAAG AGTACAGCGA GACGGACTTC AACAACTTGT TTATGTGCGT GTTTGTTGAT GGTGCCAGCT CGATATTTGA ATTTAATAAG ATTGAACGCT GCATGGTGGA TAGCGAGATT TGGCAGGACT TCAAGCCAAA CGCTGCTCGC CCATTTGGTA GCCGTGAGGT GTGGTTAGGC TATGACCCAT CACGAACCCG TGATAATGCG GTGCTGATGG TGGTCGCGCC ACCGATTGTG GCGGTTGAGA AATTCCGTGT GCTTGAGAAA CACACTTGGC GCGGGCTTTC TTTCCAACAT CAGGCTTCTG AGATCAGCAA AGTGTTTGAG CGCTTCAATG TGACTTACCT TGGCATTGAT ATCACCGGCA TTGGCGCGGG TGTTCATGAC TTGCTGGTTA ATAAGCACCC TCGCGAAACG GTGGCAATTC ACTATTCGAA CGAAAATAAA AACCGCTTGG TGATGAAGAT GATCGACATC ATTGACGGCA ACCGCCTGCA GTTTGATGCG GGCATGAAAG AAACGGCAAT GGCGTTTATG GCGATTAAGC GTGTCGCCAC GAACAGCGGC AACATGATGA CCTTTAAGGC CGAACGTAGC GAGCAAGCTG GCCACGCTGA CGACTTTTGG GCTCTTTCTC ACGCGCTGAT TAATGAACCC CTCGATCACT CCACTCAACG CAAATCAACA TGGCAGATGG CAGCATGA
|
Protein sequence | MAYSPEIRQA ARALYLKAWT PREIADELNL NSDRIIYYWA DKFGWRDMLR EQTIDEAIAN RIQTLLEVEN PSKPQLDMLD RLINHHVKLK KLRATEQPTQ PNEAGTVSAQ SGAHSSKSGS PKAESGTQTG DSGKQSAPSG KRKKVKNDVS EITEADFKLW HDSLFAYQHT MRNNLHQRTR NILKSRQIGA TYYFAGEALE QAILTGDNQI FLSASRAQAD VFRRYIVAIA KEFLGIEITG NPSTLSNGAE LHYLSTNGKT AQSYHGHVYI DEYFWIGKFD ELNKVASAMA THKKWRKTYF STPSSKMHPA YPFWTGEKWR GDKTTRKNIE FPTFDELRDG GRLCPDRQWR YVVTIEDAAK GGCDLFDIEE LREEYSETDF NNLFMCVFVD GASSIFEFNK IERCMVDSEI WQDFKPNAAR PFGSREVWLG YDPSRTRDNA VLMVVAPPIV AVEKFRVLEK HTWRGLSFQH QASEISKVFE RFNVTYLGID ITGIGAGVHD LLVNKHPRET VAIHYSNENK NRLVMKMIDI IDGNRLQFDA GMKETAMAFM AIKRVATNSG NMMTFKAERS EQAGHADDFW ALSHALINEP LDHSTQRKST WQMAA
|
| |