Gene Information |
Locus tag | VC0395_1107 |
Symbol | |
ID | 5134931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 1071734 |
End bp | 1073674 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640531429 |
Product | hypothetical protein |
Protein accession | YP_001215943 |
Protein GI | 147671681 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
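The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table in this record.
start_bp = 1071734
end_bp = 1073674
reported_gene_length = 1941      # bp
reported_protein_length = 646    # aa


def gene_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(cds_length):
    """Number of residues in a CDS whose last codon is a stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.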
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCGT TGATTTTTCT CTACCCACAC TGGTTAGGGT TGTTGGTGCC TCTGTTGTTG CTCGCCGCTT GGCGAGGGCT GCGCCAAAAC CAACGGGGAT TGATTGCCCC TCATCTCGCG CAAGCGTTGG GGATTGAAAC TCGCACGCGG CGCTCTTTTG GGGGATTATT AGCACTCAGT TGGATCGTCG CTACCCTTGC GATGGCGGGC CCAAGCTGGC AATCCGCTGA ACGTCCGAGT GTGCAAAATA GCGCCGCACG TGTGTTAATT ATGGATATGT CACGCTCGAT GTATGCAACC GACCTAACGC CAAACCGTTT AACACAGGCG CGTTATAAGG CACTTGACCT CCTAAAAGGT TGGCAAGAAG GCAGTACGGG CTTAGTGGCT TACTCCGCTG ATGCTTATGT GGTGAGCCCA CTGACGAGTG ACAGTGCAAC GCTCGCCAAT CTGCTGCCGA ATCTCTCCCC AGACATCATG CCCTATCAAG GCTCAGATGC CGCTGCCGCA GTGAGTTTAG CGATCACCAT GCTCCAGCAA TCGGGTCATC AGCAAGGGGA TCTGATTCTT ATCACCGATG ATATGAGTGT GACAGAACGA GAAAAATTGA TCTCGTTGTT ACAAGGTAGC CCATGGCGTT TGGTGACGCT TGCGATCGGC ACTCCTAGCG GCGCGCCCAT CCCTTTAGGT GATGGTAGCC TGCTAAAAGA TCGTCAAGGC CAGACCGTGA TTGCCAAAAC CGCATTTGAC CAATTACAGC AGCTCTCACA ACGCGTTCAA GGCGTGCTGA CCGCGTATCG TGCGGATGGC GCAGATGTGG CGCACATCCT CAGCCTGACT CAGCAGCCGA TTGATATTGC CGAATCGACT TCTCGCCAAG CCATCACGGA GCGCGTCAAT AACGGTTACT GGTTAGCATT GCCCCTCTTG ATCGCCGCTT TATGCCTATT TCGCCGTGGA GTGATTTTCA GCTTACTGCT GCTTTTCGGA GTGAGTTTGC CCAACCAGCA GGCTTGGGCC TCTGCATGGT TAAATCAAGA TCAGCAAGCG ATGCACATGT TCAACAATGA GCAATACGCG CAAGCCGCTG AAGCCTTTCG CGACCCACGC TGGCAAGGTG CGGCTCGTTA CTATGCCAAA GATTACCAAG GGGCCATTGA CGCCTATTCG CAAATCGCTA ACCCTGATAC TGCCACGCAG TACAATCTCG CCAATGCTTA TGCGCAAGCA GGAGAGTTAC AGAAAGCGCA GGATTTGTAC GAACAAGTTC TCAAGCAAGA GCCGAATCAT CAAGATGCAC GACACAATTT AGACGTGGTC AAGGCTGCAC AGCAACAGCA ACAGCAACAG CAACAGCAAC AGCAACAGCA ACAGCAACAG CAACAGCAAC AGCAACAGCA ACAGCAACAG CAACAGCAAC AGCAACAGCA ACAGCAACAG CAACAGCAAC AGCAACAGCA ACAGCAGGAT TCGTCATCTG GCTCCTCCGG TCAAGAAGTG CAAGAAGACT CATCAGCAAA TCCCTCAAAT ACAGCTAAGG AGCAAGAGGC GAGCTCTCAA ACAAAAGGCG CATCGACGCC TGATCCACAG CAGGATCTAC AAGAGAGTAC TGAGCCCAAA GCGAATGCAA AACCACAGGA GCAACCAAAC GCTGTGGATG ATGCGCAAGC TGGGGAACCC AGCGCGCACC AGGAGCAATC AAAAGATCCG AAAAATGGGC AGCCAAGTGG AACCGAGCAA AACGATGAGC AGAGTGACAA AGCCAATGCC GCTCAGCCCT CCGAGTCGGT CACCACCTCT 
TCTGATCCCA ATCTTGACCC TATGTTACGT AAGTTGGAGC AAGTCGAGAG TGCTCGCGAT CCGAGTGCAC TACTGCGCGC GCAATTTATT CTGCAAGCCC AACGTAAACC TCAACCCACG GAACCCGATC AGCCATGGTA A
|
Protein sequence | MSSLIFLYPH WLGLLVPLLL LAAWRGLRQN QRGLIAPHLA QALGIETRTR RSFGGLLALS WIVATLAMAG PSWQSAERPS VQNSAARVLI MDMSRSMYAT DLTPNRLTQA RYKALDLLKG WQEGSTGLVA YSADAYVVSP LTSDSATLAN LLPNLSPDIM PYQGSDAAAA VSLAITMLQQ SGHQQGDLIL ITDDMSVTER EKLISLLQGS PWRLVTLAIG TPSGAPIPLG DGSLLKDRQG QTVIAKTAFD QLQQLSQRVQ GVLTAYRADG ADVAHILSLT QQPIDIAEST SRQAITERVN NGYWLALPLL IAALCLFRRG VIFSLLLLFG VSLPNQQAWA SAWLNQDQQA MHMFNNEQYA QAAEAFRDPR WQGAARYYAK DYQGAIDAYS QIANPDTATQ YNLANAYAQA GELQKAQDLY EQVLKQEPNH QDARHNLDVV KAAQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQD SSSGSSGQEV QEDSSANPSN TAKEQEASSQ TKGASTPDPQ QDLQESTEPK ANAKPQEQPN AVDDAQAGEP SAHQEQSKDP KNGQPSGTEQ NDEQSDKANA AQPSESVTTS SDPNLDPMLR KLEQVESARD PSALLRAQFI LQAQRKPQPT EPDQPW