Gene VC0395_1107 details

Gene Information

Locus tag: VC0395_1107
Symbol:
ID: 5134931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 1071734
End bp: 1073674
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 53%
IMG OID: 640531429
Product: hypothetical protein
Protein accession: YP_001215943
Protein GI: 147671681
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
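
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that check. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1071734
END_BP = 1073674


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))                  # 1941
print(protein_length(gene_length(START_BP, END_BP)))  # 646
```

Under these assumptions, 1941 bp corresponds to 647 codons, of which 646 encode amino acids.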


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATCGT TGATTTTTCT CTACCCACAC TGGTTAGGGT TGTTGGTGCC TCTGTTGTTG 
CTCGCCGCTT GGCGAGGGCT GCGCCAAAAC CAACGGGGAT TGATTGCCCC TCATCTCGCG
CAAGCGTTGG GGATTGAAAC TCGCACGCGG CGCTCTTTTG GGGGATTATT AGCACTCAGT
TGGATCGTCG CTACCCTTGC GATGGCGGGC CCAAGCTGGC AATCCGCTGA ACGTCCGAGT
GTGCAAAATA GCGCCGCACG TGTGTTAATT ATGGATATGT CACGCTCGAT GTATGCAACC
GACCTAACGC CAAACCGTTT AACACAGGCG CGTTATAAGG CACTTGACCT CCTAAAAGGT
TGGCAAGAAG GCAGTACGGG CTTAGTGGCT TACTCCGCTG ATGCTTATGT GGTGAGCCCA
CTGACGAGTG ACAGTGCAAC GCTCGCCAAT CTGCTGCCGA ATCTCTCCCC AGACATCATG
CCCTATCAAG GCTCAGATGC CGCTGCCGCA GTGAGTTTAG CGATCACCAT GCTCCAGCAA
TCGGGTCATC AGCAAGGGGA TCTGATTCTT ATCACCGATG ATATGAGTGT GACAGAACGA
GAAAAATTGA TCTCGTTGTT ACAAGGTAGC CCATGGCGTT TGGTGACGCT TGCGATCGGC
ACTCCTAGCG GCGCGCCCAT CCCTTTAGGT GATGGTAGCC TGCTAAAAGA TCGTCAAGGC
CAGACCGTGA TTGCCAAAAC CGCATTTGAC CAATTACAGC AGCTCTCACA ACGCGTTCAA
GGCGTGCTGA CCGCGTATCG TGCGGATGGC GCAGATGTGG CGCACATCCT CAGCCTGACT
CAGCAGCCGA TTGATATTGC CGAATCGACT TCTCGCCAAG CCATCACGGA GCGCGTCAAT
AACGGTTACT GGTTAGCATT GCCCCTCTTG ATCGCCGCTT TATGCCTATT TCGCCGTGGA
GTGATTTTCA GCTTACTGCT GCTTTTCGGA GTGAGTTTGC CCAACCAGCA GGCTTGGGCC
TCTGCATGGT TAAATCAAGA TCAGCAAGCG ATGCACATGT TCAACAATGA GCAATACGCG
CAAGCCGCTG AAGCCTTTCG CGACCCACGC TGGCAAGGTG CGGCTCGTTA CTATGCCAAA
GATTACCAAG GGGCCATTGA CGCCTATTCG CAAATCGCTA ACCCTGATAC TGCCACGCAG
TACAATCTCG CCAATGCTTA TGCGCAAGCA GGAGAGTTAC AGAAAGCGCA GGATTTGTAC
GAACAAGTTC TCAAGCAAGA GCCGAATCAT CAAGATGCAC GACACAATTT AGACGTGGTC
AAGGCTGCAC AGCAACAGCA ACAGCAACAG CAACAGCAAC AGCAACAGCA ACAGCAACAG
CAACAGCAAC AGCAACAGCA ACAGCAACAG CAACAGCAAC AGCAACAGCA ACAGCAACAG
CAACAGCAAC AGCAACAGCA ACAGCAGGAT TCGTCATCTG GCTCCTCCGG TCAAGAAGTG
CAAGAAGACT CATCAGCAAA TCCCTCAAAT ACAGCTAAGG AGCAAGAGGC GAGCTCTCAA
ACAAAAGGCG CATCGACGCC TGATCCACAG CAGGATCTAC AAGAGAGTAC TGAGCCCAAA
GCGAATGCAA AACCACAGGA GCAACCAAAC GCTGTGGATG ATGCGCAAGC TGGGGAACCC
AGCGCGCACC AGGAGCAATC AAAAGATCCG AAAAATGGGC AGCCAAGTGG AACCGAGCAA
AACGATGAGC AGAGTGACAA AGCCAATGCC GCTCAGCCCT CCGAGTCGGT CACCACCTCT
TCTGATCCCA ATCTTGACCC TATGTTACGT AAGTTGGAGC AAGTCGAGAG TGCTCGCGAT
CCGAGTGCAC TACTGCGCGC GCAATTTATT CTGCAAGCCC AACGTAAACC TCAACCCACG
GAACCCGATC AGCCATGGTA A
 
Protein sequence
MSSLIFLYPH WLGLLVPLLL LAAWRGLRQN QRGLIAPHLA QALGIETRTR RSFGGLLALS 
WIVATLAMAG PSWQSAERPS VQNSAARVLI MDMSRSMYAT DLTPNRLTQA RYKALDLLKG
WQEGSTGLVA YSADAYVVSP LTSDSATLAN LLPNLSPDIM PYQGSDAAAA VSLAITMLQQ
SGHQQGDLIL ITDDMSVTER EKLISLLQGS PWRLVTLAIG TPSGAPIPLG DGSLLKDRQG
QTVIAKTAFD QLQQLSQRVQ GVLTAYRADG ADVAHILSLT QQPIDIAEST SRQAITERVN
NGYWLALPLL IAALCLFRRG VIFSLLLLFG VSLPNQQAWA SAWLNQDQQA MHMFNNEQYA
QAAEAFRDPR WQGAARYYAK DYQGAIDAYS QIANPDTATQ YNLANAYAQA GELQKAQDLY
EQVLKQEPNH QDARHNLDVV KAAQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ
QQQQQQQQQD SSSGSSGQEV QEDSSANPSN TAKEQEASSQ TKGASTPDPQ QDLQESTEPK
ANAKPQEQPN AVDDAQAGEP SAHQEQSKDP KNGQPSGTEQ NDEQSDKANA AQPSESVTTS
SDPNLDPMLR KLEQVESARD PSALLRAQFI LQAQRKPQPT EPDQPW
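
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is illustrative, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns amino acids to codons the same way the standard code does, and it ignores the table's alternative start codons, which is harmless here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCATCGT TGATTTTTCT CTACCCACAC"))  # MSSLIFLYPH
```

Applied to the last 21 bases of the gene sequence, the same function returns `EPDQPW`, which matches the end of the protein sequence.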