Gene Bpro_5217 details

Gene Information

Locus tag: Bpro_5217
Symbol:
ID: 4015983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007949
Strand:
Start bp: 321849
End bp: 324110
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 58%
IMG OID: 637944844
Product: von Willebrand factor, type A
Protein accession: YP_551976
Protein GI: 91791025
COG category: [R] General function prediction only
COG ID: [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:
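
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 321849
end_bp = 324110
gene_length_bp = 2262
protein_length_aa = 753

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the final codon
# is the stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp)              # 2262
print(remainder)            # 0
print(expected_protein_aa)  # 753
```

Under these assumptions the span matches the stated gene length and the codon count matches the stated protein length.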


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGAAAGA TCGCGAGTGC CCAACGGGTA ATCGCAGGTT TGATGTCTCT GATTCTGGGC 
AAGAAAAGCA CAGTTGACTG GGGACAGTCG GCCGCATGCG GTGAGAGCGG CGCAATTTCA
TTGCCTCGCC CAAAGACCGG TGATGCCGAT GAGATCGCGC TTCTGACGCG ATTGGCCGTG
CATGAAGCTG GACATGATAA GCACACGGAC TTTGAATGCA TTGAAGGACT GGACGGCAAC
GTGCAGGCTC TGATGAATGC GCTTGAAGAC CCGCGCATTG AGCGTGAGCA GGTAAAGACC
TTTCCGGGTG CTGCCTTGAT TCTGAATCGT GGATTGGAAG ATGCCATCCG GGTGGTGGAC
TCCAAGCTCG ACGCAGGTAA TCCCGAGCAC AGCGCAGATC TGGTAACAGT GAATGTGCTG
CTCAAGGGGT ACCGAAAGCT GGTGCGTCAT CAGGGTGTGA AGGAAGCTGC TGACAGTCTG
GTTGCCAAGG GCGATCAGAT CCTGGGCGAG GCTCGTGTGG ATGCGGTTGG TCAAGCAATT
GATCGGCTGG CAGGCTGTAC CAGCACTTCC GATGCAGTAG CACTCGCGAA GGACCTCTGG
ACGGCATTAC AGCCACCCGA ACCGGAGCAA CAACAGCCGC CACCGTCCGA TCCTCAGGAG
CAAGACGGTG CCGGCGGAGC TGCTGATGAC AGCCAGGAAA AACAGGATGA GGCTGACGAT
GCGGACCAAG ACACGCAAGA GCAGCCAGGT GAGCCTGAAG GTAGCGGGGA TGACGAAAGT
CGACCCGGTA CTGATAGCCA GCCTGATAGC ACCCCGCCTG AGTCGAGCGA TGCCGGTGGG
GACGCTCCGC AAGGGGGAGA TCCGCAGGAA GGTGCCGATG GCGATCCTAA ATCTGATGGA
AATGGCGGTG ACGAGAGTCG CGCAGATTCC AGCGGTTCAC CTGGAGATGC CGCCCAGGAT
TCAGCAGATG GGGCTAGCGA GACCGGAGGG GGCAACCCTG AAGGCGGCGC CCCCAAGGAG
CCGAACGACG AATCGGGATC TGGCGGCGGT GGCCAGGCCA ATGGTTCTGG TGATCAGGAT
GGTGATGGGA AACCATCGCC AGACGCTCAT AACCAAGTAA GCCGGGAAGC TGGTGAAGGC
GGAAACGCGC AAGAACCTCA AAAAGGTGAT GGTGGACAGG GCAAAGCCGG TCAGAACGGG
AAACTTGATC TGACCTCAGC AATGGGAACC GATCTTGGGG CACTACTGGC CCAGGCCTAC
GAGCTCAAAT ACGGGAAACC GGACATTGAT GCACCTGGCC TGGCGGCGCC CGCGCAAACA
ACCACAACCA CGGACGATTT CACCCAGCTG GTGGCTACTG CTTTGGAGCG GGCTGCTGAC
GATGGTGAAT CGCTGGAGAG GGCGCTGGAG TTGATTGAGG TGGCCTTGGA AGCTGTTTCC
GCATCTGAAC AAGGTGAGGG CAGTGAAAAA GAGCCTCAAC TGCTGGGGCT GGCTGCCGGG
ATTGGAAATG CCACGGGTAC AGCCATTCCC TTGGACACAT CAGCACGCAT GAGTGGTGCT
GTGAGCCGGT TAGTACGGAT CTTCACCAAG GAGCTGCAGG ACAAGCGCCG GCGGACTGTG
AAGCTGGCTT CGGCTGGTGG GCAGGTTGCA TCCAACCGCG TATGGCGCCT GAAGGCAATG
GGTGACACCA ATGTTTTCAA AGTGACCTCA AGCGTGTGTG GAATCGACGC GGCAGCGACC
ATCTTGCTGG ACCGATCGGG TTCCATGAGC CGTTGCATCG TTGAGGCCGC TGGTGCCGCA
CTGTCTTGTT CTCAGGCGCT GGAGAGGATT TCGAAGGTGA AAACTTCGAT TGAAATGTTC
CCGGGCTATG CGAAATGTGT GGGTAACACG GTGGCCCTGC AGGCGTTTGG CCAGTCCGCG
CGGCAAGTGG CGCGCAGGGT CAACGAAGTG GATGCCGAAG GTGGAACGCC TTTGGCTGAA
GCCTTACAGG AAGTGATGCC CAGGCTGCTT GCGCAGCGTG TGAAGAAGCG GATCGTGTTT
CTGGTGACTG ATGGCATTCC CAACAACCGT CCTGGGGCGC TTGAAGAAAT CGGGAAGGCG
GAGAAATTGG GCGTTGAGTT CGTAGGCATC GGTATCGGTG TACACGGCAG AGCGATTGAG
GGGCTTACCC CTTTCTCGAT TTGCATCAAT GACGCATCTG AATTACCGGA TGCATTTGAG
AAGCTGTTTC GAGGCAACAT TGCCTTGAAG TTGGCAGCTT GA
 
Protein sequence
MGKIASAQRV IAGLMSLILG KKSTVDWGQS AACGESGAIS LPRPKTGDAD EIALLTRLAV 
HEAGHDKHTD FECIEGLDGN VQALMNALED PRIEREQVKT FPGAALILNR GLEDAIRVVD
SKLDAGNPEH SADLVTVNVL LKGYRKLVRH QGVKEAADSL VAKGDQILGE ARVDAVGQAI
DRLAGCTSTS DAVALAKDLW TALQPPEPEQ QQPPPSDPQE QDGAGGAADD SQEKQDEADD
ADQDTQEQPG EPEGSGDDES RPGTDSQPDS TPPESSDAGG DAPQGGDPQE GADGDPKSDG
NGGDESRADS SGSPGDAAQD SADGASETGG GNPEGGAPKE PNDESGSGGG GQANGSGDQD
GDGKPSPDAH NQVSREAGEG GNAQEPQKGD GGQGKAGQNG KLDLTSAMGT DLGALLAQAY
ELKYGKPDID APGLAAPAQT TTTTDDFTQL VATALERAAD DGESLERALE LIEVALEAVS
ASEQGEGSEK EPQLLGLAAG IGNATGTAIP LDTSARMSGA VSRLVRIFTK ELQDKRRRTV
KLASAGGQVA SNRVWRLKAM GDTNVFKVTS SVCGIDAAAT ILLDRSGSMS RCIVEAAGAA
LSCSQALERI SKVKTSIEMF PGYAKCVGNT VALQAFGQSA RQVARRVNEV DAEGGTPLAE
ALQEVMPRLL AQRVKKRIVF LVTDGIPNNR PGALEEIGKA EKLGVEFVGI GIGVHGRAIE
GLTPFSICIN DASELPDAFE KLFRGNIALK LAA
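
The gene sequence begins with GTG, yet the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG can serve as an alternative start codon and is then read as methionine. The following is a minimal sketch in plain Python, not the method used to produce this record: the `translate` function, its `first_is_start` parameter, and the start-codon set are illustrative assumptions based on the NCBI definition of table 11.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, first_is_start=True):
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is
    ignored. If first_is_start is true and the first codon is a valid
    start codon, it is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
head = "GTGGGAAAGA TCGCGAGTGC CCAACGGGTA ATCGCAGGTT TGATGTCTCT GATTCTGGGC"
tail = "AAGCTGTTTC GAGGCAACAT TGCCTTGAAG TTGGCAGCTT GA"

print(translate(head))                        # MGKIASAQRVIAGLMSLILG
print(translate(tail, first_is_start=False))  # KLFRGNIALKLAA
```

The two outputs correspond to the first 20 and last 13 residues of the protein sequence listed above; the final TGA codon is the stop and yields no residue.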