Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_5217 |
Symbol | |
ID | 4015983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007949 |
Strand | + |
Start bp | 321849 |
End bp | 324110 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637944844 |
Product | von Willebrand factor, type A |
Protein accession | YP_551976 |
Protein GI | 91791025 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAAAGA TCGCGAGTGC CCAACGGGTA ATCGCAGGTT TGATGTCTCT GATTCTGGGC AAGAAAAGCA CAGTTGACTG GGGACAGTCG GCCGCATGCG GTGAGAGCGG CGCAATTTCA TTGCCTCGCC CAAAGACCGG TGATGCCGAT GAGATCGCGC TTCTGACGCG ATTGGCCGTG CATGAAGCTG GACATGATAA GCACACGGAC TTTGAATGCA TTGAAGGACT GGACGGCAAC GTGCAGGCTC TGATGAATGC GCTTGAAGAC CCGCGCATTG AGCGTGAGCA GGTAAAGACC TTTCCGGGTG CTGCCTTGAT TCTGAATCGT GGATTGGAAG ATGCCATCCG GGTGGTGGAC TCCAAGCTCG ACGCAGGTAA TCCCGAGCAC AGCGCAGATC TGGTAACAGT GAATGTGCTG CTCAAGGGGT ACCGAAAGCT GGTGCGTCAT CAGGGTGTGA AGGAAGCTGC TGACAGTCTG GTTGCCAAGG GCGATCAGAT CCTGGGCGAG GCTCGTGTGG ATGCGGTTGG TCAAGCAATT GATCGGCTGG CAGGCTGTAC CAGCACTTCC GATGCAGTAG CACTCGCGAA GGACCTCTGG ACGGCATTAC AGCCACCCGA ACCGGAGCAA CAACAGCCGC CACCGTCCGA TCCTCAGGAG CAAGACGGTG CCGGCGGAGC TGCTGATGAC AGCCAGGAAA AACAGGATGA GGCTGACGAT GCGGACCAAG ACACGCAAGA GCAGCCAGGT GAGCCTGAAG GTAGCGGGGA TGACGAAAGT CGACCCGGTA CTGATAGCCA GCCTGATAGC ACCCCGCCTG AGTCGAGCGA TGCCGGTGGG GACGCTCCGC AAGGGGGAGA TCCGCAGGAA GGTGCCGATG GCGATCCTAA ATCTGATGGA AATGGCGGTG ACGAGAGTCG CGCAGATTCC AGCGGTTCAC CTGGAGATGC CGCCCAGGAT TCAGCAGATG GGGCTAGCGA GACCGGAGGG GGCAACCCTG AAGGCGGCGC CCCCAAGGAG CCGAACGACG AATCGGGATC TGGCGGCGGT GGCCAGGCCA ATGGTTCTGG TGATCAGGAT GGTGATGGGA AACCATCGCC AGACGCTCAT AACCAAGTAA GCCGGGAAGC TGGTGAAGGC GGAAACGCGC AAGAACCTCA AAAAGGTGAT GGTGGACAGG GCAAAGCCGG TCAGAACGGG AAACTTGATC TGACCTCAGC AATGGGAACC GATCTTGGGG CACTACTGGC CCAGGCCTAC GAGCTCAAAT ACGGGAAACC GGACATTGAT GCACCTGGCC TGGCGGCGCC CGCGCAAACA ACCACAACCA CGGACGATTT CACCCAGCTG GTGGCTACTG CTTTGGAGCG GGCTGCTGAC GATGGTGAAT CGCTGGAGAG GGCGCTGGAG TTGATTGAGG TGGCCTTGGA AGCTGTTTCC GCATCTGAAC AAGGTGAGGG CAGTGAAAAA GAGCCTCAAC TGCTGGGGCT GGCTGCCGGG ATTGGAAATG CCACGGGTAC AGCCATTCCC TTGGACACAT CAGCACGCAT GAGTGGTGCT GTGAGCCGGT TAGTACGGAT CTTCACCAAG GAGCTGCAGG ACAAGCGCCG GCGGACTGTG AAGCTGGCTT CGGCTGGTGG GCAGGTTGCA TCCAACCGCG TATGGCGCCT GAAGGCAATG GGTGACACCA ATGTTTTCAA AGTGACCTCA AGCGTGTGTG GAATCGACGC GGCAGCGACC ATCTTGCTGG ACCGATCGGG TTCCATGAGC CGTTGCATCG TTGAGGCCGC TGGTGCCGCA CTGTCTTGTT CTCAGGCGCT GGAGAGGATT TCGAAGGTGA AAACTTCGAT TGAAATGTTC CCGGGCTATG CGAAATGTGT GGGTAACACG GTGGCCCTGC AGGCGTTTGG CCAGTCCGCG CGGCAAGTGG CGCGCAGGGT CAACGAAGTG GATGCCGAAG GTGGAACGCC TTTGGCTGAA GCCTTACAGG AAGTGATGCC CAGGCTGCTT GCGCAGCGTG TGAAGAAGCG GATCGTGTTT CTGGTGACTG ATGGCATTCC CAACAACCGT CCTGGGGCGC TTGAAGAAAT CGGGAAGGCG GAGAAATTGG GCGTTGAGTT CGTAGGCATC GGTATCGGTG TACACGGCAG AGCGATTGAG GGGCTTACCC CTTTCTCGAT TTGCATCAAT GACGCATCTG AATTACCGGA TGCATTTGAG AAGCTGTTTC GAGGCAACAT TGCCTTGAAG TTGGCAGCTT GA
|
Protein sequence | MGKIASAQRV IAGLMSLILG KKSTVDWGQS AACGESGAIS LPRPKTGDAD EIALLTRLAV HEAGHDKHTD FECIEGLDGN VQALMNALED PRIEREQVKT FPGAALILNR GLEDAIRVVD SKLDAGNPEH SADLVTVNVL LKGYRKLVRH QGVKEAADSL VAKGDQILGE ARVDAVGQAI DRLAGCTSTS DAVALAKDLW TALQPPEPEQ QQPPPSDPQE QDGAGGAADD SQEKQDEADD ADQDTQEQPG EPEGSGDDES RPGTDSQPDS TPPESSDAGG DAPQGGDPQE GADGDPKSDG NGGDESRADS SGSPGDAAQD SADGASETGG GNPEGGAPKE PNDESGSGGG GQANGSGDQD GDGKPSPDAH NQVSREAGEG GNAQEPQKGD GGQGKAGQNG KLDLTSAMGT DLGALLAQAY ELKYGKPDID APGLAAPAQT TTTTDDFTQL VATALERAAD DGESLERALE LIEVALEAVS ASEQGEGSEK EPQLLGLAAG IGNATGTAIP LDTSARMSGA VSRLVRIFTK ELQDKRRRTV KLASAGGQVA SNRVWRLKAM GDTNVFKVTS SVCGIDAAAT ILLDRSGSMS RCIVEAAGAA LSCSQALERI SKVKTSIEMF PGYAKCVGNT VALQAFGQSA RQVARRVNEV DAEGGTPLAE ALQEVMPRLL AQRVKKRIVF LVTDGIPNNR PGALEEIGKA EKLGVEFVGI GIGVHGRAIE GLTPFSICIN DASELPDAFE KLFRGNIALK LAA
|
| |