Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2675 |
Symbol | |
ID | 4897513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2821132 |
End bp | 2823087 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640113276 |
Product | von Willebrand factor, type A |
Protein accession | YP_001044549 |
Protein GI | 126463435 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.260015 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG ACCGCACCGA CGACGATCTC GAGGCTCTGC GCCTGGCACT CGCGCGCGAG GCGGCTCCGG CGCCCGATCC CGAGGCTCGG GCACGGGCCA TGGCGCTGGC GCTCGAGAAT TACGACAGGT TCCAAGGACG GGCGAGCGGG CTGCGTCAGG GGAAGGACCG CCCGGCCGGG GCGGGGTTCC TCAACGGAGT GCGCCGCATG TCGGGTTCCT TCTTCTCCCG CCCCCTTCTC GCCGCCACCG GATCGGTCGC GGCTCTGGGC CTCGCCCTCG TCGTGGTGAT GCCGAACGCG CGGCTGGCCG AGCCGCCGCA GACCGCGCCG GACGCGCCCG AGGCGGATGC CCGCCTCACC GCCGCGCCGG AGGCGGGTGG CGGGGCCGAG ACGGCCGGTG CGCCGGTTCC CGCGGAACCG CGGGCGCGGA GTGCCGAGGG CGCCGCCCCG CAGACCTTCG CCGCCGACGA AGCGATGCCC ATGGCTGCGC CGCCCGCGCC GGATCTCGCC CTCTCGAAGC AGGCCGCCGA AGCCCCCGCG CGCGCCCTTC CGCAGGGCGA CAGCGAGGCC TTCGCGAATG CGCCCGACAA TCCGCTCCGC GTGACCGCCG AGGATCCCGT CTCGACCTTC TCGATCGACG TGGATACGGC GAGCTACGCG ATCCTCCGCT CGAGCCTGCG GGCCGGGCAG CTTCCCCCGC GCGAGGCGGT GCGGATCGAG GAGATGATCA ACTACTTCCC CTACGACTAC CCGGCGCCGG AGAACGGAAC GCCGCCCTTC CGCCCCACCC TCTCCATCAC CCGGACGCCC TGGAACCCGG AGACGCGGCT CGTCCATGTG GCGCTGCAGG GCCGGATGCC CGCCATCGAG GACCGGCCGC CGCTGAACCT CGTCTTCCTG ATCGACACAT CGGGCTCGAT GCAGGATCCG GCGAAGCTGC CGCTCCTCAA GCAGTCGTTC GGGCTGATGC TCGGCCGCCT GCGCCCCGAG GATCAGGTGG CCATCGTGAC CTATGCCGGC TCGGCGGGCG AGGTGCTGGC GCCCACGGCT GCGAACCAGC GCAGCACCAT CCTCTCCGCC CTCGACCGGC TCGATGCCGG CGGATCGACC GCGGGGGACG AGGGGCTGGC GCTCGCCTAC CGGACGGCTT CGGAAATGGC GGGCGCGGGC GAGGTCACGC GCGTGGTGCT GGCCACCGAC GGAGACTTCA ACCTCGGGAT CAGCGACCCG GAAGAGCTGG CCCGCCTCGT GGCGCACGAG CGCGACACCG GCGTCTATCT CTCGGTGCTG GGCTTCGGGC GCGGCAATCT CGACGATGCG ACGATGCAGG CGCTGGCGCA GAACGGCAAC GGGCAGGCCG CCTATATCGA CAGTCTGAAC GAGGCGCAGA AGGTTCTGGT CGACCAGCTC AGCGGCGCGC TCTTCCCCAT TGCCGACGAT GTGAAGGTGC AGGTGGAGTG GAGCCCGGCC CGCGTCGCGG AATACCGGCT CATCGGCTAC GAGACCCGCG GCCTGCGCCG CGAGGATTTC GCCAACGACC GGGTCGATGC GGGCGAGATC GGCGCCGGCC ATTCGGTGAC GGCGATCTAC GAGATCACGC CGGTGGACAG CCCCGCGCGC CTGACCGATC CCCTGCGCTA CGGCGCCGAA CCGCCCGAGG GCGCGCATGG CGATGAACTG GGCTTCCTGC GGCTGCGCTA CAAGGCGCCG GGGGAAAGCA CATCGACCCT GATCGACACG CCGATCCCGG ACATGCTGAC CGAGGCTTCC GAGGACGTGC GCTTCTCCAC CGCCATCGCG GGCTTCGGCG AGCTTCTGCG CGGGTCGGAC AAGCTCGGCG CCTGGGGCTG GGACGAGGCC ATCGCCTTGG CCGACGGGGC GCGCGGGGCC GATCCCTTCG GCTACCGGGT GGAGGCCGTC CAGCTGATGC GCCTGGCCGA GAGCCTCAGC CGCTGA
|
Protein sequence | MSDDRTDDDL EALRLALARE AAPAPDPEAR ARAMALALEN YDRFQGRASG LRQGKDRPAG AGFLNGVRRM SGSFFSRPLL AATGSVAALG LALVVVMPNA RLAEPPQTAP DAPEADARLT AAPEAGGGAE TAGAPVPAEP RARSAEGAAP QTFAADEAMP MAAPPAPDLA LSKQAAEAPA RALPQGDSEA FANAPDNPLR VTAEDPVSTF SIDVDTASYA ILRSSLRAGQ LPPREAVRIE EMINYFPYDY PAPENGTPPF RPTLSITRTP WNPETRLVHV ALQGRMPAIE DRPPLNLVFL IDTSGSMQDP AKLPLLKQSF GLMLGRLRPE DQVAIVTYAG SAGEVLAPTA ANQRSTILSA LDRLDAGGST AGDEGLALAY RTASEMAGAG EVTRVVLATD GDFNLGISDP EELARLVAHE RDTGVYLSVL GFGRGNLDDA TMQALAQNGN GQAAYIDSLN EAQKVLVDQL SGALFPIADD VKVQVEWSPA RVAEYRLIGY ETRGLRREDF ANDRVDAGEI GAGHSVTAIY EITPVDSPAR LTDPLRYGAE PPEGAHGDEL GFLRLRYKAP GESTSTLIDT PIPDMLTEAS EDVRFSTAIA GFGELLRGSD KLGAWGWDEA IALADGARGA DPFGYRVEAV QLMRLAESLS R
|
| |