Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1580 |
Symbol | |
ID | 6980316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1604750 |
End bp | 1606318 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396305 |
Product | von Willebrand factor type A |
Protein accession | YP_002281096 |
Protein GI | 209549179 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.743111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGCAC TGCTTTCAGG CCTGGCGCTC CTTGCGGCTC TCGCCCTTGC CGGCTGCGAT CCCTTCGGCA AGGGGCCGGA CTTCTCGATC GTCTCCGGAT CGGAAAATAC CGTTCTGCAG CCGATCGTCG AGGAATTCTG CAAGCAGAAG AATGCGACCT GCACCTTCAA ATATGAAGGC ACGCTCGATA TCGGCCTGGC GCTGCAGAGC GATCAGGGCG TCGAGCAGGA TGCGGTCTGG CCGGCCTCCA GCGTCTGGGT CGACATGTTC GACACCAAGC GCCGCGTCAA GAGCCTGACT TCGATCGCCC AGACGCCGGT GGTCCTGGGT GTGCGCAAGT CGAAGGCCGA GCAGCTCGGC TGGATCGGCA GGGATGTGTT CATGAAGGAT ATTCTGGCGG CCGTCGAAAA CGGATCGCTG AAATTCCTGA TGACCTCGGC GACGCAATCA AACTCCGGCG CCAGCGCCTA TCTCGCCATG CTGTCGAGTG CGCTCGGCAA CAAGCCGGTG ATCGAGCCCG GCGATCTCGA CGACAAGGGT GTCCAGGAGA GCGTCCGGTC GCTGCTTTCG GGCGTCATGC GCTCTTCCGG CTCTTCCGGC TGGCTTGCCG ATCTCTACGT CGAATCCGCC GGCAAGGGCA CTGTCTACGA CGGCATGTGG AACTACGAGG CGGTGCTGAA GGAAACCAAC GACAAGCTTG CCGCCTTGTC GCAGGAACCG CTTTACGCGA TCTACCCGGC CGATGGCGTG GCCATGGCGG ATTCGCCGCT CGGTTTCGTC GATCATGGAC GCGGGCCTGA AGCCCAGACT TTCTTCAACG ATCTGCTCGC CTATCTCCGT TCGGCCTCGG CGCAGCAGCG TATCGCTGAT ACCGGCCGGC GCATTCCGCT CAGCGGCGTT GCCGCAAAAC CGGAGCCTGG CTGGAATTTC GATCCCGCCC GGCTTGTGAC GGCCATCCGA ATGCCGGAGC CGGAGGTCAT CCGCCAGGCG CTCACTCTTT ATCAGGCCGC GCTGCGCAAA CCGTCCTTGA CGGCGCTCTG CCTCGATTTC TCAGGCTCGA TGCAGGGCGA CGGCGAGGAC CAGCTGCAGA AGGCGATGCG TTTCCTCCTG ACACCTGACG AGGCGAGCAA GGTGCTGGTG CAATGGTCGC CCGCCGACCG GATCATCGTC ATTCCTTTCG ACGGCAGCGT GCGCAACACC TTCATGGCAA GCGGAAACCC GTTGGAGCAG GAAGGGCTGC TGAACGAGAT TTCCCGGCAG AAGGCCGGCG GCGGCACGGA CATGTATACC TGCGCCGCAC AGGCTCTCCA GCAGATTGCC CGAAGCGACA GGCTCTCGAC GTATCTGCCT GCCATCGTCA TCATGACCGA CGGAAGGTCC GACGATCAAA GCCAGGCTTT CATGAGCGAA TGGAACGCGA CAGAGCCGCA TGTGCCGGTG TTCGGCATCA CATTCGGCGA CGCCGACAAG ACACAGCTCG ATAGCCTTGC CAAGCAGACT TCGGCGCGCG TGTTCGACGG CGGTTCGAAT CTCGCCACCG CTTTCCGCAC CGCACGCGGC TATAATTAG
|
Protein sequence | MRALLSGLAL LAALALAGCD PFGKGPDFSI VSGSENTVLQ PIVEEFCKQK NATCTFKYEG TLDIGLALQS DQGVEQDAVW PASSVWVDMF DTKRRVKSLT SIAQTPVVLG VRKSKAEQLG WIGRDVFMKD ILAAVENGSL KFLMTSATQS NSGASAYLAM LSSALGNKPV IEPGDLDDKG VQESVRSLLS GVMRSSGSSG WLADLYVESA GKGTVYDGMW NYEAVLKETN DKLAALSQEP LYAIYPADGV AMADSPLGFV DHGRGPEAQT FFNDLLAYLR SASAQQRIAD TGRRIPLSGV AAKPEPGWNF DPARLVTAIR MPEPEVIRQA LTLYQAALRK PSLTALCLDF SGSMQGDGED QLQKAMRFLL TPDEASKVLV QWSPADRIIV IPFDGSVRNT FMASGNPLEQ EGLLNEISRQ KAGGGTDMYT CAAQALQQIA RSDRLSTYLP AIVIMTDGRS DDQSQAFMSE WNATEPHVPV FGITFGDADK TQLDSLAKQT SARVFDGGSN LATAFRTARG YN
|
| |