Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4506 |
Symbol | |
ID | 6977600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 143444 |
End bp | 145540 |
Gene Length | 2097 bp |
Protein Length | 698 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393684 |
Product | von Willebrand factor type A |
Protein accession | YP_002278502 |
Protein GI | 209546584 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.407878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGACA AGGAACTCGA AAAACTCTCC CGCCTTACCC CGCCGGCAGC CGATCCGGAG GCGCGAGCAC GCGCCCTTGC CGCGGCGATG CAGGCCTTCG ACAGCGCAGA AAATAATGCA GCGACAGCCC AAGGAAATGC GAAAGGCTGG CGTCCAAGCT CCATCATCAA CTGGATATGG AGCTCTGCTG TGAATAAGAA ATTTCTCGCC GGTTCAGCCC TTGCGACGCT GCTCGTCATT CCCGCCGCCG GCTATCTCAC CCTCGAGCTG GCTCGCAACC AACCGATCGT CGACCAGGAA AAGATCGCCG GCACTGTTTT CAAAAATGAC ATGCCGAAAT CGGCCCAACC CAGGCTTTCC GAAGCGCCAG CCGAAGAGAA TCGGCCGGCT CAATCGTCGG CGGCTGATAG CACTCAAGTC TTGCAGGACA AGCAGCCACA GGCCGCAAAG TCCGCTGCGG AGCTGCGCGC TGACTTCGAT GCCGGCGAGA TCGCCACTCT TAAGAACAAG TCGGAGGATT CCGCCGCGGC ACTTGGCATG GCAAAACGGG CTGCGCCGGC TGCTCCCGGC GTGGTTGCTC AGGGCCAGCT ATTGGCTGAG CCGATGGCCG TTGCCCCCTC ACCTGTTCCG CCGGCAGATG GGCATATGCA GATTCAGCTC GATCCCAGCC GCGAACGCTT TGCCAATGCT GCGGCAAATC CGATCAAGAG TGTGGCGACC GATCCGGTCT CGACCTTCTC GGCCGATGTC GACAGTGCCT CCTATTCCTT CGTCCGCCGG TCGTTGACGG GCGGGGCGAT GCCGGATCCG CAATCGGTTC GTGTCGAGGA GATGATCAAT TATTTCCCCT ATGACTGGGC GGGTCCTGAG AAGGCCGATC AGCCCTTCAA GGCGACCGTG ACGGTGATGC CGACACCGTG GAATCACGAC ACGGAACTGA TGCATGTGGC GATCAAAGGC TATGACATTG CGCCGGCGAC CGCGCCGCAT GCCAATCTCG TCTTCCTGAT CGACGTCTCG GGCTCGATGG ACGAGCCGGA CAAGCTGCCG CTGCTGAAAA GCGCCTTCCG TCTTCTGGTC AGCAAGCTGA AGGCCGACGA TACAGTCTCG ATCGTCACCT ATGCCGGCAA TGCCGGCACG GTGCTCGAGC CGACGCGGGT GGCGGAGAAA TCGAAGATCC TCTCGGCAAT CGACAGGCTG GAGGCCGGAG GCTCGACCGG CGGCGCCGAA GGCATCGAGG CGGCCTATAA TCTTGCCAAA CAGGCCTTCG TCAAGGACGG CGTCAACCGG GTGATGCTGG CGACGGACGG CGACTTCAAT GTCGGCCCGT CGAGCGACGA GGATCTGAAG CGCATCATCG AGGAGAAGCG CAAGGACGGC ATCTTCCTCA CCGTTCTCGG CTTCGGGCGC GGCAATCTCA ACGATTCCCT GATGCAGACG CTGGCGCAGA ACGGCAATGG CAGTGCCGCC TATATCGACA CGCTGGCGGA GGCGCAGAAG ACGCTGGTCG AAGAGGCCGG GTCGACGCTG TTTCCGATCG CCAAGGACGT CAAGTTCCAG GTCGAGTTCA ACCCGGAACG GATCGCCGAA TACCGGCTGA TCGGCTACGA GACGCGCGCC CTCAACCGCG AGGATTTCAA CAATGACCGT GTCGATGCCG GCGATATCGG CTCCGGCCAC AGCGTCACGG CGATCTACGA GATCACGCCC AAGGGAAGCC CCGCGGTCAT GAACGACGAC CTGCGTTACG GCGCAGCCGG CAAGGCGTCG GCCGAGACAT CGGACGGCAC GCATCAGGGC GAGCTTGCCT TCGTCAAGAT GCGTTACAAG CGGCCGGGCG AGGACAAGAG CGCGCTGATC ACCACGCCTG TCGGCGACGG CAACACGGTC GCCACCGTCG ACGCCGCGCC CGGGGACGTC CGCTTCTCGG TGGCCGTCGC CGCCTTCGGC CAGAAACTGA GCCGGATCAC GGCGCTCGAT GCCTATTCCT ATCAGGCGAT TGCCAATCTC GCCGCGGCAT CGCGAGGCAC TGATCCCTTC GGCTACAGAT CGGATTTCCT CGGTCTCGTC CGGTTGGCTG ATGGACTTAG CCAGTGA
|
Protein sequence | MMDKELEKLS RLTPPAADPE ARARALAAAM QAFDSAENNA ATAQGNAKGW RPSSIINWIW SSAVNKKFLA GSALATLLVI PAAGYLTLEL ARNQPIVDQE KIAGTVFKND MPKSAQPRLS EAPAEENRPA QSSAADSTQV LQDKQPQAAK SAAELRADFD AGEIATLKNK SEDSAAALGM AKRAAPAAPG VVAQGQLLAE PMAVAPSPVP PADGHMQIQL DPSRERFANA AANPIKSVAT DPVSTFSADV DSASYSFVRR SLTGGAMPDP QSVRVEEMIN YFPYDWAGPE KADQPFKATV TVMPTPWNHD TELMHVAIKG YDIAPATAPH ANLVFLIDVS GSMDEPDKLP LLKSAFRLLV SKLKADDTVS IVTYAGNAGT VLEPTRVAEK SKILSAIDRL EAGGSTGGAE GIEAAYNLAK QAFVKDGVNR VMLATDGDFN VGPSSDEDLK RIIEEKRKDG IFLTVLGFGR GNLNDSLMQT LAQNGNGSAA YIDTLAEAQK TLVEEAGSTL FPIAKDVKFQ VEFNPERIAE YRLIGYETRA LNREDFNNDR VDAGDIGSGH SVTAIYEITP KGSPAVMNDD LRYGAAGKAS AETSDGTHQG ELAFVKMRYK RPGEDKSALI TTPVGDGNTV ATVDAAPGDV RFSVAVAAFG QKLSRITALD AYSYQAIANL AAASRGTDPF GYRSDFLGLV RLADGLSQ
|
| |