Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2998 |
Symbol | |
ID | 8013915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2993905 |
End bp | 2996280 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644825568 |
Product | Vault protein inter-alpha-trypsin domain protein |
Protein accession | YP_002976796 |
Protein GI | 241205700 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.337723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.523415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCTGG AAGATGAGTT TATCATAGGG CGCATTCGTG CGCGCCGAAT TTCCGTTTCA GTCCTTGTTG CCGTCACCGC CTTTGCAGCC TGCATCGCCG CGATGCTGGC GCTTGCCTCC GCCGCCCGCG CCGCCGAGCC GCAGGCATCC GCACAGCTCG CAGCGCTTGT CCGGCCGAAC GACGTCAATA GCGGCTCGCT GCTCTTTCCG TCGAAGGAGC CTGGCTTCTA TGTCGAAGCG CCGCGGCTGA AGACCGATGT CGCCATCGAT GTCTCCGGCC CGATCGCCAG GGTGAAGGTG ACGCAGCGCT TCCAGAATCC GAGCCAGGGT TGGGTCGAAG GCACCTACGT CTTTCCGCTG CCGGACAATT CCGCCGTTGA CGCGCTGAAA ATGCAGATCG GCGAACGTTT CATCGAAGGC CAAATCAAGC CCCGCCAGGA AGCCCGCGAG ATCTACGAGC AAGCCAAGGC CGAAGGCAAA AAGACGGCGT TGCTCGAACA GCAGCGGCCG AACATCTTCA CCAACCAGGT CGCCAATATC GGTCCCGGCG AAACCATCGT CGTCCAGATC GAATACCAGC AGACCATCCA CCAGTCGGGT GGCGAGTTCT CGCTGCGCTT CCCGATGGTC GTCGCCCCGC GGTACAATCC GGCGCCGATC GTCCAGACTG TCGAGTTCAA CAACGGCGCC GGTTTTGCCA CGCCGCGCGA CCCGGTGGAA AACCGCGACA AGATCGCTGC CCCCGTGCTT GATCCGCGTG AGAACGCCAG GATCAATCCG GTTTCGCTGA CCGTCGACCT CCGGGCTGGT TTCCCGCTCG GCGATGTCAA ATCGTCCTTC CATGCGGTCG ATATCAACCA GGATGGCGAC CAGGCGAGGA CGATCAGCCT GAAGGCGGAC ACCGTTCCCG CCGACAAGGA TTTCGAGCTC ACCTGGAAGG CCGCCGCCGG CAAGATGCCG AGTGCCGGCC TCTTCCGCGA AGTGATTGAT GGTAAGACCT ATCTGCTTGC CTTCGTCACC CCGCCCGCGG CCCCGGACAC GGCAGCGCCG CCGGCAAAAC GCGAGGTGGT CTTTGTCATC GACAATTCCG GCTCCATGTC CGGCCCGTCG ATCGAGCAGG CCCGCCAGAG CCTGGCGCTT GCCATCTCCA AGCTGAACCC CGACGACCGC TTCAACGTCA TCCGTTTCGA CGATACGATG ACTGACTATT TCAAGGGTCT CGTCACTGCC ACCCCTGACA ATCGCGAAAA GGCGATCGGC TATGTCAGAG GCCTGACCGC CGACGGCGGC ACGGAAATGC TGCCTGCCTT GCAGGCTGCG CTGCGCAACC AGGGACCGGT CGCAAGCGGA GCGCTGCGCC AGGTCGTGTT CCTGACCGAT GGCGCGATCG GCAACGAACA GCAGCTTTTC CAGGAAATCA CCGCAAATCG CGGCGATGCC CGGGTCTTCA CCGTCGGCAT CGGTTCGGCG CCGAACACCT ATTTCATGAC CAAGGCCGCC GAGATGGGCC GCGGCACCTT CACGGCGATC GGCTCGACCG ATCAGGTGGC AAGCCGCATG GGCGAGCTTT TCGCCAAGCT GCAGAACCCA GCCATGACCG ATATCGCTGC CACCTTCGAA GGCATCAAAG CCGAAGATAT CACGCCGAAC CCGATGCCGG ACCTCTATAG CGGTGAGCCC GTCGTGCTGA CCGCGCAGTT GCCCGAGAAC AATGCCGGCA AGCTGCAGAT CATCGGCAAG ACAGGCGACC AGCCCTGGCG CGTCGAGATG GATATCGCCA ACGCCGCCGA CGGCAGCGGC ATTTCCAAGC TCTGGGCGCG CCGCAAGATC GACGATTTCG AGGCCCGCGC CTATGAGCGT CAGGATCCGG CCGCGCTCGA CAAGGATATC GAGACGGTGG CGCTCGCCCA TCACCTCGTC TCCCGCGTCA CCAGCCTGGT CGCCGTCGAT GTCACTCCGT CGCGCCCGGC CGATCAGCCG CTCGGCTCGG CCAAGCTGCC GCTCAACCTG CCGGATGGTT GGGACTTCGA TAAAGTCTCC GGCGAAAACG CTGCCCCTCT TGGCGGCGCG GAACGCCATG GCTCGGCTAC GCCGGCTGGA AACGCCGGAC CGGAGCAGGC CGAAACACAG GCACTTGTCG CATCGCCTGA GATCGCAAAC ATGATGGCCG CAGCCCCGAC TGCCAAGGCG GCCACCATGA TCGCGCAGAA GAGCTCGACC GTGAACCTGC CGCAGACGGC GACGCGCGCC GACGAGCAGA TCATCCGCGG GCTTACCATG CTGCTCCTGG CGCTGACGGC GGCAAGCGGG CTGGCCGTCT GGCGGCGGCG CCTCAAGCGC ATTATCACGG TCGGAGCCGA GCGCGATGGT CTCTAG
|
Protein sequence | MFLEDEFIIG RIRARRISVS VLVAVTAFAA CIAAMLALAS AARAAEPQAS AQLAALVRPN DVNSGSLLFP SKEPGFYVEA PRLKTDVAID VSGPIARVKV TQRFQNPSQG WVEGTYVFPL PDNSAVDALK MQIGERFIEG QIKPRQEARE IYEQAKAEGK KTALLEQQRP NIFTNQVANI GPGETIVVQI EYQQTIHQSG GEFSLRFPMV VAPRYNPAPI VQTVEFNNGA GFATPRDPVE NRDKIAAPVL DPRENARINP VSLTVDLRAG FPLGDVKSSF HAVDINQDGD QARTISLKAD TVPADKDFEL TWKAAAGKMP SAGLFREVID GKTYLLAFVT PPAAPDTAAP PAKREVVFVI DNSGSMSGPS IEQARQSLAL AISKLNPDDR FNVIRFDDTM TDYFKGLVTA TPDNREKAIG YVRGLTADGG TEMLPALQAA LRNQGPVASG ALRQVVFLTD GAIGNEQQLF QEITANRGDA RVFTVGIGSA PNTYFMTKAA EMGRGTFTAI GSTDQVASRM GELFAKLQNP AMTDIAATFE GIKAEDITPN PMPDLYSGEP VVLTAQLPEN NAGKLQIIGK TGDQPWRVEM DIANAADGSG ISKLWARRKI DDFEARAYER QDPAALDKDI ETVALAHHLV SRVTSLVAVD VTPSRPADQP LGSAKLPLNL PDGWDFDKVS GENAAPLGGA ERHGSATPAG NAGPEQAETQ ALVASPEIAN MMAAAPTAKA ATMIAQKSST VNLPQTATRA DEQIIRGLTM LLLALTAASG LAVWRRRLKR IITVGAERDG L
|
| |