Gene Rleg_2998 details

Gene Information

Locus tag: Rleg_2998
Symbol:
ID: 8013915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2993905
End bp: 2996280
Gene Length: 2376 bp
Protein Length: 791 aa
Translation table: 11
GC content: 64%
IMG OID: 644825568
Product: Vault protein inter-alpha-trypsin domain protein
Protein accession: YP_002976796
Protein GI: 241205700
COG category: [R] General function prediction only
COG ID: [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
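
The coordinate and length fields above are consistent with one another if the start and end positions are read as 1-based, inclusive coordinates and the final codon is a stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that arithmetic, using the hypothetical helper names `gene_length` and `protein_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2993905, 2996280)
print(length, protein_length(length))  # 2376 791
```

With the values from this record, the sketch reproduces the listed gene length (2376 bp) and protein length (791 aa).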


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.337723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.523415
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCTGG AAGATGAGTT TATCATAGGG CGCATTCGTG CGCGCCGAAT TTCCGTTTCA 
GTCCTTGTTG CCGTCACCGC CTTTGCAGCC TGCATCGCCG CGATGCTGGC GCTTGCCTCC
GCCGCCCGCG CCGCCGAGCC GCAGGCATCC GCACAGCTCG CAGCGCTTGT CCGGCCGAAC
GACGTCAATA GCGGCTCGCT GCTCTTTCCG TCGAAGGAGC CTGGCTTCTA TGTCGAAGCG
CCGCGGCTGA AGACCGATGT CGCCATCGAT GTCTCCGGCC CGATCGCCAG GGTGAAGGTG
ACGCAGCGCT TCCAGAATCC GAGCCAGGGT TGGGTCGAAG GCACCTACGT CTTTCCGCTG
CCGGACAATT CCGCCGTTGA CGCGCTGAAA ATGCAGATCG GCGAACGTTT CATCGAAGGC
CAAATCAAGC CCCGCCAGGA AGCCCGCGAG ATCTACGAGC AAGCCAAGGC CGAAGGCAAA
AAGACGGCGT TGCTCGAACA GCAGCGGCCG AACATCTTCA CCAACCAGGT CGCCAATATC
GGTCCCGGCG AAACCATCGT CGTCCAGATC GAATACCAGC AGACCATCCA CCAGTCGGGT
GGCGAGTTCT CGCTGCGCTT CCCGATGGTC GTCGCCCCGC GGTACAATCC GGCGCCGATC
GTCCAGACTG TCGAGTTCAA CAACGGCGCC GGTTTTGCCA CGCCGCGCGA CCCGGTGGAA
AACCGCGACA AGATCGCTGC CCCCGTGCTT GATCCGCGTG AGAACGCCAG GATCAATCCG
GTTTCGCTGA CCGTCGACCT CCGGGCTGGT TTCCCGCTCG GCGATGTCAA ATCGTCCTTC
CATGCGGTCG ATATCAACCA GGATGGCGAC CAGGCGAGGA CGATCAGCCT GAAGGCGGAC
ACCGTTCCCG CCGACAAGGA TTTCGAGCTC ACCTGGAAGG CCGCCGCCGG CAAGATGCCG
AGTGCCGGCC TCTTCCGCGA AGTGATTGAT GGTAAGACCT ATCTGCTTGC CTTCGTCACC
CCGCCCGCGG CCCCGGACAC GGCAGCGCCG CCGGCAAAAC GCGAGGTGGT CTTTGTCATC
GACAATTCCG GCTCCATGTC CGGCCCGTCG ATCGAGCAGG CCCGCCAGAG CCTGGCGCTT
GCCATCTCCA AGCTGAACCC CGACGACCGC TTCAACGTCA TCCGTTTCGA CGATACGATG
ACTGACTATT TCAAGGGTCT CGTCACTGCC ACCCCTGACA ATCGCGAAAA GGCGATCGGC
TATGTCAGAG GCCTGACCGC CGACGGCGGC ACGGAAATGC TGCCTGCCTT GCAGGCTGCG
CTGCGCAACC AGGGACCGGT CGCAAGCGGA GCGCTGCGCC AGGTCGTGTT CCTGACCGAT
GGCGCGATCG GCAACGAACA GCAGCTTTTC CAGGAAATCA CCGCAAATCG CGGCGATGCC
CGGGTCTTCA CCGTCGGCAT CGGTTCGGCG CCGAACACCT ATTTCATGAC CAAGGCCGCC
GAGATGGGCC GCGGCACCTT CACGGCGATC GGCTCGACCG ATCAGGTGGC AAGCCGCATG
GGCGAGCTTT TCGCCAAGCT GCAGAACCCA GCCATGACCG ATATCGCTGC CACCTTCGAA
GGCATCAAAG CCGAAGATAT CACGCCGAAC CCGATGCCGG ACCTCTATAG CGGTGAGCCC
GTCGTGCTGA CCGCGCAGTT GCCCGAGAAC AATGCCGGCA AGCTGCAGAT CATCGGCAAG
ACAGGCGACC AGCCCTGGCG CGTCGAGATG GATATCGCCA ACGCCGCCGA CGGCAGCGGC
ATTTCCAAGC TCTGGGCGCG CCGCAAGATC GACGATTTCG AGGCCCGCGC CTATGAGCGT
CAGGATCCGG CCGCGCTCGA CAAGGATATC GAGACGGTGG CGCTCGCCCA TCACCTCGTC
TCCCGCGTCA CCAGCCTGGT CGCCGTCGAT GTCACTCCGT CGCGCCCGGC CGATCAGCCG
CTCGGCTCGG CCAAGCTGCC GCTCAACCTG CCGGATGGTT GGGACTTCGA TAAAGTCTCC
GGCGAAAACG CTGCCCCTCT TGGCGGCGCG GAACGCCATG GCTCGGCTAC GCCGGCTGGA
AACGCCGGAC CGGAGCAGGC CGAAACACAG GCACTTGTCG CATCGCCTGA GATCGCAAAC
ATGATGGCCG CAGCCCCGAC TGCCAAGGCG GCCACCATGA TCGCGCAGAA GAGCTCGACC
GTGAACCTGC CGCAGACGGC GACGCGCGCC GACGAGCAGA TCATCCGCGG GCTTACCATG
CTGCTCCTGG CGCTGACGGC GGCAAGCGGG CTGGCCGTCT GGCGGCGGCG CCTCAAGCGC
ATTATCACGG TCGGAGCCGA GCGCGATGGT CTCTAG
 
Protein sequence
MFLEDEFIIG RIRARRISVS VLVAVTAFAA CIAAMLALAS AARAAEPQAS AQLAALVRPN 
DVNSGSLLFP SKEPGFYVEA PRLKTDVAID VSGPIARVKV TQRFQNPSQG WVEGTYVFPL
PDNSAVDALK MQIGERFIEG QIKPRQEARE IYEQAKAEGK KTALLEQQRP NIFTNQVANI
GPGETIVVQI EYQQTIHQSG GEFSLRFPMV VAPRYNPAPI VQTVEFNNGA GFATPRDPVE
NRDKIAAPVL DPRENARINP VSLTVDLRAG FPLGDVKSSF HAVDINQDGD QARTISLKAD
TVPADKDFEL TWKAAAGKMP SAGLFREVID GKTYLLAFVT PPAAPDTAAP PAKREVVFVI
DNSGSMSGPS IEQARQSLAL AISKLNPDDR FNVIRFDDTM TDYFKGLVTA TPDNREKAIG
YVRGLTADGG TEMLPALQAA LRNQGPVASG ALRQVVFLTD GAIGNEQQLF QEITANRGDA
RVFTVGIGSA PNTYFMTKAA EMGRGTFTAI GSTDQVASRM GELFAKLQNP AMTDIAATFE
GIKAEDITPN PMPDLYSGEP VVLTAQLPEN NAGKLQIIGK TGDQPWRVEM DIANAADGSG
ISKLWARRKI DDFEARAYER QDPAALDKDI ETVALAHHLV SRVTSLVAVD VTPSRPADQP
LGSAKLPLNL PDGWDFDKVS GENAAPLGGA ERHGSATPAG NAGPEQAETQ ALVASPEIAN
MMAAAPTAKA ATMIAQKSST VNLPQTATRA DEQIIRGLTM LLLALTAASG LAVWRRRLKR
IITVGAERDG L
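
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The following is a rough Python sketch, not the database's own method. It assumes the sequence is pasted in as text with the spaces and line breaks shown above, and it uses the standard codon assignments, which translation table 11 shares with the standard genetic code for amino acids and stop codons (the two tables differ in which start codons are permitted). The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each position,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGTTTCTGG AAGATGAGTT TATCATAGGG"
print(translate(first_block))  # MFLEDEFIIG
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the protein sequence and the GC content listed in this record.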