Gene Rleg2_1580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1580 
Symbol 
ID6980316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1604750 
End bp1606318 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content62% 
IMG OID643396305 
Productvon Willebrand factor type A 
Protein accessionYP_002281096 
Protein GI209549179 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.743111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCAC TGCTTTCAGG CCTGGCGCTC CTTGCGGCTC TCGCCCTTGC CGGCTGCGAT 
CCCTTCGGCA AGGGGCCGGA CTTCTCGATC GTCTCCGGAT CGGAAAATAC CGTTCTGCAG
CCGATCGTCG AGGAATTCTG CAAGCAGAAG AATGCGACCT GCACCTTCAA ATATGAAGGC
ACGCTCGATA TCGGCCTGGC GCTGCAGAGC GATCAGGGCG TCGAGCAGGA TGCGGTCTGG
CCGGCCTCCA GCGTCTGGGT CGACATGTTC GACACCAAGC GCCGCGTCAA GAGCCTGACT
TCGATCGCCC AGACGCCGGT GGTCCTGGGT GTGCGCAAGT CGAAGGCCGA GCAGCTCGGC
TGGATCGGCA GGGATGTGTT CATGAAGGAT ATTCTGGCGG CCGTCGAAAA CGGATCGCTG
AAATTCCTGA TGACCTCGGC GACGCAATCA AACTCCGGCG CCAGCGCCTA TCTCGCCATG
CTGTCGAGTG CGCTCGGCAA CAAGCCGGTG ATCGAGCCCG GCGATCTCGA CGACAAGGGT
GTCCAGGAGA GCGTCCGGTC GCTGCTTTCG GGCGTCATGC GCTCTTCCGG CTCTTCCGGC
TGGCTTGCCG ATCTCTACGT CGAATCCGCC GGCAAGGGCA CTGTCTACGA CGGCATGTGG
AACTACGAGG CGGTGCTGAA GGAAACCAAC GACAAGCTTG CCGCCTTGTC GCAGGAACCG
CTTTACGCGA TCTACCCGGC CGATGGCGTG GCCATGGCGG ATTCGCCGCT CGGTTTCGTC
GATCATGGAC GCGGGCCTGA AGCCCAGACT TTCTTCAACG ATCTGCTCGC CTATCTCCGT
TCGGCCTCGG CGCAGCAGCG TATCGCTGAT ACCGGCCGGC GCATTCCGCT CAGCGGCGTT
GCCGCAAAAC CGGAGCCTGG CTGGAATTTC GATCCCGCCC GGCTTGTGAC GGCCATCCGA
ATGCCGGAGC CGGAGGTCAT CCGCCAGGCG CTCACTCTTT ATCAGGCCGC GCTGCGCAAA
CCGTCCTTGA CGGCGCTCTG CCTCGATTTC TCAGGCTCGA TGCAGGGCGA CGGCGAGGAC
CAGCTGCAGA AGGCGATGCG TTTCCTCCTG ACACCTGACG AGGCGAGCAA GGTGCTGGTG
CAATGGTCGC CCGCCGACCG GATCATCGTC ATTCCTTTCG ACGGCAGCGT GCGCAACACC
TTCATGGCAA GCGGAAACCC GTTGGAGCAG GAAGGGCTGC TGAACGAGAT TTCCCGGCAG
AAGGCCGGCG GCGGCACGGA CATGTATACC TGCGCCGCAC AGGCTCTCCA GCAGATTGCC
CGAAGCGACA GGCTCTCGAC GTATCTGCCT GCCATCGTCA TCATGACCGA CGGAAGGTCC
GACGATCAAA GCCAGGCTTT CATGAGCGAA TGGAACGCGA CAGAGCCGCA TGTGCCGGTG
TTCGGCATCA CATTCGGCGA CGCCGACAAG ACACAGCTCG ATAGCCTTGC CAAGCAGACT
TCGGCGCGCG TGTTCGACGG CGGTTCGAAT CTCGCCACCG CTTTCCGCAC CGCACGCGGC
TATAATTAG
 
Protein sequence
MRALLSGLAL LAALALAGCD PFGKGPDFSI VSGSENTVLQ PIVEEFCKQK NATCTFKYEG 
TLDIGLALQS DQGVEQDAVW PASSVWVDMF DTKRRVKSLT SIAQTPVVLG VRKSKAEQLG
WIGRDVFMKD ILAAVENGSL KFLMTSATQS NSGASAYLAM LSSALGNKPV IEPGDLDDKG
VQESVRSLLS GVMRSSGSSG WLADLYVESA GKGTVYDGMW NYEAVLKETN DKLAALSQEP
LYAIYPADGV AMADSPLGFV DHGRGPEAQT FFNDLLAYLR SASAQQRIAD TGRRIPLSGV
AAKPEPGWNF DPARLVTAIR MPEPEVIRQA LTLYQAALRK PSLTALCLDF SGSMQGDGED
QLQKAMRFLL TPDEASKVLV QWSPADRIIV IPFDGSVRNT FMASGNPLEQ EGLLNEISRQ
KAGGGTDMYT CAAQALQQIA RSDRLSTYLP AIVIMTDGRS DDQSQAFMSE WNATEPHVPV
FGITFGDADK TQLDSLAKQT SARVFDGGSN LATAFRTARG YN