Gene Rleg_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1944 
Symbol 
ID8012984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1933972 
End bp1935837 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content58% 
IMG OID644824533 
Producthypothetical protein 
Protein accessionYP_002975765 
Protein GI241204669 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.69459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAG CCGTTAGAAT AAACGATATT ATTCGTTCCT TTGGAATTGA TACGCATATC 
GACTACACAG ATGGAAAATA TTCCAACGTC GGAGAAGTTG TTAAAGCACT CGACTATCTT
GGCCTTGATA CAGTTCGCGA TCACGCCCCC AACTCCGCTT CCGATCCCAA CGGCCAAACG
CATCTCGGCG ATGCTGCCGA GGCCGGCGTG CAATTCGTCT TCAGCGCCCA ACGGGAAGTC
GACCCCGCCA CTGTCGCCCA GCGGCTGCAT TCCTTCGTGC AGGCCCATCC AGGATCGGTC
GTCGGTATCG AAGGTCCGAA CGAAGTCAAC AACTGGCCAG TCAGCTATCA CGGCCTGAGC
GGCCAAGCCG CAGCGCTCGC CTATCAGAAG GACCTGTCTG CCGACGTCAA CGCCGATCCC
TTGCTGAAAA ATATCCCCGT TCTCGGCTTT ACCGGATATA CCGTGGCTTC CGCCTCCGAC
TACACGACGA TCCACACCTA TGCGAAGGAT GGCGACCAGC CATATTCATG GCTCTCCCGA
GAATCCGGCG TGCAGCGCGC TGCCGATCCG GGCAAGCCGC TGGCGATCAC CGAGACCGGC
TACCACACCT CGCTGACCGC CGACACCAAT GGCGGCTGGG AAGGCGTCAG TGAAGCGACG
CAGGCAAAGC TCCTGCTCAA TACGCTGATG GACGGCGCCG CACTCGGATC AAAACAGACG
TTCATCTACG AGCTGCTGGA CGCCTATTCC GATCCGCAGG GCACCAATCA GGAAAAGCAT
TTCGGCCTTT TTCATCTCGA CTATTCGGCC AAGCCGGCTG CGACGGCGAT CCACAATCTG
ACCGAAATCC TTGCGGATGA CGGCGCCGCG AAGGCAAGCT TCAGCGCGGG GACCCTCAAT
TATTCGATCG ACGGCCTGCC GTCCTCGGCC CGGAGCCTGC TGACGGAAAA ATCGGACGGA
AGCTACCAGA TCATCGTCTG GAACGAGCCC GATATCTGGA GCCAGTCCTC CGACACGGTT
ATTCAGGCCA CGACAACAGC CGTCAAAGTC AATCTCGGGG CCTCGTTTGG CTCCGTTAAG
GTCTTCGACC CGGTGACAGG AACGACGGCG ATCAAAAGCC TCAGCAACGT GTCGTCGCTG
CCGCTCGATG TCGTCGACCA TCCCTTGATC ATCGAGGTAG CAGGCACCGG CGCCAGCACA
CCGCCGCCGG CCACCAACCA TCTCTATGGC GGCACCGGTA ACGACACCTT CACCGTGACC
AATGCAAATC AAATCGTCGA CGAAAGCCGG GGCGGTGGAA CAGATACCGT CAAGGCTTCG
ATCTCCTTCA GCCTGGCCGA TCAGAAGCAT ACGGTCGGAA CGATCGAAAA CCTCACTTTG
ACCGGGACGG GCAATCTCAG CGCGACGGGC AACAATACGG CCAACATTCT CACCGGCAAC
GACGGAAACA ATTCCCTCAA CGGCGGGAAA GGAAACGACC GATTGATCGG CGGGCTCGGA
AACGACAAGC TGATCGGCAA GGCCGGTGCT GACGTTCTCA CCGGCGGCGG CGGCAGCGAT
TCCTTCGTCT TCGATGTGAA GCCCGACAAT ACCAGCGTCG ACAAGATCCG GGATTTTTCC
TCCGCGGCGG GCGACAAGCT GATGCTCGAT CATTCGATTT TCGCCGAGCT TAGCCTATCC
GGATTTTCGG ATGAGAATTT CGTTTTGGGA AGGAAAGCGC TCGAGGCTGA TGACAAGCTG
ATCTACGATC AGGCGAGCGG CATTCTATCC TATGACGCGG ATGGAAGCGC GGCGGGCGCG
GCCATCCATG TTACGGATCT CGATAATTCC GCAGCACTTC ACTTCAAAGA CTTCCTGCTT
GTCTGA
 
Protein sequence
MAQAVRINDI IRSFGIDTHI DYTDGKYSNV GEVVKALDYL GLDTVRDHAP NSASDPNGQT 
HLGDAAEAGV QFVFSAQREV DPATVAQRLH SFVQAHPGSV VGIEGPNEVN NWPVSYHGLS
GQAAALAYQK DLSADVNADP LLKNIPVLGF TGYTVASASD YTTIHTYAKD GDQPYSWLSR
ESGVQRAADP GKPLAITETG YHTSLTADTN GGWEGVSEAT QAKLLLNTLM DGAALGSKQT
FIYELLDAYS DPQGTNQEKH FGLFHLDYSA KPAATAIHNL TEILADDGAA KASFSAGTLN
YSIDGLPSSA RSLLTEKSDG SYQIIVWNEP DIWSQSSDTV IQATTTAVKV NLGASFGSVK
VFDPVTGTTA IKSLSNVSSL PLDVVDHPLI IEVAGTGAST PPPATNHLYG GTGNDTFTVT
NANQIVDESR GGGTDTVKAS ISFSLADQKH TVGTIENLTL TGTGNLSATG NNTANILTGN
DGNNSLNGGK GNDRLIGGLG NDKLIGKAGA DVLTGGGGSD SFVFDVKPDN TSVDKIRDFS
SAAGDKLMLD HSIFAELSLS GFSDENFVLG RKALEADDKL IYDQASGILS YDADGSAAGA
AIHVTDLDNS AALHFKDFLL V