Gene Rleg2_4156 details


Gene Information

Locus tag: Rleg2_4156
Symbol:
ID: 6982928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4332866
End bp: 4334206
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 60%
IMG OID: 643398886
Product: hypothetical protein
Protein accession: YP_002283644
Protein GI: 209551727
COG category:
COG ID:
TIGRFAM ID:
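
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def span_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(4332866, 4334206)
print(gene_length)                           # 1341
print(expected_protein_length(gene_length))  # 446
```

Both results agree with the Gene Length and Protein Length fields listed for this record.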


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.256613
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAATC CGAACCCTTA TACCCCGAGC TATTCGTTTT CCGGCTGGCA GACGTCCAAT 
CCTGCGAAAC CGTTGCCTGC GCCGCAGGTC GACAATGAGC TTGCGAACAT CTCGACGTCG
CTCAACGCCG CGATATCAGG GCTGCTGGAC ATCCGGCGCT CCGACGGCAA GCTAAAGAAC
GGGATCGTCA CGTTCGAAAG CCTGAATAAC GACCTAAAGG CCGGGTACTC AGGCGGCGCT
GTTTCGGCAT GGGCGCCAGT CGTTGATTAC GCGGCCGGGA TTGTCGCCAC GTCGATCGCG
CCAGCTACGG TCGTCGTCTA CCAGGGGGAA AGCTACGTTT GCACCACCGA CCACGTCACA
ACGGCGCTGT TCGACGTTAC CAAGTGGCAC AAGATCGCCG CCCGCGGGGC AAATGGCACG
GGCTCCGGCG ATATGCTGGC GTCGCTCAAT CTGTCGGATC TGACGAACAA GCCGCAGGCC
CGGATCAATC TTGGGCTCGG TAACGTCGAC AACACGAACG ATGCCGGCAA GCCGATTTCG
ACCGCCACGC AGGCCGCGTT CGATACGGTG AACGCGAGCC TGGCCGCTCT GACGGCCGCG
CAGTTCGATT TTTTCACAGA CTGTGTTCCG TCTTACGTCA GCGTCACCAG TATCAGTTTC
TCCTCAGGCG TCGGGCTATT CGGCAATAAG AAGCACATTC TGCCGGCCTA CACGAAGCTG
ATGAGTGCTA CGTTCGCGGC CGGCGCCGGT GTCGGAATGC TCGACACCGG CACCATCGGC
GCCAGCAAGA CCTATTTCCT GTTCGCGATC CGGAACACAT CGACGGGTGA TTGCGATTAT
CTGGCTTCGT TGAGCCTGAC GCCGCTTGTT CCGGCCGGAT GGGAGCTGAA CTCCGGCAGC
CGCATCGGGA TCATCTTAAC GAACGGCTCC AGCCAGATCA GGAACTTCGT CCAAACGGGC
AACCAAGTTA CCATCATAGG AACGGCACAA CAGGTCTTTA CGACTTCGAC CTCCATTGCA
GCGGCGCTGA TCGCACTTCC CAACTGCCCC GTTGGTATCT CGGTAGGTGC CATGCTGGCC
CTTGATGTCT CGGCGTCCAC GAACGGTGAC GTCTCGGCGT ACCTCTCCGA CTACAGCGCT
CCGGACGCTC AACGGGTCAG GGCCAGAACT TTCTGCGCGG CACAGCCTTC CGCTACGGTT
GCTCAAGCCA ATTACGCGCC GGTGCGGACA AACACCCTGG CGCAAGTCTA CCGGTCCGTG
GGCGTCGTGA CAGGCCCCGC AACGGCGACC GGCTACATCA ACGGGTGGGT AGACCACCAA
TGCAAAAGGC TTTTCCCATG A
 
Protein sequence
MANPNPYTPS YSFSGWQTSN PAKPLPAPQV DNELANISTS LNAAISGLLD IRRSDGKLKN 
GIVTFESLNN DLKAGYSGGA VSAWAPVVDY AAGIVATSIA PATVVVYQGE SYVCTTDHVT
TALFDVTKWH KIAARGANGT GSGDMLASLN LSDLTNKPQA RINLGLGNVD NTNDAGKPIS
TATQAAFDTV NASLAALTAA QFDFFTDCVP SYVSVTSISF SSGVGLFGNK KHILPAYTKL
MSATFAAGAG VGMLDTGTIG ASKTYFLFAI RNTSTGDCDY LASLSLTPLV PAGWELNSGS
RIGIILTNGS SQIRNFVQTG NQVTIIGTAQ QVFTTSTSIA AALIALPNCP VGISVGAMLA
LDVSASTNGD VSAYLSDYSA PDAQRVRART FCAAQPSATV AQANYAPVRT NTLAQVYRSV
GVVTGPATAT GYINGWVDHQ CKRLFP
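
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline used to build the record: it relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ only in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGGCAAATC CGAACCCTTA TACCCCGAGC"))  # MANPNPYTPS
```

Pasting the full gene sequence into `translate` and `gc_fraction` allows the listed protein sequence and GC content to be checked against the record.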