Gene Rleg_3870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3870 
Symbol 
ID8014693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3941157 
End bp3942173 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content62% 
IMG OID644826440 
Producthypothetical protein 
Protein accessionYP_002977652 
Protein GI241206556 
COG category[S] Function unknown 
COG ID[COG4093] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.13922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGT CAAGCCAATC CGGCAGCAGC CAATCCGGCA GCGGTAAGAA ATTCTGGTTG 
CTGGGTGGAG GCGTCCTCCT GGTGATTGCG CTTTATACCG GCGGCTGGTT CTATGCGGCC
TCGGCGCTGA AGAACACAGT GCTGAAAGCG ATCGCGCCGC GCGACCAGGC AGGCGTCAGC
GGCGAATGCT CCGATATCGA ATTTCGCGGC TATCCCTTCC GTATCGGCCT GTTCTGCTCC
AAGATCGACG TCGACGACAA TGTCAACGGC GTCTCCGCCA CCTTCGGCGC GCTGCGCTCG
GCAGCACAGG TCTACGCGCC CGGCAATATC GTCTGGGAAC TCGATTCTCC GGCAGAGATC
CGCACCAGCA ACGGCCTTTC GATCTCGGCC CAATGGACGA ACCTGCAGGC GAGCCTTACG
ACGAGGCTGC AGGGCATCGA CCACAGCTCG ACCGTCATCG AGGGTCTGAA GGCGATGGCC
TTCTCCTCCT ACACCGGCCA GACCATGAGC TTCGATGCCG CTCGCACCGA AATCCACCTG
CGCCAGAATG GTGCTGATCT CGACGGTGCG ATTTCCGTGC AGGACGCCAA CGCGGCGATC
AAGGACTGGC CGCAGATCTT CCCGAAATTC TCGGCGAGCA TCGATCTGAC CGTCGCCGGC
AAGGCCGGCC TGATCGACGG CAGTGACCGG AACGGCCTCA ATGGCGCCAC CGGCGACCTG
CGCCGCATCG TCGCCGACAT CGGTGACGGC AAGGTGATGA CGCTCACCGG CCCCTTCTCC
TTCGACGAGC AGGGCTTGCT TTCGGGAAAA TTCAAACTGG AGATCGAACA ACTCGGCCCT
TGGGGGGACA GCCTGAAACA GGCCTTTCCG GATATCGCCT CGACCGTCAA CACGGCGACG
AAGCTGCTGA AATCGCTTGC CGGCGGCGGC GACAAGGTCT CCGTCGATCT CGTCGTCAAT
CGCGGCAATG CCACCGTCAG CGGTTTCATC CCGCTCGGCC GCATTCCACC GATCTGA
 
Protein sequence
MAASSQSGSS QSGSGKKFWL LGGGVLLVIA LYTGGWFYAA SALKNTVLKA IAPRDQAGVS 
GECSDIEFRG YPFRIGLFCS KIDVDDNVNG VSATFGALRS AAQVYAPGNI VWELDSPAEI
RTSNGLSISA QWTNLQASLT TRLQGIDHSS TVIEGLKAMA FSSYTGQTMS FDAARTEIHL
RQNGADLDGA ISVQDANAAI KDWPQIFPKF SASIDLTVAG KAGLIDGSDR NGLNGATGDL
RRIVADIGDG KVMTLTGPFS FDEQGLLSGK FKLEIEQLGP WGDSLKQAFP DIASTVNTAT
KLLKSLAGGG DKVSVDLVVN RGNATVSGFI PLGRIPPI