Gene Rleg_5448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5448 
Symbol 
ID8016757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp25337 
End bp26617 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content54% 
IMG OID644827621 
Producthypothetical protein 
Protein accessionYP_002978821 
Protein GI241518193 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0547483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0192928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTATT TTTTGATCCC GAAAGTCGTC GCGTTACTTG CAGTTCTTGG CACGGTGCAA 
TCTTGCTCGT CGGTATATTC GTCGTCGGCA TTGGAGGTAT CCGCAACGTC GGGGACGTCC
GAGTGCCTCG CAAATATGGG AACCTATTTT CTTCCCAAAG GAGAGCTTTC GTTCGTTGTT
CTGAAAAAGC CGACGATAGG CATGACAGGC TTCCGATACG ACATGAAGAC CGTCGCCGAC
AACACTGGTA CCGACGGACT ATCCGTGGTC ATGTCGCCTG ACGAACGCCA CCAGTACTGC
CTCGACTTCA AGCCTAATTC TTCGTACTCC GACGTGGTCC GCGCGCAGCG CAACGAACTT
GGACTCCTGA CAAGTGTCTA CAGCAATGTC GAGGATCAGA GCAAAACCAT CGTAGAGGAT
ACCGCACGAG GCATTGCGTT GGCAGTAGCG GCAGAATCCC GGCTCGCTAA TAGAGACTTT
TTGGTCGCCG ACCCGGCGAC TGTCGTTCAC ATGAAGATGC AATTCGATCC TTTCGATCTG
GATCGCATCA CCAGCGTAAA CCGGGCGCTC GAAAAAAGCG GTTATTGCAT CTACATCGAT
CCCAAAAGCG ATCCCTTTGT TCCGTTCTGG ATGCGAAATC AATGTTCATC CACTCCGCAG
CTCGTCGCTT ACAATTTCAA GGGGGACGCG GAAGAGGTCT TTAGTTCCGC GAGCTACACT
GCAGGCGAAG GCCGGTTCGG CATCCTCTAC AAGCCTGCAT TGAGCCACAC TCTGGTTATC
CTCAAGCGTG ACGATCCAAC GTCAGGGAAG CCATGGCGCA TCTGGAAGCG CCAGATTGTC
GAGTTGCCTA ACCGTGCGCC TGTTTTCATG CTGCAGGTGA GCCGCGGCTT CTTCACCGCC
CGCAAGAGCG AGATAACGTT CCAAAACGGG ATGCTCGCCA GTGTCGAAGT TGATAAGAAG
AGCGAGCTGA AGGCCGTGTC GGAAGCGTTT GTGAACGTGG TTAGTATCGT CGTGAGAATT
CCGGCCAAGG CCCTTATCAT CGGAACCAAC GAGGCAAAAA ACCAGGAAGC GCTCATCAGG
GCAAACCAGG CTCTTCTGCA AGCGTACGCA GAATTGGAAG CCGAACAAAG GAAACAGGCT
AACCTCAAAC AAGGCCTCGA CGTAGATGGC CTTCCCAGAA CCTCGTCCGC ACGCACGAGA
GCAGCCTGCC TCGATTATGC TGACCTCAGC GCGGTGGAAG ACCCGAACGT ATACTGTCAG
GACAAGGCCG AGACGCAATG A
 
Protein sequence
MSYFLIPKVV ALLAVLGTVQ SCSSVYSSSA LEVSATSGTS ECLANMGTYF LPKGELSFVV 
LKKPTIGMTG FRYDMKTVAD NTGTDGLSVV MSPDERHQYC LDFKPNSSYS DVVRAQRNEL
GLLTSVYSNV EDQSKTIVED TARGIALAVA AESRLANRDF LVADPATVVH MKMQFDPFDL
DRITSVNRAL EKSGYCIYID PKSDPFVPFW MRNQCSSTPQ LVAYNFKGDA EEVFSSASYT
AGEGRFGILY KPALSHTLVI LKRDDPTSGK PWRIWKRQIV ELPNRAPVFM LQVSRGFFTA
RKSEITFQNG MLASVEVDKK SELKAVSEAF VNVVSIVVRI PAKALIIGTN EAKNQEALIR
ANQALLQAYA ELEAEQRKQA NLKQGLDVDG LPRTSSARTR AACLDYADLS AVEDPNVYCQ
DKAETQ