Gene Rleg_5456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5456 
Symbol 
ID8016765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp36393 
End bp37499 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content59% 
IMG OID644827628 
Producthypothetical protein 
Protein accessionYP_002978828 
Protein GI241518200 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.324633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.925053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCGT TTGACCCGCC TGGATTTCTC AAAGACTTCA ACAGCCAGCA GGCAGATGCC 
TGGAGCGACT GGATTTCACA GCAACTGGAT GAAGCGAAAG CCGGGCGGCC GGATCTCTTC
GATTTCGACG CGCCGAGACC GCGTTTCTTC AATGCATCGT TGGTTGCTCC CGCTGCCGAC
GCCGTTGAAA AGGATATTAC CTGGACAGCT TTTCCCCGCC TTGTGTCGAT AGATGCCGCG
ACCGACGAGG AAAGATGGCG AACGGCAGAT TTATCGCGCG ATGCGCAGGA CGAGTATTGC
GAGTGGAGTG TTGCGCGGCG CGCCGACGGG CGCATCAACA GCGTGACTTT CACCTGCGAA
GGTCCGGAAT ATTGGGAGTT CCTGGCTGCG ACGAATTTTC AGAAAGTGCT CGACCTCTAC
CAAGAGTTCG TCGATCCCGC GGTCGAAGCG AAAGACCTGC GTCTGACGGG CGGTCGCTAC
AATGCGAGAA ACAAATGGAA TAACAGTACA AGCCGCGGGG CGATGCACCT CATACAGCCG
AACAATACGC TGGGTGCCGA AATCGAGCTT GCCGCCGGCG CCAGCAATGC AAGGGCCCCG
GGAGGCACCT TGTTGACCAA CGATCAGGAC CTCATCCGCT GCGGACGATA TGGCCAGCCC
GAACGGCATA GCGACCCGAC CATCGGCGGC GGGATCAATG CGTTGGCGCG CGCCAATGCC
GATATCACGC TTGCCAATCC AGTTGGGATA TATTTTGCCG GCCTGAACAC CTCCGGATGG
ACAACACCCG ACGGATCCGA CGCGTCGCTT TATTGGAAAG TGGCCCGCGG CACGACGGCC
AAGCCAGTTC GAATGGTCTA CGCAGTGCCC GATGGCAAGG GCTTCACCGT GAGCGACATT
TCCATCAACG ACAACCCAAT CCGCTTCGGC GGGCAGATCG CCGATGCGAT CAGCATGAAG
CTGACGGGTC TGGCGATGAA TATAGGTCAG AGCAATCACC CGCCGCTCGC AGACTGCAGG
CGCGACGCGG CGACTCCGAC AACGGTGTCC ACCAGCACCA TGGACGTAGC AACGACACTG
AACATTACGA GGCAGTCAAC AAGGTGA
 
Protein sequence
MPSFDPPGFL KDFNSQQADA WSDWISQQLD EAKAGRPDLF DFDAPRPRFF NASLVAPAAD 
AVEKDITWTA FPRLVSIDAA TDEERWRTAD LSRDAQDEYC EWSVARRADG RINSVTFTCE
GPEYWEFLAA TNFQKVLDLY QEFVDPAVEA KDLRLTGGRY NARNKWNNST SRGAMHLIQP
NNTLGAEIEL AAGASNARAP GGTLLTNDQD LIRCGRYGQP ERHSDPTIGG GINALARANA
DITLANPVGI YFAGLNTSGW TTPDGSDASL YWKVARGTTA KPVRMVYAVP DGKGFTVSDI
SINDNPIRFG GQIADAISMK LTGLAMNIGQ SNHPPLADCR RDAATPTTVS TSTMDVATTL
NITRQSTR