Gene Rleg_5584 details

Gene Information

Locus tag: Rleg_5584
Symbol:
ID: 8016475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 166733
End bp: 167863
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 63%
IMG OID: 644827750
Product: hypothetical protein
Protein accession: YP_002978950
Protein GI: 241518322
COG category: [S] Function unknown
COG ID: [COG4641] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
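
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither is stated in the record, though both agree with its values. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for display."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length = gene_length(166733, 167863)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Passing the gene sequence from the Sequence section to `gc_fraction` gives a value that can be compared with the GC content reported above.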


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.348361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0177217
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAATC CGCTCGATAT CCTCATCCTC GGCCTCTCCT TGTCGTCGTC GTGGGGCAAC 
GGTCACGCGA CGACCTATCG CGCCCTCATC GGCGGCTTGC ACGCCGGGGG GCATCGAGTG
TTGTTCCTGG AGCGCGACGT GCCCTGGTAT GCGGCGCACC GAGATCTTCC CGATCCCGAC
TTTTGCCAGC TCGTCCATTA CAGCGACATC GAAGAGATGA TCGAAAACCA TGCCGATCGG
ATCAAGGCGG CGGACGCGGT CATCATCGGG TCCTACGTTC CATCCGGCGT GGCGGTTATC
GACAGGATCG CCGCCCTGAA GCCCCGACGG CTGTGCTTTT ATGACATCGA CACGCCGGTG
ACGCTGGCGA AGCTCGACCG CGGCGACGAG GAATATCTGG CGCGCCGACA GCTTGCGACC
TTCGACGCCT ACTTCTCGTT TTCGGGCGGT GACGTGTTGG CGGGTCTCGA GCGCGGATAC
GGCGCGCGCA AGGCGATCCC TCTCTACTGC TCCGTCGATG CGAGCCGATA TCGGCCAACG
GACGAAGCCT TCCGCTGGGA TTTCGGCTAT CTCGGCACCT ATAGCCCCGA CCGACAGCCA
ACGCTGGAGC GGCTGCTGAT TGAGCCTGCC AGGCAACTGC CGCATCTGAG CTTCGTGGTC
GCCGGTCCTC AATATCCTGA AAATATTGAC TGGCCGGCGA ATGTGGAGCG GATCGAACAC
CTGCCGCCTG CCGATCATCC GAGCTTCTAC AGCCGGCAGC GTTTCACGCT CAACGTCACG
CGAACCGACA TGATCGCAGC GGGCTGGTCG CCGAGCGTGC GGCTATTCGA GGCCGCTGCG
TGCGGCACGC CGATCATCAG TGACGAGTGG CGCGGCTTGA ACGAGTTCTT CGCCGACGGT
CAGGCGATCA TCATCGCCAA AGGATCGGGG GATGTCGTCG ACGCCCTGAC AACCATCGCC
GCCGCGGGGC GCCGTGCGCT CGCATCGGCC GCCAGGGCGA CGGTGCTTGA ACGCCATACC
GGCGAGGTGC GCGCTCGTGA ACTCGCCGCC GCCTTGCGAG AACTGCCAGA AGAAGGGGGA
GAACGACAAT CGTCCCCAGC CTCAATCCAT TTCAGCTTAG GAGACGCATG A
 
Protein sequence
MTNPLDILIL GLSLSSSWGN GHATTYRALI GGLHAGGHRV LFLERDVPWY AAHRDLPDPD 
FCQLVHYSDI EEMIENHADR IKAADAVIIG SYVPSGVAVI DRIAALKPRR LCFYDIDTPV
TLAKLDRGDE EYLARRQLAT FDAYFSFSGG DVLAGLERGY GARKAIPLYC SVDASRYRPT
DEAFRWDFGY LGTYSPDRQP TLERLLIEPA RQLPHLSFVV AGPQYPENID WPANVERIEH
LPPADHPSFY SRQRFTLNVT RTDMIAAGWS PSVRLFEAAA CGTPIISDEW RGLNEFFADG
QAIIIAKGSG DVVDALTTIA AAGRRALASA ARATVLERHT GEVRARELAA ALRELPEEGG
ERQSSPASIH FSLGDA