Gene Rleg_5588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5588 
Symbol 
ID8016814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp171327 
End bp172427 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content59% 
IMG OID644827754 
Producthypothetical protein 
Protein accessionYP_002978954 
Protein GI241518326 
COG category[R] General function prediction only 
COG ID[COG0655] Multimeric flavodoxin WrbA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.256684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00697782 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGACG AACCAGCCGC GAACGCATCT GACCTCGAAC CGCGAAAAGG CAGCCCGAGC 
CCTCGCCTCG ACGAGCGAGA ATTCAAGAGG CGTTTCCTAA GCCAGTTCAA GGATCGGGTT
TACGACACAC TGGATAACGA ACTCGGCAAG GTGGCCGCGG CCGCATGGAA TGCATACGCC
CATTCGCGAA AGAGCCCGAT AACCAAAAAG GCTGGACCAG GATTTGCCGA TCCCGACTAT
GATCTTGGCG TCGATTGGCT CGAGGCGAGA GATCAAATCC GCGCTGCGCA AGCGCGCTTC
GAGGACGAAA ACGCGCCATC GCGCGTCCTC CTCGTCAACG GCTCGTCGCG AAGCGAACAT
ACCTGTCCGG GCGAGATTTC GAAAAGCTTC CGAATGGTCG AGATCGCCAA GGATGTGTTT
GCCGAAGCCG GCCTAACCGT GAACGTCCTC GACCTCAGCC GCATCGCTTC CGAATATGGC
CGACAGATCC ATCCGTGCAA GGCGTGCTTC TCGACATCGG CCGTGCTCTG TCACTGGCCG
TGCTCCTGCT ACCCGAACTA CTCCCTCGGA CAGATCCACG ATTGGATGAA CGAAATCTAT
CCAATGTGGG TCGAGGCGCA CGGCATCATG ATCGTCAGTC CGGTGAACTG GTACCAGACA
CCCTCGCCGC TGAAACTGAT GATCGACCGG CTTGTCTGCG CCGACGGCGG CAATCCAGAT
CCGACGAGCA CGCATGGCAA ACACGCCAAG GAGGCCAAGG AGCTTGAGAT GCAAGGATGG
GACTATCCCC GCCATCTTTC CGGCCGCTTA TTTTCTGTCG TCGTCCATGG CGATACCGAA
GGCGTCGAAA ATGTCCGCCG CAGCGTCTCG GATTGGCTGA CGTCGATGGA CCTCGTCCCG
GCCGGAGCCT TGGCCGAGGT CGACCGTTAC ATCGGTTATT GGGAGCCCTA CGCCACCAGC
CACGACGCCT TCGAAAAGGA CACCGCTTTT CAGCAAGAGG TGCGCAATGC CGCCAGAACC
CTGTTGGAGG GAATTACCTC ACGACGCAAT GGCAAGATGG TCGCGGCGGG AAGACGGTTG
AAGCAACCGC GTGAGAAGTG A
 
Protein sequence
MPDEPAANAS DLEPRKGSPS PRLDEREFKR RFLSQFKDRV YDTLDNELGK VAAAAWNAYA 
HSRKSPITKK AGPGFADPDY DLGVDWLEAR DQIRAAQARF EDENAPSRVL LVNGSSRSEH
TCPGEISKSF RMVEIAKDVF AEAGLTVNVL DLSRIASEYG RQIHPCKACF STSAVLCHWP
CSCYPNYSLG QIHDWMNEIY PMWVEAHGIM IVSPVNWYQT PSPLKLMIDR LVCADGGNPD
PTSTHGKHAK EAKELEMQGW DYPRHLSGRL FSVVVHGDTE GVENVRRSVS DWLTSMDLVP
AGALAEVDRY IGYWEPYATS HDAFEKDTAF QQEVRNAART LLEGITSRRN GKMVAAGRRL
KQPREK