Gene Rleg_0739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0739 
Symbol 
ID8015476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp769872 
End bp771092 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content59% 
IMG OID644823328 
Producthypothetical protein 
Protein accessionYP_002974579 
Protein GI241203483 
COG category[S] Function unknown 
COG ID[COG5397] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTTC GGCCGGTCGA ACGCGGTGTT ATAATCTTTT TACAGGAAAG TTATAACCGG 
AGAAAATCCG TGCCCATCCG CGAGATCGAT TTGATGTACC AGACCATGCT GGCGGAGCTC
GGCCAGCGTT CGCTGGACGG AAGTTTTGTC GCCGAGTTCC CGCTGGAGGG TCGGTTCGTC
TCCGTCCCCG TGAAGGGCAA GGAATACTGG TATTTCGATC ATCCGGGCCA AGACGGAGTC
AAACGGTCTT ATGTCGGGCC AAAGAATGAC GAGGAGCTTA CGAAAAGAGT GACCGACTTC
GGCGCGATCA AGGATGATCT CCGAAACCGC CGGCGAATGG TGGCCACTCT TACGAGAGAA
GGCGGCATGA ACGCACCGCC CAGGTTTACG GGCGACATCA TCGAGGCGCT TGCCAATGCA
GGGCTCTTCC GTCTGCGCGC GGTTCTCGTC GGAACCGTGG CATTCCAGAC CTATTCCGGC
ATTCTCGGCG TTCGACTGCC GGCGTCGCTT ATGCAGACGA GCGATGCCGA CTTCGCCCAG
TTCCATTCGA TCTCGACGGC GGTCAACGAC AGCATCCCTC CCATAGGCGA GGTTCTTGAA
AAGCTGGACC CGACATTCCG GGAGGTCCCA CATCTCAACC ATCCCACGCG CTCGACCCAG
TTCGTGAACG CGAAGAACTA CAAGGTCGAA TTCCTGACGC CGAATACGGG CAGCGACGAC
AATCAGCAGA AGCCCGCTGA TATGCCCGCC CTTGGGGGAA TTTCTGCCGA ACCGCTCAGA
TTCCTCGATT ACCTCATCTA CAACCCGATC AGGACCGTGA TCCTTCACAA GAGCGGCATT
ACGGTCAATG TTCCCGCTGC GGAGCGCTAC GCAGTTCACA AGCTGATCGT CGCCTCGCGG
CGGCAGAACG ACGACAATGG CGTGCTCAAG CGCGAAAAGG ACGTGCAGCA GGCTTCCCAT
CTTTTCGAAG CGATGGGCGC GACACGCCGC CATTCTGATC TTGCGCTGGC CTATTGCGAG
GCGTGGGAAC GCGGTCAGTC ATGGCGTGAC GCAATTGCAC GCGGATTGTC GTTCATGCGA
CCGGACCGCC GTCTACAGCT CATGTCCGTT CTCGCCGAAG GCATGGCGGA AATTGGCGAA
GATCCCGCCC GTTACGGAGT TGAAACTGGC CCCGACGGAG CCGGGGGAAC TTCAACACCT
GCGCCAAAGT CCCGCCGTTA G
 
Protein sequence
MGFRPVERGV IIFLQESYNR RKSVPIREID LMYQTMLAEL GQRSLDGSFV AEFPLEGRFV 
SVPVKGKEYW YFDHPGQDGV KRSYVGPKND EELTKRVTDF GAIKDDLRNR RRMVATLTRE
GGMNAPPRFT GDIIEALANA GLFRLRAVLV GTVAFQTYSG ILGVRLPASL MQTSDADFAQ
FHSISTAVND SIPPIGEVLE KLDPTFREVP HLNHPTRSTQ FVNAKNYKVE FLTPNTGSDD
NQQKPADMPA LGGISAEPLR FLDYLIYNPI RTVILHKSGI TVNVPAAERY AVHKLIVASR
RQNDDNGVLK REKDVQQASH LFEAMGATRR HSDLALAYCE AWERGQSWRD AIARGLSFMR
PDRRLQLMSV LAEGMAEIGE DPARYGVETG PDGAGGTSTP APKSRR