Gene Rleg_4847 details


Gene Information

Locus tag: Rleg_4847
Symbol:
ID: 8007235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 223593
End bp: 225245
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 61%
IMG OID: 644821777
Product: transferase hexapeptide repeat containing protein
Protein accession: YP_002973037
Protein GI: 241113202
COG category: [R] General function prediction only
COG ID: [COG0110] Acetyltransferase (isoleucine patch superfamily)
TIGRFAM ID:
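
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 223593
end_bp = 225245

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A protein-coding sequence should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1653 550
```

Under these assumptions the result agrees with the reported gene length of 1653 bp and protein length of 550 aa.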


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0521363
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCACG TCAATAGCCC GCAATCGTCA GTGAGCGAAG AGAAGGAAGC GCGCAGGCTT 
CAATACCTGA CCTGGGAACA CATTGCGTCT GACCTCCGTC ATCCCACTCA CCTCGCCCGC
AAGGCGGAAC TCAGGCGATC ATGCAGTGCG GAACTGGCCG AGACGTCCTA CATTGCCGAG
CATGCCGCAA TCTTCACCGA AAGCCTGACG ATGGGCGAGC GGTCCTGGAT CGCCGGGCAC
GCGCTCGTTC GCGGCCATGT GATCCTCGGC GACGATTGCA CCATCAATCC CTATGCCTGT
GTTTCCGGAA CGGTGACGTG CGGTCATGGC GTTCGGATTG CTTCGCATGC ATCGATCGTC
GGCTTCAATC ATGGCTTCGA CGATCCGACC ATTCCTATAC ACCGCCAGGG CGTCGTCAGC
ATCGGCATCG CGATCGGCGA CGATGTCTGG ATCGGCGCAA ATTGCGTGAT CCTCGATGGC
GCAACAATTG GAAACGGTGC GGTGATCGCC GCCGGCGCTG TGGTCACGGG GGACATTCCC
GCCATGGCAA TTGCCGGTGG CGTGCCCGCC CGGGTGCTGC GAAGCCGAGG CTCGGCGCCG
ACGAAAACCG GCACCGGCGA CATCGAAGAT CAATTGGTGA GGCTCGGCCA GAAAGCGAAA
GACCAGTGGC CGGACATCCT TGCACGCTGG AAAACGCGAG GGTCCTATGA ATCGCTGGAA
GCGGACGGCA TCCGCAGACC GGCGATCCGG CACCTCTGCG ATGCAATCGA GATCGCTGCC
GGCTTCGGCC ACCTGCCGCC CGATCTCGAT GCGGCGGAGA CCGTCGAGCG TCTCCAAGGT
CTTCAGGACC GAGAGACCGG CCTTTTCCCG GAAGGACATT CGCGCATCCT TGGCAAGGCG
CTGAGGGATG ATCCAAAGGC GCTCTATAAC GTCCTTGCGG TTGGCTATGC ACTTGAACTG
CTTGGTTCAG GTCCGCGCCA ACCCGTCCAC GCAGTCGAGC TCGAGGCCGG GGAACTGGAT
GAATGGCTGA GCGCCCTGCC CTGGTCGACC CGGGCATGGC ACGCCGGAAG CGTGGTCGAT
GCGATCGGAA CTGCCATGTA CTTCAATGCG AAGTCTTTCG GCATCAGGCA TTCACGGCAG
GCGCTCTTCG AATGGCTAAG CCGCAATGCC AACAGCGTTT CGGGGTTGTG GGGTGAACCG
ACCGCGGCGG AAGGATGGCT TCAACCGGTG AACGGCTTTT ATCGCCTGAC GCGCGGCACC
TACGCCCAGT TCGGCGTGGC ACTTCCCCAC CCGCACGCCT CACTCGAAAC GGTTCATCTC
AACTATCGCA ACCACAAGGG CTTCGTTGCT GCAAAATACA ATGCGTGCAA CCTGCTCGAT
ACGATTCATC CTCTGCTGCT GATTGCCCGG CAGACCGACT ACAGACGGGC CGACGGCGAG
GCGATCGCCC GCAAGGTCAT CTCAAGGGCG CTGGATAGAT GGCGGGATGG CGAAGGATTC
CCGTTTGCCG ATGGTGGTGA ACCGAGCTTG CAGGGGACGG AAATGTGGCT TTCCGTCATT
CACCTGGCGG CCGATTTTCT CGGCCTGTCA GATCGCTTCG CCTTCGTCCC GAAAGGCGTT
CACCGGACGG CAACCGTCGG GCTGGGTTTG TGA
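
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks might be joined and a GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of ten bases).
first_row = "ATGGATCACG TCAATAGCCC GCAATCGTCA GTGAGCGAAG AGAAGGAAGC GCGCAGGCTT"

seq = clean_sequence(first_row)
print(len(seq))                    # number of bases after joining the blocks
print(round(gc_fraction(seq), 3))  # GC fraction of this row only
```

Passing the full gene sequence instead of one row gives a value that can be compared with the GC content field, which is presumably rounded to a whole percent.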
 
Protein sequence
MDHVNSPQSS VSEEKEARRL QYLTWEHIAS DLRHPTHLAR KAELRRSCSA ELAETSYIAE 
HAAIFTESLT MGERSWIAGH ALVRGHVILG DDCTINPYAC VSGTVTCGHG VRIASHASIV
GFNHGFDDPT IPIHRQGVVS IGIAIGDDVW IGANCVILDG ATIGNGAVIA AGAVVTGDIP
AMAIAGGVPA RVLRSRGSAP TKTGTGDIED QLVRLGQKAK DQWPDILARW KTRGSYESLE
ADGIRRPAIR HLCDAIEIAA GFGHLPPDLD AAETVERLQG LQDRETGLFP EGHSRILGKA
LRDDPKALYN VLAVGYALEL LGSGPRQPVH AVELEAGELD EWLSALPWST RAWHAGSVVD
AIGTAMYFNA KSFGIRHSRQ ALFEWLSRNA NSVSGLWGEP TAAEGWLQPV NGFYRLTRGT
YAQFGVALPH PHASLETVHL NYRNHKGFVA AKYNACNLLD TIHPLLLIAR QTDYRRADGE
AIARKVISRA LDRWRDGEGF PFADGGEPSL QGTEMWLSVI HLAADFLGLS DRFAFVPKGV
HRTATVGLGL