Gene Rleg2_1226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1226 
Symbol 
ID6979947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1239577 
End bp1240818 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content62% 
IMG OID643395940 
Producthypothetical protein 
Protein accessionYP_002280746 
Protein GI209548829 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0671994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGACGC AAAAGCTCGA TGTTCATGAA CAGCCGTCTG AGGACGGGGC GCTGGCCCAA 
ACCGCCATTT CGCGCCTGCG CCAGCTGAGC ATGAAACTCG CCATGGCCGA AATCGACATC
AGCGTTTTCG ACGCGATGCA GCCGCTGGAG GGCGACTGGA GGACGCTGGA GCGCGACAAT
CTCCAGTCCC TGCATCAGGG CTACGACTGG TGCGCCGCCT GGGTAAGCGC CTTTCAGCGG
CCGCTGGCGA TCCTCAAAGG CACCTATGCC GGTGAGACCG CTTTCATTCT GCCGGTCGAA
ATCGTCAAGT CGCGGGGGCT TGGCGTGGCG AAGTTCATCG CCGCCGATCA CAGCAATATC
AATACCGGCC TGTTTTCCCG AAATTTTGCC GAAAGCGGCG GCAGCATTGA CGCCGAGAAG
TTCGCGGGGC AGCTTCGGCA CGCCTTGAGG GGCCGAGCCG ACCTGCTGCT GCTGCAGAAT
ATTCCGCTGG AATGGCGGGG ACGACAGACC CCGCTCACCG GGCTGCCGAT GGTGCAGAAC
CAGAATCATG CCTATCAGCT GCCGTTCTTT CCGGCTTTCG AGGAGACGCT GAAGCAGCTC
AACGCCAAGA ACCGGCGCAA GAAATTCCGC GTTCAGTCGA AACGCCTCGA GGCGGCCGGC
GGCTTCGAAT ACCTTGTCCC CCAGGCATCA GAAGAACAGC ACCGCCTGCT CGACATCTTC
TTCCGGCTGA AGAGCGCCCG TTTCGCCAGC CTCGGCCTGC CCGACGTCTT CGCAGACGGG
GAAACGCAGG CCTTTCTGCA CGGTCTGATC GACATGCGGG ATGACGGCAG GCAATATTTC
GGGCTGCAGA TGCATGTGCT GCGGCTCAAG GGCGCGAATG AGGGTCGGGT CGCCGCGATT
TCAGGGATTT CGCGCAAGGG CGACCATATC ATCTGCCAGT TCGGCGCGAT CGATGAGGAA
CTCGTGCCGG ATACCAGCCC CGGTGAATTC CTCTATTGGC AGACCATCTC GGGATTGCAT
GGCAAGGGGG TGGCGCTGTT CGACTTCGGC CTCGGCGACC AGACCTACAA GCGATCCTGG
GCGCCGGTGG AGACCGCGCA TTACGACGTG GTGCTGCCGG TCTCGCCGTT CGGCGTTCTC
GCCGGCACAG CGCACCGGAT CGTCACCCGC GGCAAGGCCC ATATCAAGGC CCGCCCGAAG
CTCTATAAAT TTACGCAGAG CATCCGCGCA CGCATCGGCT GA
 
Protein sequence
MQTQKLDVHE QPSEDGALAQ TAISRLRQLS MKLAMAEIDI SVFDAMQPLE GDWRTLERDN 
LQSLHQGYDW CAAWVSAFQR PLAILKGTYA GETAFILPVE IVKSRGLGVA KFIAADHSNI
NTGLFSRNFA ESGGSIDAEK FAGQLRHALR GRADLLLLQN IPLEWRGRQT PLTGLPMVQN
QNHAYQLPFF PAFEETLKQL NAKNRRKKFR VQSKRLEAAG GFEYLVPQAS EEQHRLLDIF
FRLKSARFAS LGLPDVFADG ETQAFLHGLI DMRDDGRQYF GLQMHVLRLK GANEGRVAAI
SGISRKGDHI ICQFGAIDEE LVPDTSPGEF LYWQTISGLH GKGVALFDFG LGDQTYKRSW
APVETAHYDV VLPVSPFGVL AGTAHRIVTR GKAHIKARPK LYKFTQSIRA RIG