Gene Rleg2_3993 details

Gene Information

Locus tag: Rleg2_3993
Symbol:
ID: 6982763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4156514
End bp: 4158166
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 60%
IMG OID: 643398722
Product: Tetratricopeptide domain protein
Protein accession: YP_002283481
Protein GI: 209551564
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
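
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the terminal stop codon is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4156514
end_bp = 4158166
gene_length_bp = 1653
protein_length_aa = 550

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the span is 1653 bp, which is 551 codons, or 550 residues plus one stop codon.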


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATAG ACATCATCGA TACCATTGCC GGGTTCGAGG CGCTGCGCGA CAACTGGGAT 
CAGGTCTTCA TGGAAGATCC CGATGCGCAG CATTTCCTCT CCTGGATCTG GCTGAAGAAT
TATCTGTCCC GCCGGCGCCG CTGGTTCATA CTCGCCCTTC GCGAACGCGA TCCGTACGAA
CCCTACGTTG CTTTTTTTCC CCTACGCCTC ATCACGCATC TGAACGAAAA GACCGGGCTC
TTCTACGACG AGATCATCAT GGCCGGGAAT TTTGCGGCCG ATTATACCGG CTTCATCGTC
AGGCCGGATT ACGAACATCA TGCCATTGCC GGCTTTGCCT CGTTTATCAA ACATCAGAAC
TGGACCGACC TGAAGCTCGA ATATTTCAGT GGCCCTGCCG GGCGGCGCGA GAAGATGATC
GAGGCGCTGC GAGGACCGGA GGTGATGTTT CGCGACAGCT CGCCGAAAAA CAATGAGAAC
ATCGACAATA CGATCTGCCC GATCGTTTCC CTGCCGGCAA GCTTCGACCA CTATCTCGAA
CAGCGCATGA GCAGCCAGAC GCGCCAGAAG CTCCGCCGGT TCCTGCGCAA AGTCGAAGGC
GACGATATCT ACCGCATCAC GATGTCGACC CCCGAGACCA TCCATCGCGA CCTGGACATT
CTCTTCGATC TCTGGCGGAC CAAGTGGAGC GCCCGCAAAG GCGCGGAGCG GACCGAGCGG
CTGATCATTA CCACGCGCGA AATGCTGATG GACTGTTTCA ACAACGGCAA TCTCGAGGTG
CCGGTCTTCT GGCATGGCGA CCAGCCGCTC GGCGCGCTGG CAAATATCGT CGACCGGCAG
AAGAAAGCGA TCCTCTTCTA TATCACCGGT CGCGACGAAA ACTGGAAAAC GCCGTCTCCC
GGTCTCATCC TGCACGGTTA CTGCATCCGG CGGGCGATCG AGCAGGGCTT CAAGACCTAT
GACTTCCTGC GCGGAAACGA GCCCTATAAA TATATGTTCG GGGTCGAGGA ACGACACATC
AGCTGCACGC TCTTCCGCAC CCGCAATGGC CAGAATCTGC ATGGCGCGCT CAACCCGCGC
AGCATTCGCT TCGTCTATGA GCAGGCGCTT GACATGTACC GCAACGGCGC CCGCCGGAGA
GCGGAGATCG TCTTCAACCA GGTCCTGCAA TCGGCTCCAG GCCATACCGG CGCGGGCTTC
GGGCTGGCCA ATCTGCTGTT CGACCGGGGC AAGCTGACGG AGGCACTGGC TGCCTATAAG
GCGCTCGCCG AACAAGCGCC CGATCCGACA CCGATCCGGA TGCGGCTTGG CGACACGCAG
CTTGCTTTGC ATCAATACGA CCAGGCCGCC GAGACGTTCC GCCTGGTCGG CGAGGTCGGG
CCGCATCTGA TCCAGGCGCA TTACAAGCGT GGCATTGCCC TTGTTGCCGG TAAACGGCTG
GCCGAGGCGG AAGCTGCTTT CGCCGCGATC CGGGACGTGC ATTCGGACGA TCCGGCCGCA
CTCGACTATG TTGCCAAGGC AAATGCCGCC CTCGAACGGA TCCAGGCGAG CGCCGAACCC
ACGCCTCACA AGACCGATGT CGTGTCCGAG ACCATCGCCC GCTGGAACCG GGGCTGGCAG
CTCAGCGAGC GACGCCGGCC ACGTTTGCAC TGA
 
Protein sequence
MRIDIIDTIA GFEALRDNWD QVFMEDPDAQ HFLSWIWLKN YLSRRRRWFI LALRERDPYE 
PYVAFFPLRL ITHLNEKTGL FYDEIIMAGN FAADYTGFIV RPDYEHHAIA GFASFIKHQN
WTDLKLEYFS GPAGRREKMI EALRGPEVMF RDSSPKNNEN IDNTICPIVS LPASFDHYLE
QRMSSQTRQK LRRFLRKVEG DDIYRITMST PETIHRDLDI LFDLWRTKWS ARKGAERTER
LIITTREMLM DCFNNGNLEV PVFWHGDQPL GALANIVDRQ KKAILFYITG RDENWKTPSP
GLILHGYCIR RAIEQGFKTY DFLRGNEPYK YMFGVEERHI SCTLFRTRNG QNLHGALNPR
SIRFVYEQAL DMYRNGARRR AEIVFNQVLQ SAPGHTGAGF GLANLLFDRG KLTEALAAYK
ALAEQAPDPT PIRMRLGDTQ LALHQYDQAA ETFRLVGEVG PHLIQAHYKR GIALVAGKRL
AEAEAAFAAI RDVHSDDPAA LDYVAKANAA LERIQASAEP TPHKTDVVSE TIARWNRGWQ
LSERRRPRLH
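
Both sequences above are printed in space-separated blocks of 10 characters, which need to be joined before any analysis. The following is a minimal sketch of that clean-up, with a GC fraction helper that could be used to compare against the reported GC content. The function names `strip_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def strip_blocks(formatted: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(formatted.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here as a small example.
first_line = "ATGCGCATAG ACATCATCGA TACCATTGCC GGGTTCGAGG CGCTGCGCGA CAACTGGGAT"
seq = strip_blocks(first_line)

print(len(seq))                     # number of bases on that line
print(round(gc_fraction(seq), 3))   # GC fraction of that line only
```

Passing the full gene sequence through `strip_blocks` should give a string whose length matches the Gene Length field, and the same helper applied to the protein sequence gives a string whose length can be compared with the Protein Length field.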