Gene Rleg_1314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1314 
Symbol 
ID8012412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1295801 
End bp1297042 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content62% 
IMG OID644823896 
Producthypothetical protein 
Protein accessionYP_002975145 
Protein GI241204049 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGACGC AAAAGCTCGA TGTCCATGAA CAGCCGTCCG ACGACGTGGC GCTGGCCGAT 
ACGGCCATTT CGCGCCTGCG CCAGCTGAGC ATGAAACTCG CCATGGCTGA AATCGACATC
GAAGTCTTCG ATACGATGCA GCCGCTGGAG GATGAGTGGC GGGCGCTGGA GCGCGACAAT
CTCCAGTCCC TGCATCAGAG CTACGACTGG TGCGCCGCCT GGGTGAGCGC CTTCCAGCGG
CCGCTTGCGA TCCTCAAAGG CACTCATGCG GGTCAGACCG CCTTCATTCT GCCGGCCGAG
ATCGTCAAGT CTCGCGGGCT CACGACGGCG AAATTCATCG CCGCCGATCA CAGCAATATC
AATACCGGCC TATTCGCAGA GAGCTTTGCC GAAGCCGGCA GGACCATCGC CCCCCATGAG
TTCGCCGGCC GGCTCCGGCA TGCGCTGAAG GGCCGCGCCG ATCTGCTGCT GCTGCAGAAC
ATTCCGCTGG AATGGCGTGG GCGCGAGAGC CCGCTCGCCG GGCTGCCGGT GGTGCAGAAC
CAGAATCACG CCTATCAGCT GCCGTTCCTT CCCGCTTTCG AGGACACGCT GAAGCAGCTC
AACGCCAAGA ACCGGCGCAA GAAATTCCGT GTTCAGTCGA AGCGCCTCGA GGCGGCCGGC
GGCTTCGAAT ACCTCATTCC TCGGACATCG GAAGAACAGC ACGGCCTGCT CGATATCTTC
TTCCGCCTGA AAAGCGCCCG TTTCGCCAGC CTTGGCCTGC CCGACGTCTT TGCCGATAGG
GAGACGCAGA CCTTCCTGCA CGGTCTCATC GACAAGCGGG ACGACACCAG GCAGTATTTC
GGGCTGCAGA TGCATATGCT CCGGCTCAAG GGCGAACTTG AGGGTAAGAT CGCCGCGATA
TCAGGCATCT CGCGCAAGGG TGACCATATC ATCTGCCAGT TCGGCGCGAT CGACGAAGAG
CTCGTGCCGG ATACGAGCCC CGGCGAATTC CTCTATTGGC AGACCATCTC GGGACTGCAT
GGCAAGGGTG TCGCACTGTT CGATTTCGGC CTCGGCGACC AGACCTACAA GCGTTCCTGG
GCGCCGGTCG AGACCGCGCA TTATGACGTG GTGCTGCCGG TATCGCCGTT CGGCGTCGTC
GCCGGCGCCG CTCACCGGAT CGTCACCCAC GGCAAGGCGC ACATCAAGGC GCGCCCGAAG
CTCTATAAAT TCGCCCAAGG CATCCGGGCA CGGATCGGCT AG
 
Protein sequence
MQTQKLDVHE QPSDDVALAD TAISRLRQLS MKLAMAEIDI EVFDTMQPLE DEWRALERDN 
LQSLHQSYDW CAAWVSAFQR PLAILKGTHA GQTAFILPAE IVKSRGLTTA KFIAADHSNI
NTGLFAESFA EAGRTIAPHE FAGRLRHALK GRADLLLLQN IPLEWRGRES PLAGLPVVQN
QNHAYQLPFL PAFEDTLKQL NAKNRRKKFR VQSKRLEAAG GFEYLIPRTS EEQHGLLDIF
FRLKSARFAS LGLPDVFADR ETQTFLHGLI DKRDDTRQYF GLQMHMLRLK GELEGKIAAI
SGISRKGDHI ICQFGAIDEE LVPDTSPGEF LYWQTISGLH GKGVALFDFG LGDQTYKRSW
APVETAHYDV VLPVSPFGVV AGAAHRIVTH GKAHIKARPK LYKFAQGIRA RIG