Gene Rleg_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0449 
Symbol 
ID8011649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp466227 
End bp467411 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content55% 
IMG OID644823043 
Productglycosyl transferase group 1 
Protein accessionYP_002974297 
Protein GI241203201 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAT TGATCTATCT CACCGGCGGT ACCCACTGGA TTGGCGGTGT CCAGTACACT 
CGCAACCTGT TGCGCGCTGT TTCGCTGCTG CCAGCGCAGG AACGCCCCGC GCTCGTGCTT
CAGATAGGTC GGAAGAATGC TGGCCAAGGA TACGAGGAGG AATTCTCGCA CTATCCCGGG
GTGGTCATCG ATGGGCCACT TGAGCGGGGT TCGGCGATCC GGTCAAGAAT ATTGGATCTC
GCGCGGCGCG CATGGAAGAG GTCGACCGGC AAGGATCTAC GTCAGAAGCT TCTGCACTCC
GACGAGTGTG ACGTCGCATT TCCTGCAAAA GGTCCAAACA TTCCGGGTTT GGCACAGAAG
GTCTATTGGG TTCCTGATTT TCAGTACAAG CATTTCCCAC AGTTTTTCTC CGAAGACGAG
CGACGTAGCC GTGACGCCTT TTACGGAAAG ATGTTTGATG AGAGCGGCAT TCTTGTCCTG
AGCAGTGAAG CGGTGAAAGC CGACTTCATA CGGTTTTTCC CGACCTATTC CCAAAAACCG
GTGCGCATCC TCCACTTTTC AAGCACGCTT CATGACGAGG AGTATGCCCT GGATCCAGTC
GCGGTCTGTG CTAAACATGG CTTGCCGGAA AAATTTGTGT ATCTGCCCAA TCAGATGTGG
CAACACAAGG GCTTCGACGC CGCCTTTCGT GCGCTGGGCA TTCTGAAACG CGCGGGGGTT
ATCATCCCCC TTGTCCTGAC GGGGAGCTCA GAGGATTATC GCAGTAATGA CTACGCTCGC
CAACTCGAAG AAATCCTGAC GGAATATGAC CTTCAGGATC AGATCTACCG TCTGGGCGTC
CTGCCGCGAA GCGAGCAACT TCAGCTTTTC CGCCGCGCTG CCGTTGTTCT TCAGCCATCA
CGGTTCGAAG GGTGGAGCAC GACAGTCGAA GATACCCGCG CCCTGGGGAG GCCGATCGTG
TTGTCGAACA TCGATGTTCA TCTGGAGCAG GCCCCCCCAA ACGCGAGCTA TTTTGTTGTC
GGGGATCAAA AAGATCTTGC GGATAAGCTC GGCAAAGCTT GGCTCACCGC CGAGGCGGGG
CCTGATTTCA AACAGGAAGA TGCCGCACGC AAGGCGGCAA ACCTCAACAG TTTGGCGTAT
GCGAGGACCT TTCTTTCAAT TATGAGACAG GCTCATCGCG AGTGA
 
Protein sequence
MKILIYLTGG THWIGGVQYT RNLLRAVSLL PAQERPALVL QIGRKNAGQG YEEEFSHYPG 
VVIDGPLERG SAIRSRILDL ARRAWKRSTG KDLRQKLLHS DECDVAFPAK GPNIPGLAQK
VYWVPDFQYK HFPQFFSEDE RRSRDAFYGK MFDESGILVL SSEAVKADFI RFFPTYSQKP
VRILHFSSTL HDEEYALDPV AVCAKHGLPE KFVYLPNQMW QHKGFDAAFR ALGILKRAGV
IIPLVLTGSS EDYRSNDYAR QLEEILTEYD LQDQIYRLGV LPRSEQLQLF RRAAVVLQPS
RFEGWSTTVE DTRALGRPIV LSNIDVHLEQ APPNASYFVV GDQKDLADKL GKAWLTAEAG
PDFKQEDAAR KAANLNSLAY ARTFLSIMRQ AHRE