Gene Rleg2_5903 details


Gene Information

Locus tag: Rleg2_5903
Symbol:
ID: 6977459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 318913
End bp: 320283
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 62%
IMG OID: 643393356
Product: glycoside hydrolase family 4
Protein accession: YP_002278174
Protein GI: 209546284
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
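
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the coding sequence ends with a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 318913
end_bp = 320283

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1371
print(protein_length)  # 456
```

Under these assumptions the computed values agree with the listed gene length (1371 bp) and protein length (456 aa).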


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.670067
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCAA ATCCCAAGAT CACATTCATC GGAGCTGGCT CCACCGTTTT CATGAAGAAC 
ATCATCGGCG ACGTTTTGCA GCGTCCCGCC CTTTCGGGCG CAACGATCGC GCTGATGGAT
CTCAACCCGC AGCGGCTTGA GGAAAGCGCC ATCGTCGTCA ACAAGCTGAT CTCGACGCTC
GGCGTCAAGG CCAAGGCCGA GACCTATTCC GACCAGCGCA AGGCGCTAGC CGGCGCCGAT
TTCGTCGTCG TCGCCTTCCA GATCGGCGGC TATGAGCCCT GCACCGTCAC CGATTTCGAA
GTGCCGAAGA AATACGGCCT GCGCCAGACG ATCGCCGATA CGCTCGGCGT CGGCGGCATC
ATGCGGGGCT TGCGCACCGT GCCGCATCTC TGGAAGGTCT GCGAGGATAT GCTCGCCGTC
TGCCCCGAGG CGATCATGCT GCAATATGTC AACCCGATGG CGATCAACAC CTGGGCGATC
TCCGAGAAAT ACCCGACCAT CAGCCAGGTC GGCCTCTGCC ATTCGGTGCA GGGCACGGCG
ATGGAGCTGG CCCATGACCT CGACATTCCC TACGAGGAAA TCCGCTACCG CGCGGCCGGC
ATCAACCACA TGGCCTTCTA TCTCAAATTC GAGCATCGCC AGGCCGACGG CTCCTACCGC
AATCTCTATC CCGATCTCGT GCGCGGCTAC CGCGAGGGCA GAGCGCCGAA GCCCGGCTGG
AACCCGCGCT GCCCGAACAA GGTGCGCTAC GAGATGCTGA CGCGGCTCGG CTATTTCGTC
ACCGAAAGCT CGGAGCATTT CGCCGAATAC ACGCCCTATT TCATCAAGGA AGGCCGCGAC
GACCTGATCG AGAAATTCGG CATTCCGCTC GATGAATATC CGAAACGCTG CATCGAGCAG
ATCGAGCGCT GGAAGGGCCA GGCGGAAGCC TATCGCAGCG CCGACAAGAT CGAGGTGACG
CCCTCGAAGG AATACGCTTC CTCGATCATC AACTCGGTCT GGACCGGCGA ACCCTCTGTC
ATTTACGGCA ATGTCCGCAA CAATGGCTGC ATCACCTCGC TGCCCGCCAA TTGCGCCGCC
GAAGTGCCCT GCCTCGTCGA CGCCTCCGGC ATCCAGCCGA CCTTCATCGG CGACCTGCCG
CCGCAGCTGA CCGCGCTGAT CCGCACCAAT ATCAACGTCC AGGAACTGAC GGTGCAGGCG
CTGATGACCG AAAATCGCGA GCACATCTAC CACGCCGCGA TGATGGACCC GCACACGGCC
GCCGAACTCG ACCTCGACCA GATTTGGTCG CTGGTCGACG ACCTGCTCGC CACCCACGGC
AACTGGCTGC CCGAATGGGC CCGCACATCT AGAAAAGTTC AAGCCGCCTG A
 
Protein sequence
MAANPKITFI GAGSTVFMKN IIGDVLQRPA LSGATIALMD LNPQRLEESA IVVNKLISTL 
GVKAKAETYS DQRKALAGAD FVVVAFQIGG YEPCTVTDFE VPKKYGLRQT IADTLGVGGI
MRGLRTVPHL WKVCEDMLAV CPEAIMLQYV NPMAINTWAI SEKYPTISQV GLCHSVQGTA
MELAHDLDIP YEEIRYRAAG INHMAFYLKF EHRQADGSYR NLYPDLVRGY REGRAPKPGW
NPRCPNKVRY EMLTRLGYFV TESSEHFAEY TPYFIKEGRD DLIEKFGIPL DEYPKRCIEQ
IERWKGQAEA YRSADKIEVT PSKEYASSII NSVWTGEPSV IYGNVRNNGC ITSLPANCAA
EVPCLVDASG IQPTFIGDLP PQLTALIRTN INVQELTVQA LMTENREHIY HAAMMDPHTA
AELDLDQIWS LVDDLLATHG NWLPEWARTS RKVQAA
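
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the table is the standard codon assignment, which translation table 11 shares (table 11 differs only in its permitted start codons). Only the first 30 bases and the final line of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_block = "ATGGCAGCAA ATCCCAAGAT CACATTCATC"
# Final line of the gene sequence (begins in frame, at base 1321).
last_block = "AACTGGCTGC CCGAATGGGC CCGCACATCT AGAAAAGTTC AAGCCGCCTG A"

print(translate(first_block))  # MAANPKITFI
print(translate(last_block))   # NWLPEWARTSRKVQAA
```

Both outputs match the corresponding start and end of the protein sequence listed above, and the final TGA codon is the stop.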