Gene Rleg2_4755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4755 
Symbol 
ID6977849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp386954 
End bp388270 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content55% 
IMG OID643393922 
Productglycoside hydrolase family 4 
Protein accessionYP_002278740 
Protein GI209546822 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAA TTTGCCTTAT CGGGGCTGGG AGTACCGTCT TCGCCCAGAA CATTCTGGGA 
GATGTGCTGT CCACGCCCTC AGACCATGAC TACATCATCG GCCTTTTTGA CATCGATCCA
GAGCGCCTCA AAACATCCGA GATCGTCGCG CGGCGCATAT GCGCGTCGCT GAAGCTCGAC
ACGGTTCGGA TCGAGGCAAC GCTCAACCGC CGGGAGGCGC TGAAAGGTGC CGATTTCGTG
ATCCTGATGA TGCAGGTTGG CGGCTATAAA CCGGCAACGG TCACGGATTT CAACGTGGCG
AAGAACTACG GCTTGCGCCA GACCATTGCG GATACGCTTG GCATCGGCGG CATCTTCCGC
GGTCTCAGGA CGATCCCAGT CCTCGAAAGC ATCTGCGGCG ACATGGAAGA GGTGTGCCCG
AATGCATTGC TAATGCAGTA TGTAAACCCG ATGGCGATTA ATTGCTGGGC GATCAAGGAA
ATCGCCCCGA GCATTCGCAC CGTCGGTCTC TGCCACAGTG TTCAACATAC GGCAGATCAC
CTTGCCAGGT GCCTCGGCGA GAAAATCGAC AATATCAGTT ACATCTCAGC CGGCATCAAC
CATATCGCTT TCTTCCTTAA ATATGAAAAG CTCCATGGCG ATGGCAGCCG CGAAGACCTT
TATCCCAAGC TGAAGGCGCT GGCCGCGGAG GGCAAGGTTC CCGCAGATGA CCGTGTTCGC
TTTGATGCCC TGAAAAGGCT CGGTCATTTC GTGACCGAAT CCAGCGAGCA TTTTGCGGAA
TATACGTCAT GGTACATCAA GAACCACCAA CCGGAATTGG TAGACCAGCT CAACATTCCA
CTCGACGAAT ATATTCGCCG TTGCGAGCTG CAGATCTCAC AATGGCATGT CCTGCGGCAG
GACCTCGAAG GGGGAAGACC GATCGAAGTA TGCCGCAGCA ATGAATATGC TTCAGGCATT
ATTCATGCTG CGGTGACCGG GAAGCCGGCG CTGATTTATG GAAATGTGCC GAACAACGGC
CTGATTGAAA ATCTTCCGCC AGAATGCATT GTCGAAGTTC CATGCCATGT CGATCGCAAT
GGCGTCCAAC CGACGCGGAT CGGTAGGATC CCTTCTCAAT TGGCCGCCGT CATGCGGCTG
AGCATTTCCG TGCAGGAGCT CACTGTCGAA GCGGCACTGA CAGGCAAGCG TGACCGCATC
TATCAGGCCG CGCTGCTCGA TCCGCACACC TCGGCGGAAC TTTCGCCTGA TAAAATCTGG
CATATGGTCG ATGACCTCAT CGAGGCACAT GGCGATCTGC TGCCGAACTA CCACTGA
 
Protein sequence
MPKICLIGAG STVFAQNILG DVLSTPSDHD YIIGLFDIDP ERLKTSEIVA RRICASLKLD 
TVRIEATLNR REALKGADFV ILMMQVGGYK PATVTDFNVA KNYGLRQTIA DTLGIGGIFR
GLRTIPVLES ICGDMEEVCP NALLMQYVNP MAINCWAIKE IAPSIRTVGL CHSVQHTADH
LARCLGEKID NISYISAGIN HIAFFLKYEK LHGDGSREDL YPKLKALAAE GKVPADDRVR
FDALKRLGHF VTESSEHFAE YTSWYIKNHQ PELVDQLNIP LDEYIRRCEL QISQWHVLRQ
DLEGGRPIEV CRSNEYASGI IHAAVTGKPA LIYGNVPNNG LIENLPPECI VEVPCHVDRN
GVQPTRIGRI PSQLAAVMRL SISVQELTVE AALTGKRDRI YQAALLDPHT SAELSPDKIW
HMVDDLIEAH GDLLPNYH