Gene Rleg_5221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5221 
Symbol 
ID8007116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp633004 
End bp634149 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content61% 
IMG OID644822130 
ProductABC transporter related 
Protein accessionYP_002973390 
Protein GI241113555 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.533287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.501993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGTGG TTGTACTCGA CAAAATCTGC AAGACCTATG GAAACAGCTA CCATGCGATC 
AAGGATCTGA GCCTGACGAT CCATGATGGC GAGTTTCTGA TTCTGGTCGG GCCGTCCGGA
TGCGGAAAAT CGACCGCTCT GCGCATGATT GCCGGGCTCG AGGAAATCAG CAGCGGAACA
TTGAGCATCG GCGGCCAGGA CGTCGTGGAT CTCGCGCCCA AGGACCGGGA CATTGCCATG
GTCTTTCAGA GCTATGCGCT TTATCCGCAC ATGACCGTCT TCGATAACAT TGCCTTTTCG
ATGAAGCTGG CCGGAAAGAA CAAGGCCGAA CGCACCAAAC GTGTCCACGA AATCGCCAAG
ATCCTGCAGC TGGAGCCCTT GCTGGGCAAC AAGCCCGCGC AGCTTTCCGG CGGCCAGCGC
CAGCGTGTTG CGATGGGCCG CGCCATGGTC CGCGAGCCCG CGGCATTCCT CATGGACGAA
CCGCTCTCGA ACCTCGATGC GAAGCTGCGT GTTCAGATGC GGGCAGAGAT CGCCAGCCTG
CAGAGACAGC TGGGCGTGAC GACGATCTAT GTGACGCACG ACCAGACTGA AGCGCTGACC
ATGGGCGATC GGGTCGCGGT GCTGAAGGGC GGCGTGCTGC AGCAGGTGGA TACGCCCAAG
GCTCTGTATC ACCGCCCGGT CAATGCGTTT GTCGCCGGCT TTATCGGTTC GCCGTCGATG
AACCTTTTCG AAGGGCGTCT GGCGGGCGGA CGGATCCATC TGCCGGGCTT CTCCATCCCC
TTGTCCGGCG GCGCCTTCGA GCGCTCTCCC GGTCTATCCG CTTTCGAGGG AAAGGATGTG
ATCTTTGGGG TCAGGCCCGA GGACCTCTAC GACAGCCGGT TGCCATCTGG CGCCTCCCAT
CCGACGATCC CGGTTGTCGT GAAATCGATC GAGGAGCTTG GCTCCGAGCT GATCGTGCAT
TTGAAGATCG ACGCGGTCCG CATCGACTCG GGCGACCCCG ATGCCGTCGA GGACCTGAGC
GGGGCCGCCA ATGCCGTCGC GCGGTTCGAA GCGGTCAGCG CGGTCGAGAC AGGCCAATCG
ATCGACCTGG CCATCGACCC GGCGAAACTG CACTTTTTCC ACCCTCAAAC GCATATGGCG
CTGTGA
 
Protein sequence
MAVVVLDKIC KTYGNSYHAI KDLSLTIHDG EFLILVGPSG CGKSTALRMI AGLEEISSGT 
LSIGGQDVVD LAPKDRDIAM VFQSYALYPH MTVFDNIAFS MKLAGKNKAE RTKRVHEIAK
ILQLEPLLGN KPAQLSGGQR QRVAMGRAMV REPAAFLMDE PLSNLDAKLR VQMRAEIASL
QRQLGVTTIY VTHDQTEALT MGDRVAVLKG GVLQQVDTPK ALYHRPVNAF VAGFIGSPSM
NLFEGRLAGG RIHLPGFSIP LSGGAFERSP GLSAFEGKDV IFGVRPEDLY DSRLPSGASH
PTIPVVVKSI EELGSELIVH LKIDAVRIDS GDPDAVEDLS GAANAVARFE AVSAVETGQS
IDLAIDPAKL HFFHPQTHMA L