Gene Rleg_4944 details


Gene Information

Locus tag: Rleg_4944
Symbol:
ID: 8007537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 322849
End bp: 324015
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 57%
IMG OID: 644821861
Product: ABC transporter related
Protein accession: YP_002973121
Protein GI: 241113286
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
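
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 322849
end_bp = 324015

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1167 0 388
```

Under these assumptions the computed values agree with the listed gene length (1167 bp) and protein length (388 aa).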


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.719486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCCA TGACCTTCGA TGGGATCGGC AAGACCTTTC CGGACGGAAC CGTTGCCGTT 
GCGAATGTAA GTTTTTCGGT CGCCAACGGA GAATTCGTCG TGTTGGTCGG CCCGTCCGGT
TGTGGCAAGT CGACATTATT GCGGATCGCA GCGGGTCTTG AAACGCTCAA CAGCGGCCGG
TTGCTCATGG ATGACGCTAA TGTCACCGAG ACTGAGCCCC AGGACCGGGA TATCGCGATG
GTTTTTCAGA ACTACGCGCT TTACCCCCAT ATGACTGTCT ACGACAATAT GGCCTTCGGT
CTGCAGCAGC GCAAAATGCC CAAGGACAAG ATCGATAAGC TGGTGCGTGA CGCGGCGGAA
ATGCTCGACC TTACCCGCTA TCTCGAACGC AAACCAGGGG CGTTGTCGGG TGGCCAGCGC
CAACGTGTGG CAATGGGTCG GGCGATCGTT CGCCATCCCA TGGCCTTCCT GATGGACGAG
CCGCTTTCAA ACCTTGATGC CAAGCTCCGC GTACAGATGC GCGGCGAACT GAAGTTGCTC
AACCAGCGGC TCGGTGTAAC GACGCTCTAC GTAACCCACG ACCAGGTCGA AGCCATGACC
ATGGGCGATC GTGTCGCTGT GCTGAAGCCA GTATTCAATG GCGAGGAGAG CAATCTTCAG
CAGATCGACA CCCCGCAAAT GCTCTACGAC AAGCCCGCCA ACCTCTTTGT CGCGGGCTTC
ATCGGATCGC CGGCGATGAA TTTTGTGCGC GTCGAGTTGA CTGCGGAAGC CGGGTCACTC
AAAGCTGCGG TAACTGGAAC GCAGATATCC TTCTCCGTCG CCGCCAAGCC GGCACTTTCG
GAATATATAG GCCGGCAGGT CATCGTTGGA ATTCGCCCGG AGATGTTTCT GGTTTGCCCC
GCGTCTGAAG CCCTCTTCAA CGAGCAGGTC CCGGTTGCCG AAGCGCTGGG AGCCGACACC
TTCGTCTTTT TCGACATCGC GTCACCGCCG GTCAACGTAA ACGATGCCGA AGATACCGAA
GACTTTCCAA ACAAAGGTAA GAACCGACTT GTCGCGCGGA TCCCACCGGC GCTCACACCG
CGTCCCAACC AACATTTGCC GCTCACTGTC GATCTGGAGA AATTGCACTG GTTCGATCCG
GTAACCGGAA CTGCGATCCG AGACTGA
 
Protein sequence
MASMTFDGIG KTFPDGTVAV ANVSFSVANG EFVVLVGPSG CGKSTLLRIA AGLETLNSGR 
LLMDDANVTE TEPQDRDIAM VFQNYALYPH MTVYDNMAFG LQQRKMPKDK IDKLVRDAAE
MLDLTRYLER KPGALSGGQR QRVAMGRAIV RHPMAFLMDE PLSNLDAKLR VQMRGELKLL
NQRLGVTTLY VTHDQVEAMT MGDRVAVLKP VFNGEESNLQ QIDTPQMLYD KPANLFVAGF
IGSPAMNFVR VELTAEAGSL KAAVTGTQIS FSVAAKPALS EYIGRQVIVG IRPEMFLVCP
ASEALFNEQV PVAEALGADT FVFFDIASPP VNVNDAEDTE DFPNKGKNRL VARIPPALTP
RPNQHLPLTV DLEKLHWFDP VTGTAIRD
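
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below is an illustration rather than the tool used to produce this record. It shows one way to clean such a block, compute a GC fraction, and translate DNA with the codon assignments of translation table 11. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon assignments for translation table 11, in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence listed above.
first_bases = "ATGGCGTCCA TGACCTTCGA TGGGATCGGC"
last_bases = "GTAACCGGAA CTGCGATCCG AGACTGA"

print(translate(first_bases))  # prints: MASMTFDGIG
print(translate(last_bases))   # prints: VTGTAIRD
```

The two printed fragments match the start and the end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.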