Gene Rleg_4345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4345 
Symbol 
ID8015919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4468871 
End bp4469830 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content59% 
IMG OID644826921 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002978124 
Protein GI241207028 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0000915897 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAATG CGATGCGCAC GCCCGCAGGG GCCATCAATG TGGAAAGATA TCAGGGGGCC 
GTGGCCGAAG GACGCTTCAG GCGTCTCTGG AATGCCAATG CTCCCGGCTA TCTCTTCCTT
CTTCCATGGC TCATCGGCTT TTTCGGGCTG ACGCTCGGGC CGGCCCTGAT TTCGCTCTAC
CTTTCCTTCA CCGACTTCGA CATGCTTCAG TCGCCGCGGT GGGTGGGAAT GGCGAATTAC
GTGCGCATCG CCACGGCGGA CCCGAAATTC TCGGCCGCCA TGCATGTCAC CCTGACCTAT
GTCGTCTTCT CGGTGCCGTT CAAGCTGACC TTCGCATTGC TGGTCGCCAT GGCGCTGAAC
CGCGGCTTGC GTGGGCTGTC GGTCTATCGT GCCATCTTCT ATCTGCCGTC ACTGCTGGGC
GGTAGCGTGG CGATCGCCGT GCTCTGGCGT CAGCTTTTCG CCAGCGATGG CCTCGTCAAT
GCCGCGCTTT CATATTTCGG TATCGAAGGT CCAAGCTGGA TCTCGCATCC GAACTATTCG
ATCTACACGC TGGTGGCTCT TTCCGTCTGG CAGTTCGGCT CGCCGATGAT TATTTTCCTG
GCCGGCCTGC GCCAGATCCC GCAGGATATG TATGAGGCCG CGAGCCTCGA TGGCGCCTCC
AAGTTCCGGC AATTCTACAA GATCACGCTG CCGCTTCTGA CGCCGGTGAT CTTTTTCAAT
GCCGTCGTTC AGACGATTGA TGCTTTCAAG GCCTTCACGC CGGCCTTCAT CATATCAGGC
GGCACCGGCG GTCCGATCAA CTCGACGCTG TTCTACACGC TCTACCTCTA TCAGGAAGCC
TTCGGCAATT TCCGCATGGG CTACGCCTCG GCGCTCGCCT GGATCCTGGT GGTGATCATC
GCGATCTTCA CCGCCTTCTC CTTCCTGACC TCGCGTTATT GGGTGCACTA CGATGACTGA
 
Protein sequence
MSNAMRTPAG AINVERYQGA VAEGRFRRLW NANAPGYLFL LPWLIGFFGL TLGPALISLY 
LSFTDFDMLQ SPRWVGMANY VRIATADPKF SAAMHVTLTY VVFSVPFKLT FALLVAMALN
RGLRGLSVYR AIFYLPSLLG GSVAIAVLWR QLFASDGLVN AALSYFGIEG PSWISHPNYS
IYTLVALSVW QFGSPMIIFL AGLRQIPQDM YEAASLDGAS KFRQFYKITL PLLTPVIFFN
AVVQTIDAFK AFTPAFIISG GTGGPINSTL FYTLYLYQEA FGNFRMGYAS ALAWILVVII
AIFTAFSFLT SRYWVHYDD