Gene Rleg_5199 details


Gene Information

Locus tag: Rleg_5199
Symbol:
ID: 8007094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 610493
End bp: 611431
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 58%
IMG OID: 644822108
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_002973368
Protein GI: 241113533
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
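
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record: Start bp 610493, End bp 611431.
    length = gene_length(610493, 611431)
    print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed gene length (939 bp) and protein length (312 aa).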


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0867066
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCGA CAAGCATCGA AACGGCCGCA GTGGCCAAGA CCTCGAAAGG AAGGAGCAAA 
CCGGATCGGT TTGCTCCCAA CTACTGGCCT TTCGTTATCC CGGCGCTCAT CGTCATTGCG
GCCGTCATCG TTTTTCCATG GGTGTTTACC CTTTGGATGA GCGTCAACAG CTGGACGCTT
GGCCAATCCC AGGTCTTTGC GGGACTGGAC AACTATGCCC GCCTGGTCGT GGACATGCGC
TTCTGGGACT CGCTGTGGCA TACGGTGCTC TATACGACGC TCTCCGTGGT GGCGCCCCTT
TTCCTGGGGA CGCTCGCCGC GCTGATCTTT GATACGCAAT TTCCGCTGCG CGGCCTTCTG
CGTGGCATAT TCGTGATGCC GATGATGGCG ACGCCGGTCG CCATCGCCCT CGTCTGGACG
ATGATGTTCC ACCCGCAGCT CGGCGTCCTC AACTATCTTC TCTCCCTCAT GGGCATCGGC
CCGCAGGAAT GGATCTATAA CCAGAACAGT GTCATCCCAT CGCTGGTGCT GGTCGAGACG
TGGCAATGGA CGCCGCTTAT CATGCTGATC GTGCTCGGTG GTCTGGCGGC GGTTCCGCGC
GAGCCCTATG AAAGTGCCGA AATCGACGGA GCCAATGTCT GGCAGAAATT TCGCTACCTG
ACGCTGCCGA TGATCGCGCC ATTTCTGATG ATTGCCGTGA TGATCCGCAG CATCGATGCG
GTGAAAAGTT TCGACATCAT CTACGCCATG ACCCAAGGCG GCCCGGGCAC GGCGTCGGAA
ACGATCAACA TCTATCTCTA CAACACTGCT TTCGCTTACT ACGATATTGG CTATGGATCT
GCGATGGCGG TTGTCTTCTT CATCCTCATC GTCCTGCTCG CCTTCGTCCT GATGATGCTG
CGGCAGCGCG CGAACTGGTC CGACGGGGAG GCACGCTGA
 
Protein sequence
MASTSIETAA VAKTSKGRSK PDRFAPNYWP FVIPALIVIA AVIVFPWVFT LWMSVNSWTL 
GQSQVFAGLD NYARLVVDMR FWDSLWHTVL YTTLSVVAPL FLGTLAALIF DTQFPLRGLL
RGIFVMPMMA TPVAIALVWT MMFHPQLGVL NYLLSLMGIG PQEWIYNQNS VIPSLVLVET
WQWTPLIMLI VLGGLAAVPR EPYESAEIDG ANVWQKFRYL TLPMIAPFLM IAVMIRSIDA
VKSFDIIYAM TQGGPGTASE TINIYLYNTA FAYYDIGYGS AMAVVFFILI VLLAFVLMML
RQRANWSDGE AR