Gene Rleg2_4453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4453 
Symbol 
ID6977547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp84849 
End bp85886 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content66% 
IMG OID643393631 
Productinner-membrane translocator 
Protein accessionYP_002278449 
Protein GI209546531 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4177] ABC-type branched-chain amino acid transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.57987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00971028 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGTTC AGCCGCAAAC ATTCGCCTCC AACGCCGAAG GCCTGTCGAT AATGCGCCTG 
CCGAAGCAGC TTCTGCATGT CCCCGTGACG ATGGCGCTCT TCGCGGCCGT GGTCGCCTTG
CTCGCCAATT CCTTCTGGCT TTCGGCGGCA ACCTCGGCCG TTGCGCTCTC GCTGTCCGTG
GCCGGCCTTG CCATTCTCTA TGGCCAGCTC GGCCTGGTTT CCCTCTGCCA GTTCGCCCTC
GTCGGCGTCG GCGGCTGGGT AACGCTGCGC ATCGGCCACG CCTTTCATCC ACCCTTCGAG
GTCAGCCTGC TTGCCGGCGG CATCGTCGCC TCGGCGGTCG GGCTCGCCTT CGGCGTGCCG
GCCTTGCGGC TGCGTGGTCT CTACCTCGCG CTCGTCACCC TGATGCTCGC CGGCGCCTTC
CAGATCATCA TCAGCGCTTG GGGTTTCCCT GATGGCGGTC CTGGTTTTCT CGGTCGGGCG
GACGGTTCGG GGCGCGAGAT GCTGGCGAGG CCTGCCATGG CCGACGGCGA GGTTGCCTAT
TTCCTCTATG TCTGCGCGGC AGCTGCGGTC GGTCTTTTGA TCGCGCAATG GCATAAGCTT
GCGCGTCCCG GCAGGGCCTG GGCGCTGATC CGCAAGGGCG AGACTGTCGC CGTCGCCAGC
GGCGTCAATG TTCTCATCTA CAAGGCCTGG GCCTTTGCGC TCAGCGGCTT GCTCGCCGGC
CTTGCCGGCG GGCTGCTGGC CGGCAATGTC GGCCAGCTCG ACGGGCGCGC CTTCGGCGCC
TTCGAGAGCC TCAATCTCTT TGCGCTCGCC ATCGTCGGCG GTGTCTTCAA CTGGTATGGC
GCGCTGATCG CCGGGCTTCT GCTACGCGCG GTACCGGCGC TGCTCACCGA TCTCGGCATC
GACGGTTATG TCACGATCGG CATTTTCGGC GTCGCCTTGT TCCATGCGCT GGCAACGGCG
CCGACCGGCA TAGCCGGCCA GATCGCGGCT CTGCTCGCCC GTCTAAACAC CGGCCTATCA
AGGGGGAGGG CACGATGA
 
Protein sequence
MSVQPQTFAS NAEGLSIMRL PKQLLHVPVT MALFAAVVAL LANSFWLSAA TSAVALSLSV 
AGLAILYGQL GLVSLCQFAL VGVGGWVTLR IGHAFHPPFE VSLLAGGIVA SAVGLAFGVP
ALRLRGLYLA LVTLMLAGAF QIIISAWGFP DGGPGFLGRA DGSGREMLAR PAMADGEVAY
FLYVCAAAAV GLLIAQWHKL ARPGRAWALI RKGETVAVAS GVNVLIYKAW AFALSGLLAG
LAGGLLAGNV GQLDGRAFGA FESLNLFALA IVGGVFNWYG ALIAGLLLRA VPALLTDLGI
DGYVTIGIFG VALFHALATA PTGIAGQIAA LLARLNTGLS RGRAR