Gene Rleg_6384 details

Gene Information

Locus tag: Rleg_6384
Symbol:
ID: 8016998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012854
Strand:
Start bp: 95171
End bp: 96310
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 61%
IMG OID: 644828179
Product: sugar ABC transporter, substrate-binding protein
Protein accession: YP_002979379
Protein GI: 241554166
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
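
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(95171, 96310)
print(length)                     # 1140
print(protein_length_aa(length))  # 379
```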


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.108159
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.0518769
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACATTA TATTGGACAC CAATGGCAAG CCGTCAACGC TGCCGCACAC AAAAAAGGGG 
ATCGACATGA AGAATCTTGA AAACGGCATT TCCGCTTCGC TACGCCGCCA GCTTCTCGCC
GGTGCTGCCG CCGCGGCCGC ACTCCTCGTC TTTTCGGCCG GTACGGCCTC GGCCGCCGCC
AATTGCATCA AGGGTGACAG GAAAGCGCCC TATACGATCG GCTGGGCAAA CATCTATTCG
GTGCCGACCT GGATGAAGCA GACCGAAGGC ACCATCACGG CCGAAGTGGA GGAGCTGAAG
AAGGCGGGCC TGGTGAAGGA CCTGATGATC ACGGACGCGC AGGGTAACGC CCAGACCCAG
ATCCAGCATA TCCAGTCGAT GATCGACGCC AATGTCGACG CCATCGTCGT GATCGCCGGT
TCCTCCAACG CGCTCGACCG CGTCATATCA GATGCCTGCG ACAAGGGCAT TGCCGTCGTG
AATTTCGACA GTCTGGTCAA TACCGACAAG GTGACGGCGA AGATCAACAC CGATTCCAAC
GAATGGGGCG CGACCGCTGC CAAGTGGATG GTCGGCCAGC TCGGCGGCAA GGGCAAGATC
ATCATCATGA ACGGCCCGGC CGGCATTTCG GTGAGCGACG ACCGCCGCAA GGGCGCCCAG
CCGGTCCTTG ACGCCAATCC CGGTCTCCAG GTGATCACCG AGACGAACAC GGAATATAAC
GTCGCCCCGG CACAGGAAGC GATGACCAGT CTGCTCTTTG CCAATCCCGA AATCGACGGC
GTGCTGTCGC TCGGCGGCGC GCTATCGGCC GGCTCGGTGC TGGCCTTCGA GCGTCAGGGC
CGCGACCAAG TGCCGACAAC AGGCGAAAAC GCAAGGCAGT TCCTGGAGCT CTGGAAGGAG
AAGGGACTGA AGGGCTGGGC CACCATGCAG CCCAACTGGC TCGGCGCGCT GTCTGTTTAC
ACCGCCGTGC AGGCGCTGGA AGGCAAGGAC GTTCCGGCCT TCGTCAAGGT GCCGCTGCCT
GTCATCGACG ACAGCACGAT CGGCAGCTAC CTCGCCCGGG CCGACAAGTT CCCGGCGGAC
GGCTATATCT ACTCGGACTA CGACAAGGCG CTCTTCGACA AGCTGCTTGC CGCCAAGTAA
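
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be analyzed. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) clean a block-formatted sequence and compute its GC percentage, the quantity reported in the GC content field. Only the first displayed line is used here; pasting in the full sequence would give the figure for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, exactly as displayed (six 10-base blocks).
first_line = "ATGTACATTA TATTGGACAC CAATGGCAAG CCGTCAACGC TGCCGCACAC AAAAAAGGGG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```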
 
Protein sequence
MYIILDTNGK PSTLPHTKKG IDMKNLENGI SASLRRQLLA GAAAAAALLV FSAGTASAAA 
NCIKGDRKAP YTIGWANIYS VPTWMKQTEG TITAEVEELK KAGLVKDLMI TDAQGNAQTQ
IQHIQSMIDA NVDAIVVIAG SSNALDRVIS DACDKGIAVV NFDSLVNTDK VTAKINTDSN
EWGATAAKWM VGQLGGKGKI IIMNGPAGIS VSDDRRKGAQ PVLDANPGLQ VITETNTEYN
VAPAQEAMTS LLFANPEIDG VLSLGGALSA GSVLAFERQG RDQVPTTGEN ARQFLELWKE
KGLKGWATMQ PNWLGALSVY TAVQALEGKD VPAFVKVPLP VIDDSTIGSY LARADKFPAD
GYIYSDYDKA LFDKLLAAK
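
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact NCBI amino-acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. This gene begins with ATG, so no special start-codon handling is included, which is a simplification. The `translate` function is illustrative and is not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence, as displayed.
print(translate("ATGTACATTA TATTGGACAC C"))  # MYIILDT
# Last four codons, ending in the TAA stop.
print(translate("GCCGCCAAGTAA"))             # AAK
```

Both outputs match the start (MYIILDT) and end (AAK) of the protein sequence listed above.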