Gene Rleg_4820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4820 
Symbol 
ID8007208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp195388 
End bp196467 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content60% 
IMG OID644821750 
ProductABC transporter related 
Protein accessionYP_002973010 
Protein GI241113175 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATC ATCGTTTCGG CGGCATCAAG ATCCGCCATC TCTACAAGAT CTTCGGTCCC 
AATCCCGGCG CTCATGTCGA TGCTGTCAAG AAGGGATTGT CGAAGACCGA TCTCAACGAA
AAGCACGGCC ACGTGCTCGG CCTGAAAGAT ATCAATGTCG AGATACCCTC GGGCCGCATC
CAGGTCATCA TGGGCCTTTC GGGTTCGGGC AAATCGACGC TCATCCGCCA CATCAACCGT
CTGATCGATC CGACATCAGG CGAAGTGCTT GTCGACGGCG TCGATGTGGT GAAAATGAAC
GAGACCGAAT TGCGGACGTT TCGGCGCCAC CAGACGGCAA TGGTGTTTCA GAAGTTCGCG
CTTCTGCCGC ACCGGAATGT TCTCGACAAC ACTATCTTCG GCCGCGAAGT GCAGGGCATG
GAGCGCTCGA AAGCCGTTGA CGTCGCCATG GGCTGGCTGG AGCGGGTCGG CCTGAAGGGC
TTCGAGAGCA AATATCCCAA CCAGCTGTCG GGCGGCATGC AGCAGCGCGT CGGGCTGGCA
CGCGCCCTCT CGAACGATGC GCCGGTTCTG CTGATGGACG AGGCCTATTC GGCGCTCGAC
CCTTTGATCC GCACCGATAT GCAGTCGGTC CTTCTCGACA TTCAGAAGGA GATCAAGAAG
ACCATCGTCT TCATCACCCA TGATCTCGAT GAGGCCCTTC GGCTGGGCGA CCAGATTGCG
ATCCTGCGCG ATGGCGAAGT CATCCAGCAG GGCACCAGCC AGGACATCGT TCTGCGTCCG
GCAGACGCCT ACATCGCCAA CTTCGTCAAG GAGGTCAATC GAGGCCGGGT CATCCAGGTC
GATGCCATCA TGACATCCCT GCATTCCGGC GCGGTCCCAG GTGGATTGAC GATCGCATCA
GGCACGACCG TCGAAGACGC CGTCCGGATT CTGGCATCGG AGGCGCACGA CGATGCCAGG
GTGGTTTCAC CATCCGGGGA AGCATTGGGG CTTGTTACCT TCCGCCAGCT CGCGGGCGCG
ATGGTGAACT CGCACGAGGT GGCTCCCCGA CGCGACAGTG CCCTCTCGGT CGCACTTTGA
 
Protein sequence
MADHRFGGIK IRHLYKIFGP NPGAHVDAVK KGLSKTDLNE KHGHVLGLKD INVEIPSGRI 
QVIMGLSGSG KSTLIRHINR LIDPTSGEVL VDGVDVVKMN ETELRTFRRH QTAMVFQKFA
LLPHRNVLDN TIFGREVQGM ERSKAVDVAM GWLERVGLKG FESKYPNQLS GGMQQRVGLA
RALSNDAPVL LMDEAYSALD PLIRTDMQSV LLDIQKEIKK TIVFITHDLD EALRLGDQIA
ILRDGEVIQQ GTSQDIVLRP ADAYIANFVK EVNRGRVIQV DAIMTSLHSG AVPGGLTIAS
GTTVEDAVRI LASEAHDDAR VVSPSGEALG LVTFRQLAGA MVNSHEVAPR RDSALSVAL