Gene Rleg_4598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4598 
Symbol 
ID8015345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4722869 
End bp4724389 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content60% 
IMG OID644827175 
ProductABC transporter related 
Protein accessionYP_002978375 
Protein GI241207279 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0392025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGACG CAGCAGATGA CGGCATTTTG AGGCTGGAAG GGATCGGCAA GCGCTTCCCG 
GGCGTCGTGG CGCTCAAGGA TGTCTCCATG CGGATCGGCC GCGGCAAGGG GCACGTTCTT
CTCGGCGAAA ACGGTGCTGG AAAGTCGACC CTGATCAACC TGCTCGGCGG TGTCTTCAGG
CCGGATGACG GTCACATTCT GTTCGACGGC AAGCAATACC ATCCGGCCTC GCCGTTGGAG
GCCTTCAAGG CGGGCATCCG GGTCATTCAC CAGGAACTGC ACTCCCTTTC CAACCTGACG
GTGGCGGAAA ACCTGCTGTT CGAGCATCTA CCGCGCCGCT ACGGCCTGGT GAACTACAAG
GAAATGAACG GCAGAGCGGC CGAACTCCTG GAAGAGGTCG GTCTGGATGT CGCCCCGACG
ACACTGGCGA GCCGCCTCAG CGTCGCCCAG CTTCAACTGG TCGAGATCGC CAAGGCGCTT
TGTTATGAAA GCAAGCTTCT CGTTCTCGAC GAACCGACGG CAACCCTGAC CTCCAAGGAG
GTCGATCGCC TCTTCGAGAT TCTCAAGCGG CTGAAGCGGC GTGGTGTGAC CACGCTCTAC
ATCTCGCACA GGCTGGAAGA GATCTTTGAC GTCGGCGACG ATGTGACGGT GCTGCGCGAC
GGCCAGCACG TGATCACGCG TCCGCTGGCC GGACTTGCTA TTCCTGACAT CGTCGAACTC
ATGGTCGGAC GGAAGCTTGC GGATCATGGC ATTTTTCGCA GCGACAGCAC CGTGTCCGGC
GAGGCGCTCG GCGTGTCCGG CCTGAAGGTG ACGCGCAATA GCCCGGAACT GTCATTTTCC
GTGGCAAAGG GGGAAATCGT CGGCATTGCC GGCCTTGTCG GCAGCGGGCG GACCGAGGCG
GTTCGCGCCA TATTCGGCGC CGATGCCAAA GCCGCCGGCG AAATTCGGGT GAATGGCGAT
CCGGTTGAGA TCCATTCGCC GAAGGATGCC GTTGCTGCCG GCCTCTGCCT GGCAACCGAA
GACCGCAAGA CGCAGGGCTT GATGCTCGAT ATGAACTGCG CTGAAAATGC CACCATAACC
GATCTCGCCA AGATCTCCCG CAACGGTTTG ATCATGCGAA GGGCGCAGGA CGATCATTCG
CAGCGCCTCG TGCGCGAACT GCGCATCAAG ACCCCGTCCA TCCATCAGGC GGTCAGGACC
TTTTCCGGCG GCAACCAGCA AAAGGTCGTC ATTGCAAAAT GGCTGTTCCG CGGCCCTAAA
GTTCTGATTT TCGATGAGCC GACCCGTGGA ATCGACGTCG GTGCGAAGGC CGAGATTTAC
GAGCTTCTGT GGAAGTTTGC GGCCGAAGGA AAAGGCGTTC TGGTCGTATC GTCGGATCTG
CCGGAGCTCA TCGGCATTTG CCATCGCATC ATCGTCTTCT CCGACGGCAA GATATCGGGC
GAAATAGTTC GGGAACAGTT TGACGAGAGC AGGATCCTTT CGCTCGCCTA CAAGGAGTAC
AGTCGTGTCC GCCAACATTG A
 
Protein sequence
MSDAADDGIL RLEGIGKRFP GVVALKDVSM RIGRGKGHVL LGENGAGKST LINLLGGVFR 
PDDGHILFDG KQYHPASPLE AFKAGIRVIH QELHSLSNLT VAENLLFEHL PRRYGLVNYK
EMNGRAAELL EEVGLDVAPT TLASRLSVAQ LQLVEIAKAL CYESKLLVLD EPTATLTSKE
VDRLFEILKR LKRRGVTTLY ISHRLEEIFD VGDDVTVLRD GQHVITRPLA GLAIPDIVEL
MVGRKLADHG IFRSDSTVSG EALGVSGLKV TRNSPELSFS VAKGEIVGIA GLVGSGRTEA
VRAIFGADAK AAGEIRVNGD PVEIHSPKDA VAAGLCLATE DRKTQGLMLD MNCAENATIT
DLAKISRNGL IMRRAQDDHS QRLVRELRIK TPSIHQAVRT FSGGNQQKVV IAKWLFRGPK
VLIFDEPTRG IDVGAKAEIY ELLWKFAAEG KGVLVVSSDL PELIGICHRI IVFSDGKISG
EIVREQFDES RILSLAYKEY SRVRQH