Gene Rleg_4801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4801 
Symbol 
ID8007485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp170237 
End bp171556 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content64% 
IMG OID644821731 
Productputative branched-chain amino acid ABC transporter, substrate-binding protein 
Protein accessionYP_002972991 
Protein GI241113156 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.409444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.455184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGCA TTTTCCGCAA CGGCAAATTC AACGCGGCAT CGCTGAGCCG CCGCTCTTTC 
ATCGCATCGA CCGTGGCGGG TGGTGCGGCG CTGGCGCTTT CCGGCCGCAC GGCCTTCGCT
CAGAGCGGCG ATACGTTGAA GGTCGGCTTC ATCAGCCCGC GCACCGGCCC TCTCGGCGGC
TTCGGTGAGA CGGACGGCTA CGTGCTGGAA CTGGCGCGCA AGGCGCTGGC GAACGGCCTG
CAGGCGGGCG GCAAGACTTG GAAGGTTGAG ATCCTCGACC AGGACACCCA ATCCGATCCC
TCGCGCGCCG GCCAGCTGGC GAAGGACCTG ATCAACAACC AGGCGATCGA TCTGATGCTT
GCCGTCTCGA CGCCCGAAAC CATCAATCCC GTGGCTGACG CATGCGAAGC AGCCGGCATT
CCCTGCCTCT CGACGGTCAT GCCCTGGGAA GCCTGGTATT TCGGCCGCGG CGCCAAGCCG
GGCGCGCCCT CGCCGTTCAA GTGGACCTAT CATTTCGGCT TCGGTGTCGA AGAGTTCCAC
AAGGCCTATG TTTCGCAGTG GAACCTGATC GAGACCAACA AGAAGGTCGG CGTCATGTAT
CCCAACGACG CCGACGGCAA TGCGATCCGC ACCCATCTGG CGCCGGCGCT CGCCAAGGCC
GGCTTCACCA TCGTCGATCC CGGAGCCTAT GAAACCGGAA CCACCGACTT TACCGCGCAG
ATCGCTCTCT TCAGGCAGGA GGGCGTGGAG ATCTTCAACT CGTTCCCGAT CCCGCCCGAC
TTCGCCGCCT TCTGGCGTCA GGCCGCGCAG CAGGGCCTCA CCCAGCAGAT CAAGATCTGC
CAGATCGCCA AGACCGGCCT GTTTCCCTCC GACATCGAGG CGCTCGGCGA CCTCGGCCTG
AACATCGGCA GCGCCGCCTA CTGGCACAAG GCCTTCCCCT ATAAATCCAC GCTGACCGGC
GTCTCCGGAA CCGAACTCGC CGACGGCTAT GAAACGGCAA GCGGCAAGCA GTGGACGCAG
CAGCTCGGCG CCAGCCTTGC GCTTCTCGAC GCCGGCTTCG ATGCGCTGAA GGCGAGCACC
GACGTCAAGA GCAAGGAGGC TGTGGCCAAG GCGATCAGCA CGCTGAAGAC CACGACCATC
GCCGGCAAGG TCGACTTCAC CAGCGGCCCC GTCGCCAACG TCTCTCCCGG ACCGATCATC
GGCACGCAAT GGGTGAAAGC GCCGGAGGGC TCGAAGTTCG CGCTCGACTA TGTCGTCACC
GAAAACGCCA CCGACCCCAA TGTCCCGGTC GGCGCCAAGC TCACCGCCTA TAACGGGTAA
 
Protein sequence
MNGIFRNGKF NAASLSRRSF IASTVAGGAA LALSGRTAFA QSGDTLKVGF ISPRTGPLGG 
FGETDGYVLE LARKALANGL QAGGKTWKVE ILDQDTQSDP SRAGQLAKDL INNQAIDLML
AVSTPETINP VADACEAAGI PCLSTVMPWE AWYFGRGAKP GAPSPFKWTY HFGFGVEEFH
KAYVSQWNLI ETNKKVGVMY PNDADGNAIR THLAPALAKA GFTIVDPGAY ETGTTDFTAQ
IALFRQEGVE IFNSFPIPPD FAAFWRQAAQ QGLTQQIKIC QIAKTGLFPS DIEALGDLGL
NIGSAAYWHK AFPYKSTLTG VSGTELADGY ETASGKQWTQ QLGASLALLD AGFDALKAST
DVKSKEAVAK AISTLKTTTI AGKVDFTSGP VANVSPGPII GTQWVKAPEG SKFALDYVVT
ENATDPNVPV GAKLTAYNG