Gene Rleg_3676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3676 
Symbol 
ID8014520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3725119 
End bp3726225 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content60% 
IMG OID644826239 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_002977458 
Protein GI241206362 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.318097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGTC GTTCGTTTTT CAAGAAGGCG GGAACCGCCG GCGCCGGTGC GGTTGCCGCA 
ACGGCATTGG CTGCGCCTGC GATCGCGCAG GAGAATCCGA AGATCGCGTG GCGCATGACG
TCGTCGTTTC CGAAGAGCTT GGATACGATC TATGGCGGCG CCGAGGATAT CGCCAAGCAT
GTTGCCGCGG CGACGGATGG CAATTTCACG ATCCAGCCCT TTGCGGCCGG TGAAATCGTG
CCGGGCCTGC AGGCCGTCGA TGCCGTTGCC GCCGGCACGG TCGAGGCAGC CCATACCACC
TCCTATTATT TCGTCGGCAA GGACCCGACC TATGCCATCG GGACGGCCAT TCCGTTCGGG
CTGAACAGCC GTCTGACCAA TGCCTGGTAC TATGAAGGCA ACGGCAACAA GCTGATGAAC
GAGTTTTATG CCACACAGGG AATGTATGCG CTGCCGGCCG GCAATACCGG TGCGCAGATG
GGCGGCTGGT TCCGCAAGGA AATCAACACG CTCGATGACC TTAAGGGCGT CAAGATGCGC
ATTGCCGGCC TTGCCGGTCG CGTCATGGAG AAGGTCGGCG TCATCCCGCA GCAGATCGCC
GGCGGCGATA TCTATCCGGC GTTGGAAAAA GGCACGATCG ATGCGGCGGA ATTCGTCGGC
CCCTATGACG ACCTGAAGCT CGGCTTCCAC AAGGTGGCGA AGTACTACTA TTATCCGGGC
TGGTGGGAAG GTGGCCCGAC GGTGCATGGG TTCTTCAATC TCGAAAAATG GAGCAGCCTG
CCCAAGCATT ATCAGGCAGC GCTGACCGAC GCCTGCGCCT TCGCCAATAC CAACATGCTG
GCGAAGTACG ACACGAAGAA CCCGACGGCG CTGAAGCAGC TCGTGGCGGA AGGCGCGACG
CTGCGCCCGT TCAGCCAGGA GATCATGGAA GCCTGCTTCC AGGCGGCGAC GGGCATCTAC
AGCGAAATTT CGGGCACGAA CCAGTATTTC AAGAAGATCT ATGACGACCA GACCGCCTTC
AAGCGGGATG CCTATCTGTG GATGCAGCTG TCGGAATACA CCTTCGATAC GTTCATGATG
ATTCAGCAGC GGGCCGGAAA GCTTTAA
 
Protein sequence
MDRRSFFKKA GTAGAGAVAA TALAAPAIAQ ENPKIAWRMT SSFPKSLDTI YGGAEDIAKH 
VAAATDGNFT IQPFAAGEIV PGLQAVDAVA AGTVEAAHTT SYYFVGKDPT YAIGTAIPFG
LNSRLTNAWY YEGNGNKLMN EFYATQGMYA LPAGNTGAQM GGWFRKEINT LDDLKGVKMR
IAGLAGRVME KVGVIPQQIA GGDIYPALEK GTIDAAEFVG PYDDLKLGFH KVAKYYYYPG
WWEGGPTVHG FFNLEKWSSL PKHYQAALTD ACAFANTNML AKYDTKNPTA LKQLVAEGAT
LRPFSQEIME ACFQAATGIY SEISGTNQYF KKIYDDQTAF KRDAYLWMQL SEYTFDTFMM
IQQRAGKL