Gene Rleg2_3374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3374 
Symbol 
ID6982128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3486634 
End bp3487740 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content60% 
IMG OID643398092 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_002282867 
Protein GI209550950 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.626283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.630322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGTC GTTCGTTTTT CAAGAAGGCG GGAACCGCCA GCGCCGGCGC GGTCGCCGCA 
ACGGCATTGG CAGCGCCTGC GATCGCGCAG GAGAATCCGA AGATCGCGTG GCGCATGACG
TCGTCGTTTC CGAAGAGCTT GGATACGATC TATGGCGGCG CCGAGGACAT CGCCAAGCAT
GTCGCCGCTG CGACAGACGG CAATTTCACG ATCCAGCCTT TTGCGGCCGG TGAAATCGTG
CCTGGCCTGC AGGCCGTTGA TGCGGTGGCC GCCGGCACGG TCGAGGCCGC CCACACGACC
TCCTATTATT TCGTCGGCAA GGACCCGACC TACGCCATCG GAACGGCCAT TCCGTTTGGC
CTCAACAGCC GCCTGACCAA TGCCTGGTTT TACGAGGGCA ACGGCAACAA GCTGATGAAT
GAATTTTATG CCACACAGGG CATGTATGCG CTGCCTGCCG GCAATACCGG TGCGCAGATG
GGCGGCTGGT TCCGCAAGGA AATCAATACG CTCGATGACC TCAAGGGCGT CAAGATGCGC
ATTGCCGGTC TTGCCGGCCG CGTCATGGAG AAGGTCGGCG TCATCCCGCA GCAGATTGCC
GGCGGCGACA TCTATCCGGC GCTGGAAAAA GGCACGATCG ATGCGGCGGA ATTCGTCGGC
CCCTATGACG ACCTGAAGCT CGGCTTCCAC AAGGTGGCGA AGTACTACTA CTATCCGGGC
TGGTGGGAAG GCGGCCCGAC CGTGCATGGT TTCTTCAACC TTGAAAAATG GAGCAGCCTG
CCGAAGCATT ATCAGGCCGC ACTCACCGAT GCCTGCGCCT TCGCCAACAC CAACATGCTG
GCAAAGTACG ACTCCAAGAA TCCGACGGCG CTGAAGCAGC TCGTTGCAGA AGGTGCGACG
CTGCGCCCCT TCAGCCAGGA AATCATGGAA GCCTGCTTCC AGGCGGCGAC CGGCATCTAC
AGCGAAATCT CCGGCACCAA CCAGTATTTC AAGAAGATCT ACGACGACCA GACCGCCTTC
AAGCGGGACG CCTATCTCTG GATGCAGCTT TCGGAATACA CTTTCGATAC GTTCATGATG
ATCCAGCAGC GGGCCGGAAA GCTCTGA
 
Protein sequence
MDRRSFFKKA GTASAGAVAA TALAAPAIAQ ENPKIAWRMT SSFPKSLDTI YGGAEDIAKH 
VAAATDGNFT IQPFAAGEIV PGLQAVDAVA AGTVEAAHTT SYYFVGKDPT YAIGTAIPFG
LNSRLTNAWF YEGNGNKLMN EFYATQGMYA LPAGNTGAQM GGWFRKEINT LDDLKGVKMR
IAGLAGRVME KVGVIPQQIA GGDIYPALEK GTIDAAEFVG PYDDLKLGFH KVAKYYYYPG
WWEGGPTVHG FFNLEKWSSL PKHYQAALTD ACAFANTNML AKYDSKNPTA LKQLVAEGAT
LRPFSQEIME ACFQAATGIY SEISGTNQYF KKIYDDQTAF KRDAYLWMQL SEYTFDTFMM
IQQRAGKL