Gene Rleg2_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1457 
Symbol 
ID6980185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1484020 
End bp1485018 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content61% 
IMG OID643396178 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_002280977 
Protein GI209549060 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.303505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATC TCAACCGACG TAATTTCCTG AGGACCGCCG CTCTAACCGG TACGGCGCTT 
GCGGCGCCAG GCTTTGTCCG CACAGCCGCC GCCCGCACGA CGACGATCAC GATCGCCTCT
CTGCTCGGCG ACGACAAGCC GGAGACGAAG ATCTGGGTGA AAATCGGCGA GCTGGTCGAA
GCCAAACTTC CCGGCCAGTT CAAGTTCAAT ATCGTCAGGA ACGGCGCGCT CGGCGGCGAG
AAGGAAGTGG CCGAAGGCGT GCGGCTCGGC TCGATCCAGG CGAGCCTTTC GACGGTGTCG
TCGCTGTCCG GCTGGGCGCC CGAACTGCAG ATCCTCGATC TGCCTTTCCT CTTTCGCGAT
GCCGACCATG TGCGCAGAAC TGTCGGCGGC GATGTCGGCG CCGATCTCAA GCAGAAACTG
CAGGCGCAGA ATTTCGTCGT CGGCGATTTT ATCAATTACG GCGCCCGCCA TCTCCTGACC
AAGGAGCCGG TGACGCGACC CGAGCAGCTC AAGGGCAAGC GCATCCGCGT CATCCAGAGC
CCGCTTCACA CCAAGCTTTG GAGCGCATTC GGCACGACGC CGATCGGCAT TCCGATCACC
GAGACCTACA ATGCGCTGGC AACCGGCGTC GCCGACGCCA TGGACCTGAC CAAATCGGCT
TACTCAGGCT TCAAGCTTTA TGAGGTCGTG CCCGATATGA CCGAGACAGG CCACATCTGG
GCATCCGGCG TCATCTATTA TTCCTCGACC TTCTGGGCCG GCCTGAATGA CGAGCAGAAG
GCGGTTTTCC AGCAGGCTTC CAGCGAAGGA GCCGCCTATT TCAACCAGCT GATCGTCGAC
GACGAGGTAA AGTCCGTGGA AACGGCGCTT GGCCATGGCG GCAAGCTCTT GAAGCCGGAA
GCCTTCGAGG AATGGCAGAA GGGCGCGCAG GGCGTCTGGG CCGATTTCGC GCCTGTTGTC
GGCGGCCTCG ACAGGATCAA AACCGTTCAG GCGGCTTGA
 
Protein sequence
MNNLNRRNFL RTAALTGTAL AAPGFVRTAA ARTTTITIAS LLGDDKPETK IWVKIGELVE 
AKLPGQFKFN IVRNGALGGE KEVAEGVRLG SIQASLSTVS SLSGWAPELQ ILDLPFLFRD
ADHVRRTVGG DVGADLKQKL QAQNFVVGDF INYGARHLLT KEPVTRPEQL KGKRIRVIQS
PLHTKLWSAF GTTPIGIPIT ETYNALATGV ADAMDLTKSA YSGFKLYEVV PDMTETGHIW
ASGVIYYSST FWAGLNDEQK AVFQQASSEG AAYFNQLIVD DEVKSVETAL GHGGKLLKPE
AFEEWQKGAQ GVWADFAPVV GGLDRIKTVQ AA