Gene Information |
Locus tag | Rleg_1638 |
Symbol | |
ID | 8012709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1629792 |
End bp | 1630790 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644824224 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_002975465 |
Protein GI | 241204369 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family; [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
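The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length_bp = gene_length(1629792, 1630790)
print(length_bp, protein_length(length_bp))  # prints: 999 332
```

Under these assumptions the result agrees with the Gene Length (999 bp) and Protein Length (332 aa) rows.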
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.503152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0466092 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAATT TCAACCGACG CAATTTCCTG AAGACCGCAG CTCTCGCCGG AACGGCGCTT GCAGCGCCCG CCTTCGTCCG CACGGCCGCC GCCCGCACGA CGACGATCAC GATCGCCTCG CTGCTCGGTG ACGACAAGCC GGAAACGAAG ATCTGGGTGA AGATCGGCGA ACTGGTCGAA GCGAAACTCC CCGGCCAGTT CAAGTTCAAT ATCGTCAGGA ACGGCGCGCT CGGCGGCGAG AAGGAAGTGG CCGAAGGCGT GCGCCTCGGC TCCATCCAGG CCAGCCTCTC GACGGTCTCG TCGCTCTCCG GCTGGGCGCC GGAACTGCAG ATCCTCGATC TGCCCTTTCT CTTCCGCGAT GCCGACCATG TTCGCAGGAC CGTCGCCGGT GATGTCGGCG CCGATCTCAA GCAGAAACTG CAGGCGCAGA ATTTCGTCGT CGGCGATTTC ATCAATTACG GCGCCCGCCA TCTCCTGACC AAGGAGCCGG TGACGCGGCC AGAACAACTC AAGGGCAAGC GCATTCGCGT CATCCAGAGC CCTCTGCACA CCAAGCTCTG GAGCGCATTC GGCACGACGC CGATCGGCAT TCCGATCACC GAGACCTATA ATGCGCTCGC AACCGGCGTC GCCGACGCGA TGGATCTGAC CAAGTCGGCC TATGCCGGCT TCAAGCTCTA CGAGGTCGTG CCCGATATGA CCGAGACCGG CCACATCTGG GCATCCGGCG TCATCTATTA TGCCTCGACC TTCTGGGCCG GCCTCAATGA CGAGCAGAAG GCGGTGTTCC AGCAGGCCTC CAGCGAGGGG GCTGCCTATT TCAACCAGTT GATCGTCGAT GACGAATCCA AATCCGTCGA GACGGCGCTT GCCAACGGCG GAAAACTCTT GAAGCCGGAA GCCTTCGACG AATGGCAGAA GGGCGCCCAG GGGGTGTGGG ACGATTTCGC GCCGGTTGTC GGCGGCATCG ACAGGATCAA ATCGATCCAG GCCGCCTGA
|
Protein sequence | MDNFNRRNFL KTAALAGTAL AAPAFVRTAA ARTTTITIAS LLGDDKPETK IWVKIGELVE AKLPGQFKFN IVRNGALGGE KEVAEGVRLG SIQASLSTVS SLSGWAPELQ ILDLPFLFRD ADHVRRTVAG DVGADLKQKL QAQNFVVGDF INYGARHLLT KEPVTRPEQL KGKRIRVIQS PLHTKLWSAF GTTPIGIPIT ETYNALATGV ADAMDLTKSA YAGFKLYEVV PDMTETGHIW ASGVIYYAST FWAGLNDEQK AVFQQASSEG AAYFNQLIVD DESKSVETAL ANGGKLLKPE AFDEWQKGAQ GVWDDFAPVV GGIDRIKSIQ AA
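The GC content and protein sequence rows can in principle be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library. The helpers `clean`, `gc_fraction` and `translate` are hypothetical names introduced for illustration. The codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons. Treating the first codon like any other is an assumption that holds here only because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence in this record.
print(translate("ATGGACAATT TCAACCGA"))  # prints: MDNFNR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a direct comparison with the Protein sequence and GC content rows.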