Gene Information |
Locus tag | Rleg_3619 |
Symbol | |
ID | 8014469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3659515 |
End bp | 3660549 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826183 |
Product | Bile acid:sodium symporter |
Protein accession | YP_002977403 |
Protein GI | 241206307 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | |
|
|
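The coordinate and length fields in the table above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3659515
END_BP = 3660549


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1035, matching "Gene Length"
print(protein_length(length))  # 344, matching "Protein Length"
```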
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTTT TATCAAGCCG CGAGATCGTC ATGCGCCGCT TTCTGCCCGA TACATTCACC ATCCTGCTCG TCTGCACCGT CATCCTCGCC TCGGTGTTGC CGGCGCGCGG CACATTCGCG GATCATTTCG GTATCGCCAC CGATCTTGCC ATCGCGCTGC TGTTCTTCCT GCATGGCGCC CGCCTGTCGC GCGACGTGGT CATCGCAGGC TTGCTGCACT GGCGCCTGCA TATCGTCATC CTGCTGACGA CCTTCGGCAT CTTCCCGTTG CTCGGCATGG CACTCGGGCT GATCCCCGAC ACGATCCTGC CGCAACCACT CTATCTCGGC ATCCTCTTCC TCTGCCTGCT GCCGTCGACG GTGCAGTCGT CGATCGCCTT TACGTCGATG GCCGGCGGCA ACGTGCCTGC CGCCATCTGC TCGGCCTCGG CATCCAACAT CTTCGGCATG TTCCTGACGC CACTGCTCGT CGGCCTGCTG TTTTCCGTCG GCGGCCATGG CGGCTTCTCC TTCGACGCAT TGCAGCAGAT CCTGCTGCAG CTGCTCGCCC CCTTCATCGT CGGCCAGATC TTGCAGCCCT GGATCGGCGA CTGGATCCGC GCCAAGAAGA AGATCCTGAT GCCTGTCGAC CGCGGCTCGA TCCTGATGGT CGTCTATCTC GCCTTCAGCA CGGCGGTGGT CGAGGGCCTG TGGCACACCT TCTCGATTGC CGATATCGCA GTCGTCATCG TCGCCGACAT GGTCCTGCTG GCGATTGTCC TGGTGCTGAC GATGTTCGGC AGCCGCTGGC TGGGCTTCAA CAAGGCCGAC CAGATCACCA TCACCTTCTG CGGCTCGAAG AAGAGCCTCG CAAGCGGCGT GCCGATGGCG AACGTCATCT TCGCCGGCCA GTCGATCGGC GCGATCGTGC TGCCGCTGAT GCTGTTCCAC CAGATCCAGC TGATGGTCTG CGCCGTCATC GCCCAGAAAT ACGCCGCCGC CGCGGCCCGC CGTGCAACGG ACAAGGAAAT CGACGAGGCA ACCAGCCCGG CATGA
|
Protein sequence | MTVLSSREIV MRRFLPDTFT ILLVCTVILA SVLPARGTFA DHFGIATDLA IALLFFLHGA RLSRDVVIAG LLHWRLHIVI LLTTFGIFPL LGMALGLIPD TILPQPLYLG ILFLCLLPST VQSSIAFTSM AGGNVPAAIC SASASNIFGM FLTPLLVGLL FSVGGHGGFS FDALQQILLQ LLAPFIVGQI LQPWIGDWIR AKKKILMPVD RGSILMVVYL AFSTAVVEGL WHTFSIADIA VVIVADMVLL AIVLVLTMFG SRWLGFNKAD QITITFCGSK KSLASGVPMA NVIFAGQSIG AIVLPLMLFH QIQLMVCAVI AQKYAAAAAR RATDKEIDEA TSPA
|
| |
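The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene sequence. It is an illustration, not the database's own method. The codon table is built from the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs in which start codons it permits, which this sketch does not model. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGACTGTTT TATCAAGCCG CGAGATCGTC"
print(translate(prefix))  # MTVLSSREIV, the first block of the protein sequence
```

Passing the full "Gene sequence" value to `gc_fraction` and `translate` lets the "GC content" and "Protein sequence" fields be compared against the nucleotide sequence.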