Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4660 |
Symbol | |
ID | 5319335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1171408 |
End bp | 1172592 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640776458 |
Product | major facilitator transporter |
Protein accession | YP_001313390 |
Protein GI | 150376794 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.37515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00979921 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACAAGC GCCTCATCTG GCTCGCCGTT GGCTCCTTCA CCATGAGTAC GGTCGGCTTC GTATTCTCGA GCCTTCTGCC CTCGATCGCC GCCGACACAC ACACGACCAT TCCTCGTGCG GGGCACCTGA TCACCCTGTT CGCGCTCTCC TATGCCATCG GCGCGCCGCT GCTATCGGCG TTGGCCGGCG CGGCGGACAG GCGTCGATTG CTGGTGGCTG CGATGCTTAC CTTCGTCGTC GGCAACTGCA TCGCGGCAAC GAGCGTCTCC TTCGCGACGC TGCTGTTCGC ACAGATCGTG ATGGGAATGG CGAGCGGCCT CTTTGCGGCC ACAGCGCAAG CGACGGCGGT TTCGCTGGCC GGAGCGGAGC ACCGCGCGCT GGCGATTTCA ATCGTGGTGG GCGGCACCAC GTTCGCGGTG GCGCTGGGCG CGCCTCTGGG CGCGCTGATC GCCGCCTTCT GGGGATGGCG CGGCACGTTC GGGGCGATTG CTCTGCTTGG GCTCTCCTGC GCCGCCGTGC TCTGGCTGCG CCTGCCGCGG GGCCTCAGCG GAACGAAGCT GACAATGTCC GAGCGCTTCG GCGCCATCGG CCGTCCGGGA GTCGCTTCGT CGCTATCGGT GACATTTCTC TACCTCACCG GCGGCTTCAT GATCATCTCC TATCTTGCAC CGCTGGCGAT CGATGGGGCT GGGCTTTCGC AGCTGGCGCT ACCTGGCCTG CTGCTTGCCT TCGGTGTTGG CGCGGTGATC GGCAATCTCT CCAGCGGCTA TCTGGCCGAC CGGCTCGGCG CCACGCGCGT GGTGACAGCT TCGCTGATCT CGGCGCTGGT GGTCTCGTTG ATGATCGCCA CCGGCCTGCA CCTTCTCACG CGTGATCTCG CCGGCTTGCT GCTGATCGGC ATCATGGTGC CATGGGGCAT TGTAGGTTGG GCCTTTCCGC CGGCCCAGGC GAGCCGTATC GTCGGCTTCG CACCCGAGGT CGCGCATCTG ACCCTGTCGC TCAACGCCTC GGCGATCTAT CTCGGCATCG CGACCGGCAC GGCGATCGGC GGACGCGTGC TCGAGAATAC GGCTGCAGCC GATCTCGGCT TCTTCGCCGC GCTGTTTCCG GTGGCTTCGC TTGCGGTCCT TTATGCAGGG CTGCGCTCCC ACCGGCGGCG CTTGGCCACT GTCGCGGCGG AATAG
|
Protein sequence | MDKRLIWLAV GSFTMSTVGF VFSSLLPSIA ADTHTTIPRA GHLITLFALS YAIGAPLLSA LAGAADRRRL LVAAMLTFVV GNCIAATSVS FATLLFAQIV MGMASGLFAA TAQATAVSLA GAEHRALAIS IVVGGTTFAV ALGAPLGALI AAFWGWRGTF GAIALLGLSC AAVLWLRLPR GLSGTKLTMS ERFGAIGRPG VASSLSVTFL YLTGGFMIIS YLAPLAIDGA GLSQLALPGL LLAFGVGAVI GNLSSGYLAD RLGATRVVTA SLISALVVSL MIATGLHLLT RDLAGLLLIG IMVPWGIVGW AFPPAQASRI VGFAPEVAHL TLSLNASAIY LGIATGTAIG GRVLENTAAA DLGFFAALFP VASLAVLYAG LRSHRRRLAT VAAE
|
| |