Gene Information |
Locus tag | Smed_0644 |
Symbol | |
ID | 5321480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 696167 |
End bp | 697456 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640789580 |
Product | major facilitator transporter |
Protein accession | YP_001326335 |
Protein GI | 150395868 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
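The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention, but this one reproduces the listed 1290 bp) and an unspliced CDS ending in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(696167, 697456)
print(length)                  # 1290
print(protein_length(length))  # 429
```

Both values agree with the Gene Length and Protein Length fields of this record.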
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.414584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGCGA ACTTCGCTTC CGTTTTCAGC CTGATGCTGT CCACCTTTCT GATGATGGTG GCCTTCGGGC TGCAGAGCTA CGTCATCCCG GTACGCTCCG TGGCGGAAAG CTGGTCGACC CTCACGATCT CGATTTTCGC GACGGGTTAT ACGCTGGGTT TCACCCTTTC CTGCATTGTG ACACCTAAGT TCGTCCTGCG TGTCGGCCAC GTTCGCGTCT TCACCGCTCT GATAACGCTG CTTTCGATCG CGATCCTGAT GTGCGGGCTC GTCGTCGATT GGCGTGCGTG GATCGGGTTT CGCGCCATAT CCGGGTTCGC GATCGCAGGA AGTTACCTCG TCATCGAAAG TTGGCTGAAC GAACGGGTGA CGAACGAGAA TCGCGGCCTG CTCTTCTCGC TCTACCTCAT TACCACCATG GTCGGCACCA TCGGCGGTCA ATACCTGGTA CCGCTCGGCG ATCCCAGCAA CACCTCGCTG TTCATCCTCT GCGGCGTCCT GTTCTCGCTC GCCTTGCTGC CGACGGCCCT CTCGTCCTCA CCAATGCCGG CACCGCCGGC ACGGGCGGAC TTCGACATCC CCGCGCTCTA TCGACGCTCG CCGGTGGCCG TCGTCGGCGG TTTTCTCGCC GGCGCGCTCT CCGGTGCCTG GCTCAACCTC GGCGGCGTCT TCACCCAGAA GATCGGTCTT TCCACCGGCG AAGGAGCGAC GCTGCTGGCG TCGCTCCTCG CCGGCAGCGC GATCTCCCAG GTACCGATCG GCCGCGCCTC CGACCGGATG GACCGGCGCA TCGTCATGGT CGCCTGCGGC ATCGCCGGCG TCGTGTCCTG CCTTGCCATG TCCGTCATGA TCGACAGCAG TCCGCCCGTC CTGTATGCGC TCGCGGCCTG CATCGGCACG GTCCTCTTTC CGATCTACGC GTTGAACGTC GCCCACGCCA ACGACCTCGC ACGCCCCGAC GAGTATGTGG AAATTTCATC CGGCCTGATG ATTACCTACG GCCTGGGCAC GATTTCAGGA CCGCTGATGG TCGGCCCGGT GATGGATCGT TTCGGCCCCG TCGCACTGTT CGTCGCGCTC GCCGTGTATT TTGCTCTCTA CAGCGGCTAT GCCGCCTGGC GCATTCTCCG GCGAGAGCAG CATGACGGCC TCGTATCGAA GACCGATTTT CAGGCGACCA CTGTCATGCC TACACCCGGT CCGGACGTCA CCGGTCCGGT CTCACAGCAG GACGCGGGCG ATCTCATCGA GGACGATACC GTTCCTGCCT GGGAGGAAGG TGCCCGGTAG
Protein sequence | MLANFASVFS LMLSTFLMMV AFGLQSYVIP VRSVAESWST LTISIFATGY TLGFTLSCIV TPKFVLRVGH VRVFTALITL LSIAILMCGL VVDWRAWIGF RAISGFAIAG SYLVIESWLN ERVTNENRGL LFSLYLITTM VGTIGGQYLV PLGDPSNTSL FILCGVLFSL ALLPTALSSS PMPAPPARAD FDIPALYRRS PVAVVGGFLA GALSGAWLNL GGVFTQKIGL STGEGATLLA SLLAGSAISQ VPIGRASDRM DRRIVMVACG IAGVVSCLAM SVMIDSSPPV LYALAACIGT VLFPIYALNV AHANDLARPD EYVEISSGLM ITYGLGTISG PLMVGPVMDR FGPVALFVAL AVYFALYSGY AAWRILRREQ HDGLVSKTDF QATTVMPTPG PDVTGPVSQQ DAGDLIEDDT VPAWEEGAR
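The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch of how they might be processed, the code below removes the block spacing, computes a GC fraction, and translates codons. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons). The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
sample = clean_sequence("ATGCTGGCGA ACTTCGCTTC CGTTTTCAGC")
print(len(sample))                 # 30
print(translate(sample))           # MLANFASVFS
print(gc_fraction(sample[:20]))    # 0.55
```

The translated fragment matches the first block of the protein sequence. The same functions could be applied to the full gene sequence to compare against the listed GC content and protein length.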