Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1858 |
Symbol | |
ID | 6272612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1701080 |
End bp | 1702291 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641725923 |
Product | inner membrane transport protein YdhC |
Protein accession | YP_001880421 |
Protein GI | 187730494 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00710] drug resistance transporter, Bcr/CflA subfamily [TIGR00880] Multidrug resistance protein |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000000141458 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCTG GGAAAAGATT TTTAGTCTGG CTGGCGGGTT TGAGCGTACT CGGTTTTCTG GCAACCGATA TGTATCTGCC TGCTTTCGCC GCCATACAGG CCGACCTGCA AACGCCTGCG TCTGCTGTCA GTGCCAGCCT TAGTCTGTTC CTTGCCGGTT TTGCCGCAGC CCAGCTTCTG TGGGGGCCAC TCTCCGACCG TTATGGTCGT AAACCGGTGT TATTAATCGG CCTGACAATT TTTGCGTTAG GTAGTCTGGG GATGCTGTGG GTAGAAAACG CCGCTACGCT GCTGGTATTG CGTTTTGTAC AGGCTGTGGG TGTCTGCGCC GCGGCGGTTA TCTGGCAAGC GTTAGTGACG GATTATTATC CTTCACAGAA AGTTAACCGA ATTTTTGCGA CCATCATGCC GCTGGTGGGT CTATCTCCGG CCCTGGCTCC TCTGTTAGGA AGCTGGCTGC TGGTCCATTT TTCCTGGCAG GCGATTTTCG CCACCCTGTT TGCCATTACC GTGGTGCTGA TTCTGCCTAT TTTCTGGCTC AAACCCACGA CGAAGGCCGG TAACAATAGT CAGGATGGTC TGACCTTTAC CGACCTGCTA CGTTCTAAAA CCTATCGCGG CAACGTGCTG ATATACGCGG CCTGTTCAGC CAGTTTTTTT GCATGGCTGA CCGGTTCACC GTTCATCCTT AGTGAAATGG GCTACAGCCC GGCAGTTATT GGTTTAAGTT ATGTCCCGCA AACTATCGCG TTTCTGATTG GTGGTTATGG CTGTCGCGCC GCGCTGCAGA AATGGCAAGG CAAGCAGTTA TTACCGTGGT TGCTGGTGCT GTTTGCTGTC AGCGTCATTG CGACCTGGAC TGCGGGCTTC ATTAGCCATG TGTCGCTGGT CGAAATCCTG ATCCCATTCT GTGTGATGGC GATTGCCAAT GGCGCGATCT ACCCTATTGT TGTCGCCCAG GCGCTGCGTC CCTTCCCACA CGCAACTGGT CGCGCCGCAG CGTTGCAGAA CACTCTACAA CTGGGTCTGT GCTTCCTCGC AAGTCTGGTA GTTTCCTGGC TTATTAGTAT CAGCACGCCA TTGCTCACCA CCACCAGCGT GATGTTATCA ACAGTAGTGC TGGTCGCGCT GGGTTACATG ATGCAACGTT GTGAAGAAGC TGGCTGCCAG AATCATGGCA ATGCCGAAGT CGCTCATAGC GAATCACACT GA
|
Protein sequence | MQPGKRFLVW LAGLSVLGFL ATDMYLPAFA AIQADLQTPA SAVSASLSLF LAGFAAAQLL WGPLSDRYGR KPVLLIGLTI FALGSLGMLW VENAATLLVL RFVQAVGVCA AAVIWQALVT DYYPSQKVNR IFATIMPLVG LSPALAPLLG SWLLVHFSWQ AIFATLFAIT VVLILPIFWL KPTTKAGNNS QDGLTFTDLL RSKTYRGNVL IYAACSASFF AWLTGSPFIL SEMGYSPAVI GLSYVPQTIA FLIGGYGCRA ALQKWQGKQL LPWLLVLFAV SVIATWTAGF ISHVSLVEIL IPFCVMAIAN GAIYPIVVAQ ALRPFPHATG RAAALQNTLQ LGLCFLASLV VSWLISISTP LLTTTSVMLS TVVLVALGYM MQRCEEAGCQ NHGNAEVAHS ESH
|
| |