Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_3801 |
Symbol | |
ID | 7088836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 4503410 |
End bp | 4504645 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643462680 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002359701 |
Protein GI | 217974950 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0888448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAACCG TTCAACATCA GGTTCAAAAT CCATCTGTCC AAAATCAACA CAAGGGCGCC AGTACACTAT GGGTGTTGCT CGCCTTAGCG CTCGGGACGT TTGCCTTAGG TACCACAGAA TTTGCTTCCA TGACTCTAGT GCCCTATATC GCCAGCGATT TAGGCGTAGA GGTTGCACAT GTGAGTTATG CCATTAGCGC CTATGCGCTG GGGGTTGTGG TCGGTTCGCC GATTATTATG GTGCTAGCGG TTAGGGTCAG GCGGCGAACA CTCTTGATTG CTTTAGCTGC CTTAATGGCG GTGGCCAATG GCTTAAGTGC GTTAGCGCCA TCATTGAATT GGTTAATCTT TTTTCGTTTT CTCAGTGGCT TGCCCCACGG TGCTTATTTC GGCGTGGCTA TGTTGCTCGC CGCCTCTTTA GTGCCGCCAG AAATGAAGGC CCGCGCCGTA TCGCGGGTGA TTATTGGCCT TACGCTGGCG ACGATTATCG GTGTGCCGTT TGCCACTTGG ATGGGGCAAA CTGTGGGCTG GCGCTCAGGC ATTGGCATAG TGGCGATTTT GGCGACTATT ACCGCTGTGA TGGTGTATTT TTTAGCGCCT GATCAGGCCG TGGCCGCTGA TGCGAGTCCC AGAAAAGAGC TACAAACCCT GAAGAATCGT GAAGTCTGGT TGACGCTTGG CATCGCTGCG ATTGGCTTTG GCGGTATCTT TTGCGTGTAT ACCTATCTGG CTGAAACCTT AATCCAAGTG ACGCAAGTCG AGCCGTTTAA GATCCCGATC ATGATGGCGG TATTTGGTAT TGGCGCAACA TTGGGCACGC TAGTGTGTGG CTGGGCGGCG GATAAGTCGG CCTTAGCGGC GGCGTTTTGG TCGTTAGTGT TAAGCACTGT GGTATTAGCG ATTTACCCGA GTTTGACCGG ACATTATTGG GCGCTGATGC CCGTAGTATT CTTTGTCGGT TGTGGCTTGG GACTTGCCAC CATAGTGCAA GCAAGATTGA TGGATGTGGC GCCCGATGGG CAGGCCATGA CAGGTGCGTT AGTGCAATGT GCCTTTAATC TCGCCAATGC TATTGGTCCT TGGGTGGGCA GTTTAGTGAT CCTGTCTGGA CAAGGGATTG CCGCGACAGG TTATGCGGCG TCTTTGTTGT CATTAGGAGG ACTTGTGATG TGGTGGCTGA CCCACAGGGA GAGTCGCCGC GCGGTGAGCT TAAACACAGC CAATTGCGCT GACTAG
|
Protein sequence | MKTVQHQVQN PSVQNQHKGA STLWVLLALA LGTFALGTTE FASMTLVPYI ASDLGVEVAH VSYAISAYAL GVVVGSPIIM VLAVRVRRRT LLIALAALMA VANGLSALAP SLNWLIFFRF LSGLPHGAYF GVAMLLAASL VPPEMKARAV SRVIIGLTLA TIIGVPFATW MGQTVGWRSG IGIVAILATI TAVMVYFLAP DQAVAADASP RKELQTLKNR EVWLTLGIAA IGFGGIFCVY TYLAETLIQV TQVEPFKIPI MMAVFGIGAT LGTLVCGWAA DKSALAAAFW SLVLSTVVLA IYPSLTGHYW ALMPVVFFVG CGLGLATIVQ ARLMDVAPDG QAMTGALVQC AFNLANAIGP WVGSLVILSG QGIAATGYAA SLLSLGGLVM WWLTHRESRR AVSLNTANCA D
|
| |