Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2963 |
Symbol | |
ID | 7089031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 3496573 |
End bp | 3497838 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643461848 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002358872 |
Protein GI | 217974121 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0738] Fucose permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000121972 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.284618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCCG AGCTTCAAAA CATGACGCAC AGCGGCAGAA TTATGTTCTG TCTAATCACC ACTTATGTCA TCTTCGCAAT GCTGCTAAAC AGTGTAGGCA CAGTCATCTT ACAAGTGATT AGCAGCTTTG ATGTCAGCAA GAGCGGTGCC AGTGTATTGG AAGGTTTTAA AGATATTCCC ATCGCCTTAG TATCATTCCT ATTTGCCTCT TATCTTCCTA AATTCGGTTA CAAAAACTCC CTGTTACTCG GCACACTTAT CGTCACCCTA ACCTGCGCTC TTATGCCCCT GTTGCCGAGT TTTCTCATGA CCAAACTGTT ATTTCTTACT GTGGGTATTA GCTTTGCTCT CGTAAAAGTA TCAACGTATT CTTTGCTCGG ACAAATTTCC GCGACGGCAA AAATCCATGC CGCCAATATG AATCTGCTGG AAGGGTGTTT TATGCTGGGA GTGCTCGGCG GCTACTGGCT ATTTGGCTTG TTTATTGAGG GAATTGCTCA ACCCCTAGCT TGGTTGTCTG TCTATTGGGT ACTCGCCGCC CTATCTCTGG CAAATCTACT GCTCTTAGTC ACGACTCCGA TAGTCCAAGA TAAACAGTCT GAGGCCACAC TCGATAGCAA AAATAATTTC AGTAAGATGC TGAAATTGCT AGCGCTGCCC TTCGTACTCA CCTTCGTCGT CGCGGCCTTC TTCTATGTTC TGATAGAGCA AGGCGTCGGC AGCTGGTTGC CAACCTTCAA CAATCAAGTG CTCCATCTAT CGCCTAGTAT GAGTGTGCAA ATCACCAGCA TTTTTGCCGC AGGTCTCGCC CTTGGTCGTC TCTCAGCGGG ACTCATTCTG CGTCGAATAC ACTGGTATCC TTTGCTGATG ATCTGTATTG CCTCTATGGC TATACTGCTC GTACTCATCT TACCCGCAGT TGCGACGCTA ACGCCCGAGA CCAAAATCAC CCAATGGCAT CAGGTGCCTA TCGCAGCCTT CATCCTACCT ATGATAGGTA TGTTTATGTC ACCCATTTAT CCCGCCATTA ACTCGGCGGT ATTGAGTGCG CTACCGCGAC ACAGGCACGC TACCATGACG GGACTGATAG TGATTTCCTC CGCCCTTGGC GGTACTCTTG GCAGCCTAGT TACTGGTTTC ATCTTCGACT ATTTCGATGG CCAGAGCGCC TTCTATTTTA TGTTGTTACC GATGACGGTT CTCGCGTTAA TTCTAACGGG CTTCAAACGC TTACTTATCC CTAATGAAAC GGTATTATCT GTATGA
|
Protein sequence | MQAELQNMTH SGRIMFCLIT TYVIFAMLLN SVGTVILQVI SSFDVSKSGA SVLEGFKDIP IALVSFLFAS YLPKFGYKNS LLLGTLIVTL TCALMPLLPS FLMTKLLFLT VGISFALVKV STYSLLGQIS ATAKIHAANM NLLEGCFMLG VLGGYWLFGL FIEGIAQPLA WLSVYWVLAA LSLANLLLLV TTPIVQDKQS EATLDSKNNF SKMLKLLALP FVLTFVVAAF FYVLIEQGVG SWLPTFNNQV LHLSPSMSVQ ITSIFAAGLA LGRLSAGLIL RRIHWYPLLM ICIASMAILL VLILPAVATL TPETKITQWH QVPIAAFILP MIGMFMSPIY PAINSAVLSA LPRHRHATMT GLIVISSALG GTLGSLVTGF IFDYFDGQSA FYFMLLPMTV LALILTGFKR LLIPNETVLS V
|
| |