Gene Information |
Locus tag | CHU_0325 |
Symbol | sotB |
ID | 4185507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 391197 |
End bp | 392408 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638070333 |
Product | sugar efflux transporter |
Protein accession | YP_676955 |
Protein GI | 110636748 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
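The coordinates and lengths listed above are mutually consistent, which a short check can confirm. The following is a minimal Python sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for CHU_0325 (sotB).
length_bp = gene_length(391197, 392408)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1212 403
```

Both results match the Gene Length (1212 bp) and Protein Length (403 aa) fields of the record.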
|
Plasmid Coverage Information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCCA CTCAAAAAAG CGCCATCAAC GAAGGCTTGT TAATAGCCAT GCTGGCAGCA ATCAATTTTA CACACATCAT GGATTTTGTA ATTATGGCTC CCTTAAGTGC TACGTTAAAA ATAGCCATGT CAATTACAAC AAAGGAGTTT GGGTATCTGG TATCCATTTA CACATTTGCT GCTGCCGTAG GTGCTATTAT CGCATTCTTT AAGATTGATA AGTATGACAG AAGAACAGCA ATCATCTTTG TGTATACAGG ATTTATAGTT GCAAATATAT TATGTGCATT GGCACCGCAA TATAAATTCT TTATGCTGGC GCGTCTGTTC GCAGGTTTAT TCGGTGGTGT GTTAAACGTG TTGATCATGT CGGTCATCGG AGATGTTATT CCGTTAGAAC GCAGAGGTAA AGCAACCGGT ATGGTGATGG CTGCTTTTTC AGCGGCATCC GTAATTGGTA TCCCGAGCGG CTTGATCCTG GCTGATATAT TTAAAGACTA TCACGCACCG TTCTGGCTGC TTTCTATATT AAGCGCGCTT GTCGGGTTTG TACTGGTATT TAAATTCCCT TCCATTAAAA GCCATATGGA GTTTGAAGGT GCTAAAACAC CTCCAATGGA AATTATTAAA GAATACATGC AGAATTCAAA TGTACGCAGA GCCTTGTTGT TTATCTTTTT ACTAATGATT GCAGGCTTTT CCGTTGTACC ATTTATAAGC GATTATCTGG TAAATAATGT TGGCCTGGAT TTGAAGGAAT TGAAATACGT GTATTTGTGC GGAGGATTAG CTACTGTTGC AAGTAGTATA TTTATCGGTC GCTTATCAGA TAAGCTGGGT AAAGTAAAAA CATTTATCAT TGCCGCATTG GTATCCGTTG CACCAATCGC TATTGTAACG GTATTGCCGG TTATGCCACT AAAACATGTT TTATTGTTTA ACGTGTTATT CTTTATGTGC TTTGGTGCAA GATTTGTTCC GGCTATGACA CTAATGACTT CCTGTGTTCA GCCCAAACGC CGGGGAAGTT TTTTAAGTGT AAGCTCTGCA ATACAACAGC TGGGTTCAGG TGTTGCAGTG CTGATTGCAT CCGCCATTAT TGTGAATGGT CCGAAAGGCG AATTACACAA TTTTGGCTGG GTGGGTATTG TGGCATGTGT TGCTACCGTA ATAAGCATTC TGCTTTCTGT TCGTATTAAA GAAGTTTCTT AA
Protein sequence | MQPTQKSAIN EGLLIAMLAA INFTHIMDFV IMAPLSATLK IAMSITTKEF GYLVSIYTFA AAVGAIIAFF KIDKYDRRTA IIFVYTGFIV ANILCALAPQ YKFFMLARLF AGLFGGVLNV LIMSVIGDVI PLERRGKATG MVMAAFSAAS VIGIPSGLIL ADIFKDYHAP FWLLSILSAL VGFVLVFKFP SIKSHMEFEG AKTPPMEIIK EYMQNSNVRR ALLFIFLLMI AGFSVVPFIS DYLVNNVGLD LKELKYVYLC GGLATVASSI FIGRLSDKLG KVKTFIIAAL VSVAPIAIVT VLPVMPLKHV LLFNVLFFMC FGARFVPAMT LMTSCVQPKR RGSFLSVSSA IQQLGSGVAV LIASAIIVNG PKGELHNFGW VGIVACVATV ISILLSVRIK EVS
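The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal Python sketch of that step, with a GC-fraction calculation added. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence above (illustration only).
example = clean_sequence("ATGCAACCCA CTCAAAAAAG")
print(len(example), gc_fraction(example))
```

Passing the full gene sequence through the same two functions gives a base count and GC fraction that can be compared against the Gene Length and GC content fields of the record.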