Gene Sbal223_1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1071 
Symbol 
ID7087844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp1261784 
End bp1264129 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content48% 
IMG OID643459982 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_002357009 
Protein GI217972258 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0819877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0326582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTTCA AGTTGTTGTT TGCAATAGCA TTGCTGTGTA TCTCCAGCGT GCAAGTGCAG 
GCTTCGCCAC AGGCGCAGAC TGCCATCAAA CTGTCGTTCA ACGAAGATTG GCAATTCCAA
AAGTCACCCG CCCACGCTAC CGCGTTTCAG CCTGAGGCTA AAGCATGGGA AAAAGTGAGC
TTACCCCATA CACCAAAGCT TGAATCCCTA CTGGTCAATG ACCAATGGCA GGGTATTGCC
CTGTATCAAA AGCAATTTAA TGCTCCTAAA TCTTGGCAAG GACAGCGGCT TTACCTGCGT
TTTGAAGGTG CAATGAACCA CACCAAAGTT TGGTTAAACG ATAAACCTGT GGGTGAACAT
TTAGGCGGAT ACCTTCCTTT CAGCATCGAT ATTTCAGCTG CGGTTAATTA TGGTGCGGAC
AACCAGTTAC GGGTGGCGTT AGACAATAAC GACAATCCAA TTACCGGTCC TAAACCACTA
CATTTACTCG ACTTTAATAC CTATGGTGGT TTGTACCGCG ACGTTAATTT ATTCATTAAA
GCACCGCTGC ACATCAGCGA TGAGATGCTG ACGAATTTAC CGAACCGTGG TGGTTTAGTG
ATTTCGAGCA AGATCGACGA TAAGGCTAAG GCGAGTGTGG CCGTTCAGGC CGATATCCGT
AATCAAGGCA GTAAAGCGGC TCAGTTCACC TTAAAACAGG TGCTTGAGTT CGACGGTAAA
GTGGTTGCAA CAATAGCGCA AAGCTACACT TTGGCAGCTA ATAGCCAGCA AGTTTTCAGC
CAGAACTTAA CGGTAAAACA GCCAAAGTTA TGGCATCCAA CCCATCCGAA TCTTTATCAA
TTGCAGACTG AACTCTGGCA AGGCAACACC TTGCTCGAGC GAAGCAGTAA TAAGGTCGGT
ATCCGTGAAT TTGCTTTCAA TGACAAACAT GAATTGTTGA TTAACGGTGA ACCCTATTTC
CTGCGCGGTA TAAATCGTCA TCAAGATTAT CCCCATGTCG GTTATGCCAC CTCAAAACAA
GCCGATTACC GTGATGCCAT CAAGATCAAA GAAGCCGGAT TTGATTATGT TCGCTTATCT
CACTATCCCC ATTCGCCCGC CTTTATGGAC GCCGCCGATG AAGTTGGGCT AGTGCTGATT
GATGCCGTAT TAGGTTGGCA GTATTTCTCA CCTGAGCCTG AGTTTGCTGA GCATGTGTAT
CAGTCTTGCA GAGATCTACT GCACCGCGAT CGTAACCACG CCAGCGTGTT AGCGTGGGAA
TGCTCGCTCA ATGAAACTCC AATGCCAGTG GCCTTTATCG ACAACCTGAA AGCCATCATT
CAGGCGGAAC TTCCTGGTGC AATGTCTGCA GGTTGGCAAA CGCAATACGA TATCTATCTG
CAAGCTCGTC AACATAGAAT GCAGCATTAT CAAACACCGA CTCAGCCATA CAATGTGTCT
GAATACGGTG ATTGGGAATA CTACGCCCAA AATGCAGGCC TAAACCAACA GGCGTGGGCC
GATCTTAAGG ATGATGAACG CACCAGTCGG CAGCTACTCG GTGCTGGTGA AAAGCGTTTA
CTACAACAGG CAATGAACCT ACAGGAAGCC CATAACGATA ATTTACACAC GCCCGCGTTT
GCCGACGGTT ATTGGGTGAT GTTTGACTAT AACCGTGGCT ATGCGAATGA CTTAGAGTCC
TCGGGCATTA TGAGCATCTA TCGTCAACCT AAGTTCAGCT ATTACCTGTT CCAAAGTCAG
CGCGACCCCG CGCTAACGTC AACCAAATAC AATGCCGGAC CTATGGTGTA TATCGCCTCA
GATTGGCAAG CGGACTCTTC GCCAAATGTT CGGGTGTTCA GTAACGCGGA TGCCGTCGAG
TTATGGCTAA ACGGTAAATT AATTACCCGT CAAATACCCG ATAACGGCGC GAATACCGAC
AAGTTGCCTC ATCCACCTTT TACTTTCCAT CTGCCCAGTT TTGAGGCGGG AGAGCTAAGT
GCTAAAGCCT TTATTGGCAA TCAGTTCGTC GCGACGCACA GCGTTCGTAC TCCCCAAGCG
ATTCATGCGC TTGATATTTC ACTCGATACC GCAGGTGTGC CGATTAATTC CGCTGGCAGC
GATGTGCTGT TTGTGAATGT CAGATTGGTT GATGCCAATG GCACCACAGT ACCAGTGAAT
GACAAGGTGG TGAGCTTTGA AGTCAATGGC GCGATTCAAG TGCTCAATCC GGATGCGATA
GTGACAGAAA AGGGCGTTGC CAGCGCATTG GTACGAGTGC ATAACGGTGC AAAAGGCGCG
CACTTAAAAG CCTCTTTTGC GGCAGAAAAC ACAGCTGCGC CGTTAACTGC TGAGCTGAAA
TTCTAA
 
Protein sequence
MYFKLLFAIA LLCISSVQVQ ASPQAQTAIK LSFNEDWQFQ KSPAHATAFQ PEAKAWEKVS 
LPHTPKLESL LVNDQWQGIA LYQKQFNAPK SWQGQRLYLR FEGAMNHTKV WLNDKPVGEH
LGGYLPFSID ISAAVNYGAD NQLRVALDNN DNPITGPKPL HLLDFNTYGG LYRDVNLFIK
APLHISDEML TNLPNRGGLV ISSKIDDKAK ASVAVQADIR NQGSKAAQFT LKQVLEFDGK
VVATIAQSYT LAANSQQVFS QNLTVKQPKL WHPTHPNLYQ LQTELWQGNT LLERSSNKVG
IREFAFNDKH ELLINGEPYF LRGINRHQDY PHVGYATSKQ ADYRDAIKIK EAGFDYVRLS
HYPHSPAFMD AADEVGLVLI DAVLGWQYFS PEPEFAEHVY QSCRDLLHRD RNHASVLAWE
CSLNETPMPV AFIDNLKAII QAELPGAMSA GWQTQYDIYL QARQHRMQHY QTPTQPYNVS
EYGDWEYYAQ NAGLNQQAWA DLKDDERTSR QLLGAGEKRL LQQAMNLQEA HNDNLHTPAF
ADGYWVMFDY NRGYANDLES SGIMSIYRQP KFSYYLFQSQ RDPALTSTKY NAGPMVYIAS
DWQADSSPNV RVFSNADAVE LWLNGKLITR QIPDNGANTD KLPHPPFTFH LPSFEAGELS
AKAFIGNQFV ATHSVRTPQA IHALDISLDT AGVPINSAGS DVLFVNVRLV DANGTTVPVN
DKVVSFEVNG AIQVLNPDAI VTEKGVASAL VRVHNGAKGA HLKASFAAEN TAAPLTAELK
F