Gene Sbal223_2161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2161 
Symbol 
ID7085967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2571254 
End bp2574685 
Gene Length3432 bp 
Protein Length1143 aa 
Translation table11 
GC content46% 
IMG OID643461062 
Productglycoside hydrolase family protein 
Protein accessionYP_002358086 
Protein GI217973335 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000526501 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000457644 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTGA GTAAATTGAT GCTTGCTATG ACGGGGATCA TAGTTCTTGG AGGATGTGGC 
GGCGATGGGG ATTCTGGTTC ATTACCTCCC AATATGGTTC CAGAGAACCA ACAATTACCG
GATCCGACAA TCAAGCAGGA AATGTTAGAT TTTAATGATC TTGCAGTGCA TGATCCGTCA
ATCATTAAAG CAGACGATAC CTATTACGTT TTTGGTTCAC ATTTAGCTGC GGCTAAATCA
ACGGATCTGA TGAACTGGCA ATATTTGACT CGCATTGAGC CGGTACTTGA TGCCAATGGC
GAACCGGTTC TCGATGATGA TAATAATCCT GTCACCATCA CACGCATTGC AAATGCACCC
ACAGATCAAA ATCCATTATT TAATACAGTT TCGAGCGAAA TGGCGGCCGC TTTTAAATGG
ACTGGCGATA AAGTTCAAGG CTCGTGGGCG CCGGATGTTA TCAAACTTAA AGATGGAAAG
TATTATTTCT ATTACGATTT TTGTAACAAT CCAGATGACC CTGGCTATCC AACACCTGTG
TGTAATGCCC CACGATCTTT TATCGGCGTT GCCGTAGCTG ACGCGATAGA AGGCCCATAT
GTCGATAAAG GTGTATTTTT GCGATCAGGC TTAACGGCGG ATGAGATTAC TCAAGGTTTA
GGACCTGATG GTGTAACCGC CTATAACGGT GGCACTATGC CCAACACCAT AGACCCCGCT
GTTTTCTATG ACAAAAATGG ACAACTGTGG ATGACCTATG GATCTTACTT CGGTGGGATC
TGGATCTTAA AAATAGATGA GGCCACAGCA AAACCATTCC CGAATCAAGG CTATGGCAAG
CGGCTAACCG GTGGTGAGTT TGCGGCTAAT GAAGGCTCAT ATATTCTCTA TAGCCCCGAT
GCTAACTATT ACTATTTATT TACTTCTATT GGTGGCTTTG CGGCCAATGG TGGTTATAAC
ATTCGTGTTT CCCGCTCAGA AAATCCTGAC GGACCTTATG TCGATGCCAG TGGTTATGAC
ATGACTCAAG TTTATAAAAA TCCTGAAGAA CATGGCAATA AGATACTCGG TGGCTTCGAA
TTTGTTGCAG AGAAAGGCGA TACGGGAAGC ACTCAGGGTT ATTTATCACC AGGCCATAAC
TCAGCCTATT TTGATAAAAA TACAGGTAAA TATTTCCTCG TTATCCATAC TCGTTTCCCT
AACCGCGATG AGCAACACTC TGTACGTGTT CATGAGATGT TTATCAATGC CGATGGTTGG
TTAGTGGTAT CACCACATCG TTATGCGCCA ATTGACGGTG ATAATGTTGT CGATGCACTC
GATCTCGTCG GTGACTATCG CTTTATTAAT TTTGAAAAAG ATACCAACAC TTCACCAAAA
ACATCCGTTT ACCTCACACT GACCGAGAGT CGTCTTGTGA AAGGGGAAGT CACGGGTAAA
TATCGTTTAT ACGATAACGA ACCGAACAGA ATCACGTTAA CCCTTGATAA TTTAGGTACA
TTCGAAGGTA CTCTCGCATG GCAGTGGGAT GAGACCCTCG CTAAATTGGT ACCCACCTTC
AGCGCATTGT CGAATGAAGG CGTCTCCGTT TGGGGCTCAA AACTGCTCGA TAAGACCTCA
GAACAACTAT TAACCGACAT AGGCAACAGT ATAGATGTTA TCAGTGAAGC GACGGGTGGT
GTGATTGCGC TACCAACGCG CGGTACACGC GGTGCGACAA TTGAGTGGAC ATCAAGTGAT
GCTACTGTGA TCCACCCCGA TGGCACTGTG GTTCGCCCCA ATGTAGGGGA AGGTGACAAG
GTAGTAACTT TGACCGCCAC TATCATGGTT AACGGTAAAA AAGTCACTAA GACGTATCAA
GTGACTGTGT TTGCGAGAAA AACCTATAAC CGTATCGCGC AGTACAGTTT CGAGAACAAC
CTGACAGACT CTTTAGGGCT GTTTGCAGCT GGTCAACCTA CGGGTGATAG AATATTTAAA
GCAGCTGATT CAGTTAGTTA TAGCATGGGA CTTGAAGGGG AAGCACTGGC ACTCGATGGC
GCCCACGGCG TGTTACTGCC TACAGGGATC ATTTCAAGCT ATGAGTACAC TGTCTCATTC
TGGGCCAATC CGACAGTCAT CACTGGGTTT TCAACGGCAT TTTTTGGTGC AGTCAATGAG
GAAACCGCAG CCGATGGCGG TAAGTTCTCT AACACTTGGA TCAGCATGCT GCCCCAAGGC
TGGGATGGCA ATACCATGTT TTGGAGTAAC GGCAGTGCCT GGTTTGATGG CGTTACCGGT
GAGCGCATAC CTGAAAATAC TTGGTCACAT GTCGCGTTTT CGGTGAGCAA TGGGCTGGTA
CGTGTCTTTA TCAACGGTGT TGAAAAATTC AGTCGCGGCA ATCTCAGCGA TTATTTCAGC
GGCCAACAGG GCGTCTTTGG GCTTGGGGTT AACTACTGGG ATCTGCCCTA TAATGGTTTG
ATTGATGAAC TTAAGGTATA TGAAGCGGCA TTGACTGCAG GTGAAGTCAA AGCTTTGGAT
ATTGATAAAT TGCCAGACAG CGAATTACTA GCATCTGCAG TGGATATCCT CGACTTAGGC
GATTTATCCT CAGTACGTGA AAATTTGCAT TTGCCTGTCA CGGGTCCTTA TGCAGCAGCA
ATAACTTGGG TATCTTCAGA TCCGACTATC ATAGATACCA CAGGTGGCGT GACTCAGCCA
GGTCGTGAGG ACACTGATAA AGTGGTGACC TTGACGGCAA CCATAAAGCT GGGACAAGCC
ACTCAAACCA AAGTGTTCAT AGCAAATGTG AAGTCAATGG CGCCACCAAC ACCTTTAGCT
GTCTATAGCT TTGAAGATAA CCTAACGGAC AGCACTGCTC ACTTTGGTGC TGGCACTGTG
GTGGGTAACT TAATTGGTAC AACTGGTGGA ACCATTACTT ACACTGATGG CGCTGTGGGC
AAAGCGGCAG TATTTGACGG AAAATCGGGT ATTGAACTGC CAAACAACCT GATCAAAGAC
AATAGCTATT CAGTCTCTTT GTGGCTGAAT CCTAAAGTCT TGAACAAATA CACCAGTGCG
CTCTTTAGTT ATGCCAGTGC TTCAAGCTGG ACCAGCGTAT TACCTGGCGG ACAAAACGAC
TATGAACATG CGGTATTATG GTCTGGTGAA GCTTGGTATG ACGGTAGAAC TGGCTTTACT
ATGCCAAAAG AGACTTGGTC ACACATGGTG TACACGGTTA ATGGTGGGGA TTTGAAGGTC
TACATCAATG GTAATTTACT GTTTACTGGC GCCAACTTCC CGAACGTTTT CTCTGTACCC
ACCACAAAAT TTGCGGTCGG CGTTAACTTC TGGGATACAC CTTTCACCGG TTCTATCGAT
GAAATCAAGT TCTATGACGA AGCGATATCC GAGCAAGACG TGCAAGATCT GTTTGGCGAA
TCCAATCAAT AG
 
Protein sequence
MKLSKLMLAM TGIIVLGGCG GDGDSGSLPP NMVPENQQLP DPTIKQEMLD FNDLAVHDPS 
IIKADDTYYV FGSHLAAAKS TDLMNWQYLT RIEPVLDANG EPVLDDDNNP VTITRIANAP
TDQNPLFNTV SSEMAAAFKW TGDKVQGSWA PDVIKLKDGK YYFYYDFCNN PDDPGYPTPV
CNAPRSFIGV AVADAIEGPY VDKGVFLRSG LTADEITQGL GPDGVTAYNG GTMPNTIDPA
VFYDKNGQLW MTYGSYFGGI WILKIDEATA KPFPNQGYGK RLTGGEFAAN EGSYILYSPD
ANYYYLFTSI GGFAANGGYN IRVSRSENPD GPYVDASGYD MTQVYKNPEE HGNKILGGFE
FVAEKGDTGS TQGYLSPGHN SAYFDKNTGK YFLVIHTRFP NRDEQHSVRV HEMFINADGW
LVVSPHRYAP IDGDNVVDAL DLVGDYRFIN FEKDTNTSPK TSVYLTLTES RLVKGEVTGK
YRLYDNEPNR ITLTLDNLGT FEGTLAWQWD ETLAKLVPTF SALSNEGVSV WGSKLLDKTS
EQLLTDIGNS IDVISEATGG VIALPTRGTR GATIEWTSSD ATVIHPDGTV VRPNVGEGDK
VVTLTATIMV NGKKVTKTYQ VTVFARKTYN RIAQYSFENN LTDSLGLFAA GQPTGDRIFK
AADSVSYSMG LEGEALALDG AHGVLLPTGI ISSYEYTVSF WANPTVITGF STAFFGAVNE
ETAADGGKFS NTWISMLPQG WDGNTMFWSN GSAWFDGVTG ERIPENTWSH VAFSVSNGLV
RVFINGVEKF SRGNLSDYFS GQQGVFGLGV NYWDLPYNGL IDELKVYEAA LTAGEVKALD
IDKLPDSELL ASAVDILDLG DLSSVRENLH LPVTGPYAAA ITWVSSDPTI IDTTGGVTQP
GREDTDKVVT LTATIKLGQA TQTKVFIANV KSMAPPTPLA VYSFEDNLTD STAHFGAGTV
VGNLIGTTGG TITYTDGAVG KAAVFDGKSG IELPNNLIKD NSYSVSLWLN PKVLNKYTSA
LFSYASASSW TSVLPGGQND YEHAVLWSGE AWYDGRTGFT MPKETWSHMV YTVNGGDLKV
YINGNLLFTG ANFPNVFSVP TTKFAVGVNF WDTPFTGSID EIKFYDEAIS EQDVQDLFGE
SNQ