Gene Sbal223_2157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2157 
Symbol 
ID7085963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2563130 
End bp2565013 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content49% 
IMG OID643461058 
Productglycoside hydrolase family 43 
Protein accessionYP_002358082 
Protein GI217973331 
COG category[R] General function prediction only 
COG ID[COG3940] Predicted beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000434172 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0029689 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTTATT CTCTGACTAA ATTAGTCGCC GTAATGGCGC TGTCATCCTG CTTGGGGGCA 
GGTATGGTTC ACGCATCGCC CATATCAACG GACGATAACC GCATCACCAG CGAAACGTTT
GCCAACCCTT TGTTTAGAAA TGGGGCAGAT CCTTGGCTTG AATACTTTAA TGGTAACTAT
TATCTGACCA CGACAACGTG GACTTCTGAA TTAGTGATGC GCAAATCGCC CACGATTGCC
GGACTTGCCG ATGCTCCCGC ACACAATATT TGGACCGGTG CAGATAAATC ACGTTGCTGT
AATTTTTGGG CATTTGAATT CCATCCAATG CAAACCGCCG ATGGTCTACG CTGGTATGTT
ATCTATACCT CAGGTGTTGC GGAAAACTTT GACGGTCAGC GCAATCATAT CCTTGAGAGT
GAAGGCAGCG ATCCCATGGG TCCATACACC TATAAAGGTA CGCCTATGCC GGATCATTGG
AATATCGACG GCAGTTATTT GGAATATAAG GGACAGTTGT ATTTCCTTTG GTCTGAGTGG
CACGGTAAAG ATCAAGTCAA TTTAATCGCT AAGATGAGTA ATCCTTGGAC AATTGAAGGC
GAGCATAAGG TGATCACTCA GCCTACTTAC GCTTGGGAAA AGTCCGGTCT AAACGTGAAT
GAAGGACCTG AAATCATTCA GCACCAAGGA CGCACGTTCC TTGTGCATTC GGCAAGCTTT
TGTAACACAG AAGATTACTC GCTTGGCGTG GTCGAACTTA CGGGCACCGA TCCTATGGAT
CCCGCTGCGT GGACGAAATA CGACAAACCT TTTTTCAGTA AAGCCAATGG GGTTTATGGC
CCTGGCCATC ACGGATTCTT TACCTCCCCC GACGGCAGCG AAGATTGGTT GGTTTACCAC
GGCAACTCTT CACCCACAGA TGGTTGCAGT GGCACACGAT CCGCCAGAGC CCAACCTTTT
AAATGGGATG ACAAAGGCCT GCCTAACTTT GGTGAGCCAA TGGCAGACAA GCAACCCTTA
CGCGTTCCAA GTGGTGAGTT TGGGCCATTG AAAGCCCAAG TTGAAGGGGT GAAATACCGT
ATTGTTAACC ATGATACCGA CCAGTGCCTC ATCACCAATG CCAAAGGCGA TGTCAGCGTT
AGCCGTTGTG ACGATAAAGC AAGTACTTGG GTGGTTGATC CCACCAATGA CGGACTTTAT
CGATTTGCTA ACGTGGCCGA AGGCACCTTT CTCACCCAAG AAAATTGCCA AGACAGTGAA
GCCTTGGGCG TGAGTGCTGC GCCTTGGGTT GCTTCCCGTT GCCAACGTTG GTCAGTGGAT
GCCAGCCATG ATGGCTGGTT CCGTTTTGCC AATGAGCGTT CTATTCAAAA TCTGCAAGCC
ACCAATTGCA CTACCCAAAA GGGCGCAGCT GTCGTTACCG GCGAAAACCG CGTCAGCGAT
TGTACTGACT GGCGGATTGA ACCGGTATCT CATTTAGCGA TTGTAAATGC CCACAGTGGA
CGAGTGGTGT CGGCGCAACA ATGCGACGTT AAAGCCAATG CCAATGTGGT TCAACATGAA
TATACCGCCA ATGCCTGCCA GCAATGGCAA GCAACATCCA CCAGTGATGG TTATTACCGC
CTGCAATCGA AGCAGCTAAC GGCGAACAAA CAAGCCCAAT GCTTAGTGAG TGTTGATGGC
AACTTGCAGC TGGGTGGTTG CGAGCAGGCG GACAGTGAAT GGCGAACTGA GTTTATGCCA
AATGGTTCAC TGCGTGTGGT ATCGCGTAAG GGCGGTTCAT CGATGAAAGT GGCAGGCGAG
TCCTATGCCA ATGGCGATAA TATCGTTGAG GATGTCTGGA AAAATACCAT TTCGCAGCAG
TTCTATTTCA GAGAGGTGAA ATAG
 
Protein sequence
MTYSLTKLVA VMALSSCLGA GMVHASPIST DDNRITSETF ANPLFRNGAD PWLEYFNGNY 
YLTTTTWTSE LVMRKSPTIA GLADAPAHNI WTGADKSRCC NFWAFEFHPM QTADGLRWYV
IYTSGVAENF DGQRNHILES EGSDPMGPYT YKGTPMPDHW NIDGSYLEYK GQLYFLWSEW
HGKDQVNLIA KMSNPWTIEG EHKVITQPTY AWEKSGLNVN EGPEIIQHQG RTFLVHSASF
CNTEDYSLGV VELTGTDPMD PAAWTKYDKP FFSKANGVYG PGHHGFFTSP DGSEDWLVYH
GNSSPTDGCS GTRSARAQPF KWDDKGLPNF GEPMADKQPL RVPSGEFGPL KAQVEGVKYR
IVNHDTDQCL ITNAKGDVSV SRCDDKASTW VVDPTNDGLY RFANVAEGTF LTQENCQDSE
ALGVSAAPWV ASRCQRWSVD ASHDGWFRFA NERSIQNLQA TNCTTQKGAA VVTGENRVSD
CTDWRIEPVS HLAIVNAHSG RVVSAQQCDV KANANVVQHE YTANACQQWQ ATSTSDGYYR
LQSKQLTANK QAQCLVSVDG NLQLGGCEQA DSEWRTEFMP NGSLRVVSRK GGSSMKVAGE
SYANGDNIVE DVWKNTISQQ FYFREVK