Gene Sbal195_2523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_2523 
Symbol 
ID5754282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp2981426 
End bp2982454 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content52% 
IMG OID641288817 
Productbeta-hexosaminidase 
Protein accessionYP_001554951 
Protein GI160875635 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.352424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.669465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTATT TAATGTTGGA CCTACTGTCG TTGGACGTCA GTGAGGCAGA ATCTGAGATG 
CTACGCCATC CACAGGTGGG TGGTTTAATT CTATTTTCCC GCAATTTTTC AAGCCGTGAT
CAGCTGATCC GTCTGGTACA ACAGATACGT CAAATCCGCC CTGAACTATT AATTGCCGTC
GATCATGAGG GCGGTCGAGT ACAACGCTTC CGCGATGGCT TTACCGTTAT TCCCGCCATG
GGCGACATTT TACCCGCGGC AAAAGGTGAT ATAGCGTTAG CCAAACGCTG GGCCTGTGAG
CTCGGTTTCT TGATGGCGAT TGAACTACTG GCCTGCGATA TCGACTTAAG TTTCGCGCCC
GTATTGGACT TAAACGGCGT GAGCCAAGTG ATTGGTAAAC GCAGTTTTAG CCCTGAGCCT
GCCGAGGTGA TTACACTGGC CGAAAGCTTT ATTGCCGGCA TGGCCGCCGC TGGCATGGGC
GCCGTGGGTA AACATTTCCC CGGACACGGC AGCGTAGTGG CGGATTCGCA CTACGAGAAA
CCGATTGATG AGCGCGATGC CGAGGCGATT TTTGCGAAGG ATATCCTGCC GTTTAAAGAA
TTGATCGCAA AAGAAAAGTT ATTAGGCGTG ATGCCCGCGC ACGTGGTTTA TCCTAAAGTC
GACCCGAACT CTGCGGGTTT TTCTGAATAC TGGCTAAAAC AAGTGCTGCG CAAAGAACTT
GGCTTTAACG GGGTGATTTT CTCCGACGAT CTCGGCATGC AAGGCGCAGG ATTTGCGGGC
GATTACCGAG CAAGGGCCAG CGCGGCGTTA GCTGCCGGTT GCGACATGAT TTTAGTGTGT
AATGACAATG CGGGCGTAAT GTCGCTGCTG GATGGTTTTA CATGGCCAGC GAGTGCGCCG
CAGTATCCTG CAAGTTTACT CAAGCCCAAT GCCGCACAAA CGGCCGCAGC GCTCGATAAT
ACCGCCCGTT GGGAAAACGC GAAACAGCTT GCAGAGCAAA TTTGTTTAGC CCAACAGGCG
AAAGTTTGA
 
Protein sequence
MSYLMLDLLS LDVSEAESEM LRHPQVGGLI LFSRNFSSRD QLIRLVQQIR QIRPELLIAV 
DHEGGRVQRF RDGFTVIPAM GDILPAAKGD IALAKRWACE LGFLMAIELL ACDIDLSFAP
VLDLNGVSQV IGKRSFSPEP AEVITLAESF IAGMAAAGMG AVGKHFPGHG SVVADSHYEK
PIDERDAEAI FAKDILPFKE LIAKEKLLGV MPAHVVYPKV DPNSAGFSEY WLKQVLRKEL
GFNGVIFSDD LGMQGAGFAG DYRARASAAL AAGCDMILVC NDNAGVMSLL DGFTWPASAP
QYPASLLKPN AAQTAAALDN TARWENAKQL AEQICLAQQA KV