Gene Sbal223_2151 details

Gene Information

Locus tag: Sbal223_2151
Symbol:
ID: 7085957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2555209
End bp: 2556387
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 49%
IMG OID: 643461053
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_002358077
Protein GI: 217973326
COG category: [R] General function prediction only
COG ID: [COG3940] Predicted beta-xylosidase
TIGRFAM ID:
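
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 2555209
END_BP = 2556387


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1179 392
```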


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.923723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.103273
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCAA ACCTAGCCAT GGATAAGTTA ACCTCAATGA CTCGCTCTAA ACCTATTAGC 
TATAAGCATG GTAACGCTAA TCGAATTGAC CCTAAATCAA ACCGTTTGTC GGCACTGGCG
GTGTCAATTG CGCTGTCGGT TTGTATGGCA AGCCCTGCGA ATGCAGTGCC TGTGATTGAG
CATAAAGCTG TCGTTCATCA GGCCATTGAT GCGCAAAAGA CCACATCCAC CATTCAAAAT
TTGCCTTTTA TTGCCCGTAG TGCCGATCCT TGGGTGATTA AAGCCGACGA TGGCAGTTAT
TACTTTATCG CCTCTGTACC TGAATTCGAT CGAATAGAAC TGCGCCATGC CGCCACGATT
CAAGGGCTGA GCCAAGCAAA GCCTAAGATC ATTTGGCGTA AGCATGAGTC TGGGCCCATG
AGCATCGACA TTTGGGCGCC CGAACTACAT AAGATTGATG GCCGTTGGTA TATCTATTAT
GCCGCCAGTG ATAAAGACCT ACGTTTTCAT AACCGTATGT TTGTATTAGG CTTAAATGGC
GATGATCCGA TGGCGGGTGA GTGGCAGGAA CTCGGACGCC TTAAGACGGC GCACGATGCA
TTTTCTTTAG ATGCCACCAG CTTTCAAGTA GGCGAACAGC GTTATTTTAT TTGGGCGCAG
CAAGATGAGG CTAAAAGTTA CAACACGGGA TTGGTGATCG CCAAAATGGT ATCGCCGACG
CAGGCGTCAG CACAGGAAAC CATCATCACC GAACCTTTGC TCAACTGGGA AAGACTGGGT
TTTAAAGTCA ACGAAGGCGC TGCCGTGCTC ATTAAAAATG GCAAAGTCTT TGTGACCTAT
TCCGCCAGCG CCACAGATGA TCGCTACGCT ATAGGTTTAC TGTGGGCGGA TCAAACGGCC
GATCTCCTCG ATCCCAAGAG CTGGCATAAA GCACCCACAC CCGTATTTAG CAGTAATCCA
GCGCTTAAAC GTTTTGGTCC AGGGCACAAC AGCTTTGTAC TGGCAGAAGA CGGTAAGACG
GAGTTAATGT TCTACCACGC CCGCAATTAC CTTGAACTGC AGGGAACGCC ACTCACCGAC
GGCAATCGCC ATAGCTATTA TCGCGCGATA TCCTGGTCAG CAGATGGCAT ACCACAGTTT
GTTAATGAGC TTAGCGATGA ACAAACGCTT GCTAAGTGA
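
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The sketch below is a minimal illustration, not the database's own method: it strips the spaces and line breaks used in the 10-base block layout and counts G and C. The function name is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Usage: paste the full gene sequence (all rows) into one string.
first_row = "ATGGACGCAA ACCTAGCCAT GGATAAGTTA ACCTCAATGA CTCGCTCTAA ACCTATTAGC"
print(round(gc_percent(first_row), 1))
```

Only the first row is used here, so the printed value describes that row and not the whole gene.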
 
Protein sequence
MDANLAMDKL TSMTRSKPIS YKHGNANRID PKSNRLSALA VSIALSVCMA SPANAVPVIE 
HKAVVHQAID AQKTTSTIQN LPFIARSADP WVIKADDGSY YFIASVPEFD RIELRHAATI
QGLSQAKPKI IWRKHESGPM SIDIWAPELH KIDGRWYIYY AASDKDLRFH NRMFVLGLNG
DDPMAGEWQE LGRLKTAHDA FSLDATSFQV GEQRYFIWAQ QDEAKSYNTG LVIAKMVSPT
QASAQETIIT EPLLNWERLG FKVNEGAAVL IKNGKVFVTY SASATDDRYA IGLLWADQTA
DLLDPKSWHK APTPVFSSNP ALKRFGPGHN SFVLAEDGKT ELMFYHARNY LELQGTPLTD
GNRHSYYRAI SWSADGIPQF VNELSDEQTL AK
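
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch under stated assumptions: the codon-to-amino-acid assignments of translation table 11 are taken to match the standard genetic code (the tables differ in which start codons are permitted, which does not matter here because this gene starts with ATG), and translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGACGCAA ACCTAGCCAT GGATAAGTTA"))  # prints: MDANLAMDKL
```

The printed residues match the first block of the protein sequence, and translating the last row of the gene sequence gives the final residues before the TGA stop codon.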