Gene Sbal223_2156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2156 
Symbol 
ID7085962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2562069 
End bp2563127 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content46% 
IMG OID643461057 
ProductArabinan endo-1,5-alpha-L-arabinosidase 
Protein accessionYP_002358081 
Protein GI217973330 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000116592 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00999646 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCACCAC TGTCTGTTTC ATCTAAAAAA TACCTTAAGC GGTTCAACTA TGCCTTGATG 
CTCGGCCTTC TTGGCACTGT CGGTAGCGTA AGCGCAAAAC AGGTGAGTAT TCATGATCCG
GTAATGGCGC AAGAAGCGGG TAACTATTAT CTGTTTAGTA CCGGCCCTGG CATTACCTAT
TATTCTTCAA AGGATAAAGT GCACTGGGCA TTAGCGGGCA GAGTATTCGA CACTGAGCCC
ACATGGGCCC GCAGTGTTGC ACCGGGTTTT AATGGTCACC TGTGGGCGCC GGATATCATT
GAGCAAAACG GCAGTTTTTA CCTTTATTAC TCTGTCTCGG CCTTTGGTAA AAATACCTCC
GCGATTGGTG TAACCGTCAA TAAAACCCTT GATAAAAAAT CAAAAGATTA TCAGTGGGTT
GATAAGGGGA TTGTGTTGCA GTCCATTCCC GACCGCGATG CGTGGAATGC TATTGACCCG
AATATTATTG TTGATGAACA AGGCACGCCT TGGATGAGTT TCGGTTCATT CTGGCAAGGG
TTGAAGCTCG TTAAACTCAA TAGTGACTTT ATCTCGATTG CCGAGCCGCA GGAATGGCAT
ACCTTAGCCA AGCTAGCGCG TCCTGCACTG CTAGCAGAAA CCGAACCCGG CCCAGCGCAA
ATTGAAGCAC CGTTTATTTA TAAAAAAGCG GATTTTTACT ATTTATTTGT TTCCTACGGT
CTTTGCTGCC GTGGTGACGA CAGTACCTAT CATTTAGCGG TTGGCCGCTC GAAATCAGTG
ACGGGCCCTT ATCTTGATAA AACCGGTAAA GACATGGCTC AAGGTGGAGG CTCCGTGTTG
CTTAACGGTA CTAAGGCATG GCCAGGATTG GGGCACAACA GCGTGTATCA ATTTGATGGA
AAAGATTATT TAGTCTTTCA CGCCTATGAA TCCGCCGATC ACGGCTTACA AAAACTCAAA
ATAGCTGAAC TGACATGGAA CCAAGGTTGG CCAGTGGTCG ACCCTAACGC GCTCACCCAA
TATCAAAGTG TATTAGTTGA CTCAGTAGGA AATAAATAA
 
Protein sequence
MPPLSVSSKK YLKRFNYALM LGLLGTVGSV SAKQVSIHDP VMAQEAGNYY LFSTGPGITY 
YSSKDKVHWA LAGRVFDTEP TWARSVAPGF NGHLWAPDII EQNGSFYLYY SVSAFGKNTS
AIGVTVNKTL DKKSKDYQWV DKGIVLQSIP DRDAWNAIDP NIIVDEQGTP WMSFGSFWQG
LKLVKLNSDF ISIAEPQEWH TLAKLARPAL LAETEPGPAQ IEAPFIYKKA DFYYLFVSYG
LCCRGDDSTY HLAVGRSKSV TGPYLDKTGK DMAQGGGSVL LNGTKAWPGL GHNSVYQFDG
KDYLVFHAYE SADHGLQKLK IAELTWNQGW PVVDPNALTQ YQSVLVDSVG NK