Gene Sbal223_3174 details

Gene Information

Locus tag: Sbal223_3174
Symbol:
ID: 7085787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3758694
End bp: 3760049
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 45%
IMG OID: 643462058
Product: beta-galactosidase
Protein accession: YP_002359082
Protein GI: 217974331
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
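
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3758694
END_BP = 3760049
GENE_LENGTH_BP = 1356
PROTEIN_LENGTH_AA = 451


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```
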


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00114145
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.862939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAT CTTTACCAAA GAACTCGATA CTCCAAAGCG AAGCGTTTAC TTTTGGTGTT 
GCGACCGCTT CCTTTCAAAT CGAAGGTGGC GTGGACTCTC GCCAAACCTG TATTTGGGAT
ACCTTCTGTG CAACACCAGA TAAAATCCGT GATGCCTCCA ATGGCGATGT CGCCTGCAAC
CACCTGAATC TATGGCAAGA AGATATCACC TTAATCGCGT CACTCGGGGT TGATGCCTAT
CGTTTTTCCA TCGCATGGGG ACGGGTCTTA AATCAAGATG GCAGCATTAA TCAGCAGGGA
GTTAATTTCT ACATTGGCAT TCTAGACGAA CTAAAACGTA GAAATATCAA AGCATTTGTC
ACGCTTTACC ATTGGGATCT TCCTCAACAT ATTGAGGATC AAGGCGGCTG GTTAAACCGA
GATACCGCTT ACCTTTTCAA AGACTATGCT GACAAAATAA GCCAAGCCTT CGGCGACCGA
GTGTATTCCT ACGCCACTTT AAACGAACCC TTTTGCAGCT CATATTTAGG CTATGAGGCA
GGCATTCACG CCCCAGGTTT AATGAAAAAA GCCTATGGCC GTCAATCGGC TCACCACTTA
TTGCTCGCCC ACGGCTTAGC GATGCAAGTA CTGCAAAAGA ACAGCCCTAA CAGCATGAAT
GGCATAGTTC TTAACTTCAC GCCTTGCTAC GCATTGACAG AAAGTGCTGC CGATATTCAA
GCCGCAAAAC AAGCCGATGA TTACTTTAAC CAGTGGTATA TCAAGCCCAT TTTCGATGCG
GTATACCCAG ACCTTCTCAC AGCATTAGCG CCAGAAGACA GACCGGAAAT TCACGACGGC
GACCTTGAGC TTATCAGTCA ACCAATTGAT TTTTTAGGGG TTAACTTTTA TACCCGCGCC
GTATATCAGG CCGATGCAGA ACAAGGATTT GTGCAAGTTG ATTTACCTGG GGTACCTAAA
ACCGACATAG GCTGGGAGAT CCATCCACAG GCTTTTACCG ATTTACTGGT TTCTTTAAAT
CAAACCTATG ATTTACCGCC TATTTTCATC ACAGAAAATG GCGCCGCTAT GGACGATAAA
TGCATTGATG GGCGTGTCGA TGACTTCGAT AGGCTCAGCT ATTACCAACA CCATTTAACC
GCAGTAGACA ATGCCATAGT ACAAGGTGTT AACATTCAGG GTTACTTTGC CTGGAGCTTG
ATGGATAATT TTGAGTGGGC CGAAGGCTAC TTAAAACGTT TTGGCATTGT CTATGTGGAT
TATGCAAGCC AAACCCGAAC GATAAAGGCC AGTGGTCAAG CCTACAGCGA CTTGATTCGC
TCAAGGGCTC ACTTAACTAA TAACAATAAT AAATAA
 
Protein sequence
MKISLPKNSI LQSEAFTFGV ATASFQIEGG VDSRQTCIWD TFCATPDKIR DASNGDVACN 
HLNLWQEDIT LIASLGVDAY RFSIAWGRVL NQDGSINQQG VNFYIGILDE LKRRNIKAFV
TLYHWDLPQH IEDQGGWLNR DTAYLFKDYA DKISQAFGDR VYSYATLNEP FCSSYLGYEA
GIHAPGLMKK AYGRQSAHHL LLAHGLAMQV LQKNSPNSMN GIVLNFTPCY ALTESAADIQ
AAKQADDYFN QWYIKPIFDA VYPDLLTALA PEDRPEIHDG DLELISQPID FLGVNFYTRA
VYQADAEQGF VQVDLPGVPK TDIGWEIHPQ AFTDLLVSLN QTYDLPPIFI TENGAAMDDK
CIDGRVDDFD RLSYYQHHLT AVDNAIVQGV NIQGYFAWSL MDNFEWAEGY LKRFGIVYVD
YASQTRTIKA SGQAYSDLIR SRAHLTNNNN K
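
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below uses only the Python standard library and assumes the gene sequence is given 5' to 3' on the coding strand, starting at the ATG. It builds the codon table by hand: translation table 11 assigns amino acids to codons exactly as the standard table does and differs only in which codons may serve as start codons, which does not matter for an ATG start. The helper names (`clean`, `translate`, `gc_percent`) are illustrative only.

```python
# Amino-acid assignments with codons ordered T, C, A, G at each position.
# Translation table 11 shares these assignments with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for the 10-letter display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last four display blocks of the gene sequence above.
first_blocks = "ATGAAAATAT CTTTACCAAA GAACTCGATA"
last_blocks = "TCAAGGGCTC ACTTAACTAA TAACAATAAT AAATAA"
print(translate(first_blocks))  # MKISLPKNSI
print(translate(last_blocks))   # SRAHLTNNNNK
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.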