Gene Sama_1402 details

Gene Information

Locus tag: Sama_1402
Symbol:
ID: 4603654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 1701207
End bp: 1702565
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 54%
IMG OID: 639780752
Product: Beta-glucosidase
Protein accession: YP_927279
Protein GI: 119774539
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
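
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (neither is stated in this record); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1701207
end_bp = 1702565

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1359
print(protein_length_aa)  # 452
```

Under these assumptions the computed values agree with the listed gene length (1359 bp) and protein length (452 aa).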


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00588198
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0441628
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAC TCTCGCATTT CACCCTGCCG GGCGATTCAG TAATGATGCA AAAAGATTTT 
TTATTTGGCG TCGCCACCGC CTCGTTTCAA ATTGAAGGCG ACGCTGAACA TCGTCAGCCC
TGTATCTGGG ACACCTTTTG CGATACCCCA GGCAAAATTG CGGATGGTTC GAACGGTCAG
GTTGCCTGCG ATCATGTCAA GCTCTGGCGT GACGATGTTG ACCTGATTGC CTCCCTTGGG
GTGGATGCCT ACCGCCTGTC CATCAGTTGG GGACGGGTGT TACATCCCGA TGGCAGTGTG
AACCAGCGCG GCATGGATTT TTACATTAAT CTCCTGGATG AGCTTGGTCG CCGGGGCATT
AACGTGTTCG TCACCCTCTA CCACTGGGAC TTACCTCAGC ATCTTGAGGA CAAAGGTGGC
TGGCTCAATC GTGACACAGC AGTGGCCTTT GCCAACTACG CCGCCATTGT GGCCAACGCC
CTGGGTAACA GGGTGTATGC CTATTCAACC CTGAATGAGC CATTTTGCAG CGCCTTTCTC
GGCTATGAGG CAGGTATTCA CGCTCCCGGC CACAAGAGCC GTCAGCAGGG CCGCACAGCC
GCCCACAATT TGCTGCTGGC CCACGGTATG GCAATGACTG AAATTCGACG GGAAGCACCA
GAGGCCAAAG CGGGCATAGT GCTTAATTTC AGCCCGGCTT ATCCCTACAC ATCCAGTGCC
GGGGATGCCA ACGCCGCCCG ACTGGCCCAT GAATATCACA ACACCTGGTA CTTGATGCCA
CTGATGGAAG GCCGTTATCC GGACATCATC AATCAACTCG AGCCCCATGA ACGCCCGGTT
GTGGAGCCCG GTGATATGGA TATCATCAGT ACACCAATCG ATTATCTGGG GATCAACTAC
TATACCCGTA ACGTCTACCG CGCTGGCGGC CCGCTTGGCT TTGAAGAAGT GCGTATCGAT
AACGTGCCCC GTACCGCCAT GGATTGGGAA ATTTGCCCCC AGGCCTTTAC CGACTTGCTG
ACAGGTCTGG CACAGGAATT TAACCTGCCA CCAATTTACA TCACTGAAAA TGGCGCTGCC
GAAGACGATG CGCCATTTAA CGGCACTGTG CACGACCCCA TGCGACTGGA CTATTTGCAG
TCTCATCTGC TGGCTGTTCA TCAGGCTATC GAACGCGGAG TGGATATCAA AGGCTACTTT
GCCTGGAGTC TGATGGACAA CTTTGAGTGG GCGGAAGGCT ACCGCAAACG CTTTGGACTG
GTCTATGTCG ACTATGGGAC CCAGCAGCGC ATACTCAAAT CCAGCGCCAA AGCCTATCAG
GGAATGCTTG CCATACGCCA AGAGGCCAGC CAACAATAA
 
Protein sequence
MTTLSHFTLP GDSVMMQKDF LFGVATASFQ IEGDAEHRQP CIWDTFCDTP GKIADGSNGQ 
VACDHVKLWR DDVDLIASLG VDAYRLSISW GRVLHPDGSV NQRGMDFYIN LLDELGRRGI
NVFVTLYHWD LPQHLEDKGG WLNRDTAVAF ANYAAIVANA LGNRVYAYST LNEPFCSAFL
GYEAGIHAPG HKSRQQGRTA AHNLLLAHGM AMTEIRREAP EAKAGIVLNF SPAYPYTSSA
GDANAARLAH EYHNTWYLMP LMEGRYPDII NQLEPHERPV VEPGDMDIIS TPIDYLGINY
YTRNVYRAGG PLGFEEVRID NVPRTAMDWE ICPQAFTDLL TGLAQEFNLP PIYITENGAA
EDDAPFNGTV HDPMRLDYLQ SHLLAVHQAI ERGVDIKGYF AWSLMDNFEW AEGYRKRFGL
VYVDYGTQQR ILKSSAKAYQ GMLAIRQEAS QQ
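
The relationship between the gene and protein sequences above can be illustrated by translating a few codons. This is a minimal sketch, not the database's own method: it builds the codon table by hand from the standard amino-acid ordering used for translation table 11, and the helper name `translate` is hypothetical. Only the first and last rows of the gene sequence are translated, to keep the example short.

```python
from itertools import product

# Codon table for translation table 11 (same amino-acid assignments as the
# standard code), built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record above.
# Each full row holds 60 bases, so the last row also starts in frame.
first_row = "ATGACAACAC TCTCGCATTT CACCCTGCCG GGCGATTCAG TAATGATGCA AAAAGATTTT"
last_row = "GGAATGCTTG CCATACGCCA AGAGGCCAGC CAACAATAA"

print(translate(first_row))  # MTTLSHFTLPGDSVMMQKDF
print(translate(last_row))   # GMLAIRQEASQQ*
```

The two outputs match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAA stop codon that is not shown in the protein.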