Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2908 |
Symbol | scrB |
ID | 6145589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2981479 |
End bp | 2982882 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617777 |
Product | beta-fructofuranosidase |
Protein accession | YP_001744932 |
Protein GI | 170680697 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | [TIGR01322] sucrose-6-phosphate hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0242255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCAC AAATGCCCGC GATTTTACAG GCAGTGATGA AAGGTCAGTC CAAAGCACAG GCAGATGCCC ACTATCCTTG TTGGCATCTC GCTCCCGTCA CCGGACTAAT GAACGATCCA AATGGCTTTT GCTGGTCAGG AGGCCGCTAT CATCTGTTTT ATCAGTGGAA TCCATTGGAT TGTAATCATA AGTACAAATG TTGGGGGCAC TGGAGTTCGA CGGATTTATT GCACTGGCAG CATGAACCAT TAGCGCTAAT GCCCGATAAA GAGTATGACC GTAATGGTTG TTACTCAGGA AGCGCGGTTA ATAATCAGGG CGTTCTCACA CTTTGCTATA CCGGTAATGT TAAGTTTGAT GATGGCAGCA GAACGGCATG GCAATGTCTG GCCACCGAAA ATAACCAGGG CGGGTTTGAT AAATTAGGCC CTGTCATACC GTTACCTGAT GGTTACACCG GACATGTTCG TGATCCAAAA GTCTGGAAGC ACAACAGCCG ATGGTACATG GTGCTTGGCG CGCAAGATAA AGAGAAACGG GGGAAAGTGC TGCTGTATTC CTCTGTCGAT CTCTATACCT GGTCTTTTCA TGGAGAAATT GCTGGTAATG GTCTGAATGA AATTGATAAC GCAGGATACA TGTGGGAATG TCCTGATCTT TTCGCCCTTG ATGGTGAATA CATTCTCCTC TGCTGCCCCC AGGGAATGGC GCGCGAACAT GAGCGCTATC TGAATACCTA TCCCTGCGCC TGGCTACATG GACAGTTTGA TTACGAGACA GGCAAATTTA CGCATGGCGC TTTCTCAGAG CTGGATGCGG GATTTGAATT CTATGCACCA CAGACGATGG AAGCCCCGGA CGGGCGACGA TTACTGGTCG GCTGGATGGG GGTTCCTGAT GGCGAAGAGA TGTTGCAACC AACCAGAAAG CATCATTGGC AGCATCAAAT GACCTGTTTT CGCGAACTCT CATTCCAGAA GGGAAAATTA TTTCAGATGC CAATCAGGGA ACTCGCGCAA CTGCGTGAAG CTGAACACTT CTGGCAAGGA AAAGCGGATC ATGCGCCGCC TGTCGCAATT GAACGTCTGG AAATGGACAT CATTCCATCA GGCGAGCTAC ATCTGAATTT TGGCAACGCA CTGGCACTGC ATCTTAATGA CGATGGTATT CAACTACAGA GGAAAAGCCT TGCAGGGCAA GAGAAACTAA CTCGTTACTG GCGAGGAAGC GTAACGTCAC TGAAGATTCT CTGTGACAGT TCCAGCATAG AGATATTCAT CAATAATGGC GAAGGCGTAA TGAGCAATCG TTATTTTCCC CATCATCCAG CTTCGCTGAT ATTACAGGGC GAGTCTGACG TCACGTTACA CTACTGGTCG CTACGCGCCT GCATGGTAGA ATGA
|
Protein sequence | MPSQMPAILQ AVMKGQSKAQ ADAHYPCWHL APVTGLMNDP NGFCWSGGRY HLFYQWNPLD CNHKYKCWGH WSSTDLLHWQ HEPLALMPDK EYDRNGCYSG SAVNNQGVLT LCYTGNVKFD DGSRTAWQCL ATENNQGGFD KLGPVIPLPD GYTGHVRDPK VWKHNSRWYM VLGAQDKEKR GKVLLYSSVD LYTWSFHGEI AGNGLNEIDN AGYMWECPDL FALDGEYILL CCPQGMAREH ERYLNTYPCA WLHGQFDYET GKFTHGAFSE LDAGFEFYAP QTMEAPDGRR LLVGWMGVPD GEEMLQPTRK HHWQHQMTCF RELSFQKGKL FQMPIRELAQ LREAEHFWQG KADHAPPVAI ERLEMDIIPS GELHLNFGNA LALHLNDDGI QLQRKSLAGQ EKLTRYWRGS VTSLKILCDS SSIEIFINNG EGVMSNRYFP HHPASLILQG ESDVTLHYWS LRACMVE
|
| |