Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3823 |
Symbol | treF |
ID | 6143525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3890229 |
End bp | 3891878 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618649 |
Product | trehalase |
Protein accession | YP_001745789 |
Protein GI | 170680301 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.121975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAATC AGAAAATTCA AAACCCTAAT CCAGACGAAC TGATGATCGA GGTCGATCTC TGCTATGAGC TGGACCCGTA TGAATTAAAA CTGGATGAGA TGATCGAGGC AGAACCGGAA CCCGAGATGA TTGAAGGGCT GCCCGCCTCT GATGCGCTGA CGCCTGCCGA TCGCTATCTC GAACTGTTCG AGCATGTTCA GTCGGCGAAA ATTTTCCCCG ACAGTAAAAC CTTTCCCGAC TGCGCACCTA AAATGGACCC GCTGGATATT TTAATCCGCT ACCGTAAAGT GCGCCGTCAT CGTGATTTTG ACTTGCGCAA GTTTGTTGAA AACCACTTCT GGCTGCCGGA GGTCTACTCC AGCGAGTATG TATCGGACCC GCAAAATTCC CTGAAAGAGC ATATCGACCA GCTGTGGCCG GTGCTAACCC GCGAACCGCA GGATCACATT CCGTGGTCTT CTCTGCTGGC GCTGCCGCAG TCATATATTG TCCCGGGCGG CCGTTTTAGC GAAACCTACT ATTGGGATTC CTATTTCACC ATGCTGGGGC TGGCGGAAAG TGGTCGGGAA GATTTGCTGA AATGCATGGC CGATAACTTC GCCTGGATGA TCGAAAACTA CGGTCACATC CCCAACGGCA ACCGCACCTA TTATTTGAGC CGCTCGCAAC CACCGGTTTT TGCGCTGATG GTGGAGTTGT TTGAAGAAGA TGGTGTACGC GGCGCGCGCC GCTATCTCGA CCACCTGAAA ATGGAATATG CCTTCTGGAT GGACGGTGCA GAATCGTTGA TCCCTAATCA GGCCTATCGC CATGTTGTGC GGATGCCGGA CGGATCGCTG CTCAACCGTT ACTGGGACGA TCGCGACACG CCGCGTGACG AATCCTGGCT TGAAGACGTT GAAACCGCGA AACATTCTGG TCGCCCGCCT AACGAGGTGT ACCGCGATTT ACGTGCGGGG GCGGCCTCCG GTTGGGATTA CTCTTCCCGT TGGCTGCGTG ATACTGGTCG TCTGGCGAGC ATTCGTACCA CCCAGTTCAT CCCCATCGAT CTGAATGCCT TCCTGTTTAA ACTGGAGAGC GCCATCGCCA ACATCTCGGC GCTGAAAGGC GAGAAAGAGA CAGAAGCACT GTTCCGCCAG AAGGCCAGTG CCCGTCGCGA TGCGGTAAAC CGTTACCTCT GGGATGATGA AAACGGTATC TACCGCGATT ACGACTGGCG ACGCGAGCAA CTGGCGCTGT TTTCCGCTGC CGCCATTGTG CCGCTCTATG TCGGTATGGC GAACCATGAA CAGGCCGATC GTCTGGCAAA CGCCGTGCGC AGCCGATTAC TGACACCTGG CGGGATTCTG GCAAGCGAGT ACGAAACCGG TGAACAGTGG GATAAACCCA ACGGCTGGGC ACCGTTACAA TGGATGGCGA TTCAGGGATT TAAAATGTAT GGCGATGACC TTCTGGGTGA TGAAATCGCG CGCAGCTGGC TGAAGACGGT GAATCAGTTC TATCTGGAAC AGCACAAACT GATCGAGAAA TACCATATTG CCGATGGTGT CCCTCGCGAA GGCGGCGGTG GCGAGTATCC GTTGCAGGAT GGGTTTGGCT GGACTAACGG TGTGGTGCGC CGTTTAATTG GTTTGTACGG TGAACCATAA
|
Protein sequence | MLNQKIQNPN PDELMIEVDL CYELDPYELK LDEMIEAEPE PEMIEGLPAS DALTPADRYL ELFEHVQSAK IFPDSKTFPD CAPKMDPLDI LIRYRKVRRH RDFDLRKFVE NHFWLPEVYS SEYVSDPQNS LKEHIDQLWP VLTREPQDHI PWSSLLALPQ SYIVPGGRFS ETYYWDSYFT MLGLAESGRE DLLKCMADNF AWMIENYGHI PNGNRTYYLS RSQPPVFALM VELFEEDGVR GARRYLDHLK MEYAFWMDGA ESLIPNQAYR HVVRMPDGSL LNRYWDDRDT PRDESWLEDV ETAKHSGRPP NEVYRDLRAG AASGWDYSSR WLRDTGRLAS IRTTQFIPID LNAFLFKLES AIANISALKG EKETEALFRQ KASARRDAVN RYLWDDENGI YRDYDWRREQ LALFSAAAIV PLYVGMANHE QADRLANAVR SRLLTPGGIL ASEYETGEQW DKPNGWAPLQ WMAIQGFKMY GDDLLGDEIA RSWLKTVNQF YLEQHKLIEK YHIADGVPRE GGGGEYPLQD GFGWTNGVVR RLIGLYGEP
|
| |