Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_03367 |
Symbol | treF |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 3530780 |
End bp | 3532429 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | cytoplasmic trehalase |
Protein accession | ACT45168 |
Protein GI | 253979498 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00222705 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAATC AGAAAATTCA AAACCCTAAT CCAGACGAAC TGATGATCGA AGTCGATCTC TGCTATGAGC TGGACCCGTA TGAATTAAAA CTGGATGAGA TGATCGAGGC AGAACCGGAA CCCGAGATGA TTGAAGGGCT GCCCGCCTCT GATGCGCTGA CGCCTGCCGA TCGCTATCTC GAACTGTTCG AGCATGTTCA GTCGGCGAAA ATTTTCCCCG ACAGTAAAAC CTTTCCCGAC TGCGCACCCA AAATGGACCC GCTGGATATT TTAATCCGCT ACCGTAAAGT GCGCCGTCAT CGTGATTTTG ACTTGCGCAA GTTTGTTGAA AATCACTTCT GGCTGCCGGA GGTCTACTCC AGCGAGTATG TATCGGACCC GCAAAATTCC CTGAAAGAGC ATATCGACCA GCTGTGGCCG GTGCTAACCC GCGAACCACA GGATCACATT CCGTGGTCTT CTCTACTGGC GCTGCCGCAG TCATATATTG TCCCGGGCGG CCGTTTTAGC GAAACCTACT ATTGGGACTC CTATTTCACC ATGCTGGGGC TGGCGGAAAG TGGTCGGGAA GATTTACTGA AATGCATGGC CGATAACTTC GCCTGGATGA TCGAAAACTA TGGTCACATC CCCAACGGCA ACCGCACCTA TTATTTGAGC CGATCGCAAC CACCGGTTTT TGCGCTGATG GTGGAGTTGT TTGAAGAAGA TGGTGTACGC GGTGCGCGCC GCTATCTCGA CCACCTTAAA ATGGAATATG CCTTCTGGAT GGACGGTGCA GAATCGTTGA TCCCTAATCA GGCCTATCGC CATGTTGTGC GGATGCCGGA CGGATCGCTG CTCAACCGTT ATTGGGACGA TCGCGACACG CCGCGTGACG AATCCTGGCT TGAGGACGTT GAAACCGCGA AACATTCTGG TCGCCCGCCC AACGAGGTGT ACCGCGATTT ACGCGCGGGA GCGGCCTCAG GTTGGGATTA CTCTTCCCGT TGGCTGCGTG ATACTGGTCG TCTGGCGAGC ATTCGTACCA CCCAGTTCAT CCCCATCGAT CTGAATGCCT TCCTGTTTAA ACTGGAGAGC GCCATCGCCA ACATCTCGGC GCTGAAAGGC GAGAAAGAGA CAGAAGCGCT GTTCCGCCAG AAGGCCAGTG CCCGTCGCGA TGCGGTAAAC CGTTACCTCT GGGATGATGA AAACGGCATC TACCGCGATT ACGACTGGCG ACGCGAACAA CTGGCGCTGT TTTCCGCTGC CGCCATTGTG CCGCTCTATG TCGGCATGGC GAACCATGAA CAGGCCGATC GTCTGGCAAA CGCCGTACGC AGCCGGTTAC TGACACCTGG CGGGATTCTG GCAAGCGAGT ACGAAACCGG TGAACAGTGG GATAAACCCA ATGGCTGGGC ACCGTTACAA TGGATGGCAA TTCAGGGATT TAAAATGTAT GGCGATGACC TTCTGGGTGA TGAAATCGCG CGCAGCTGGC TGAAAACGGT GAATCAGTTC TATCTGGAAC AGCACAAAAT GATCGAGAAA TACCATATTG CCGATGGTGT TCCCCGCGAA GGCGGCGGTG GCGAGTATCC GTTGCAGGAT GGGTTTGGCT GGACTAACGG TGTGGTACGC CGTTTAATTG GTTTGTACGG CGAACCATAA
|
Protein sequence | MLNQKIQNPN PDELMIEVDL CYELDPYELK LDEMIEAEPE PEMIEGLPAS DALTPADRYL ELFEHVQSAK IFPDSKTFPD CAPKMDPLDI LIRYRKVRRH RDFDLRKFVE NHFWLPEVYS SEYVSDPQNS LKEHIDQLWP VLTREPQDHI PWSSLLALPQ SYIVPGGRFS ETYYWDSYFT MLGLAESGRE DLLKCMADNF AWMIENYGHI PNGNRTYYLS RSQPPVFALM VELFEEDGVR GARRYLDHLK MEYAFWMDGA ESLIPNQAYR HVVRMPDGSL LNRYWDDRDT PRDESWLEDV ETAKHSGRPP NEVYRDLRAG AASGWDYSSR WLRDTGRLAS IRTTQFIPID LNAFLFKLES AIANISALKG EKETEALFRQ KASARRDAVN RYLWDDENGI YRDYDWRREQ LALFSAAAIV PLYVGMANHE QADRLANAVR SRLLTPGGIL ASEYETGEQW DKPNGWAPLQ WMAIQGFKMY GDDLLGDEIA RSWLKTVNQF YLEQHKMIEK YHIADGVPRE GGGGEYPLQD GFGWTNGVVR RLIGLYGEP
|
| |