Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3772 |
Symbol | |
ID | 6065637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4125859 |
End bp | 4127514 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641603185 |
Product | trehalose-6-phosphate hydrolase |
Protein accession | YP_001726704 |
Protein GI | 170021750 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02403] alpha,alpha-phosphotrehalase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAATC TTCCCCACTG GTGGCAAAAC GGCGTTATCT ACCAGATTTA TCCAAAGAGT TTTCAGGACA CCACGGGTAG CGGCACCGGC GATTTACGTG GCGTTATCCA ACGCCTGGGC TATCTGCATA AACTGGGCGT TGATGCCATC TGGCTAACCC CCTTTTATGT CTCTCCCCAG GTCGATAACG GTTACGACGT AGCGAACTAT ACGGCGATTG ATCCCACTTA CGGCACGCTG GACGATTTTG ACGAACTGGT GACGCAGGCA AAATCGCGCG GGATTCGTAT CATTCTCGAT ATGGTGTTTA ACCATACCTC TACCCAACAT GCCTGGTTTC GCGAGGCGCT GAACAAAGAA AGCCCTTACC GCCAGTTTTA TATCTGGCGC GATGGCGAAC CAGAAACGCC ACCGAACAAC TGGCGTTCAA AATTTGGCGG TAGTGCGTGG CGCTGGCATG CGGAAAGCGA ACAGTACTAT TTGCATCTCT TTGCACCAGA GCAGGCGGAT CTCAACTGGG AGAATCCAGC GGTACGCGCA GAGCTGAAAA AAGTCTGTGA GTTCTGGGCC GATCGTGGGG TCGACGGGTT GCGCCTGGAT GTGGTGAATC TGATCTCCAA AGACCCACGT TTCCCTGATG ACCTGGATGG CGACGGGCGT CGCTTCTACA CCGACGGGCC ACGAGCACAC GAGTTTTTGC ACGAGATGAA CCGCGATGTG TTTACGCCAC GCGGGTTAAT GACCGTAGGT GAAATGTCCT CCACCAGCCT TGAGCATTGC CAGCGATACG CGGCACTGAC AGGCAGTGAA TTGTCGATGA CCTTTAATTT TCATCACCTG AAGGTCGATT ATCCCGGTGG TGAAAAATGG ACGCTGGCTA AACCTGACTT TGTGGCGTTG AAAACATTGT TCCGCCACTG GCAACAAGGA ATGCACAACG TGGCATGGAA TGCCTTGTTC TGGTGTAACC ACGATCAGCC GCGCATTATT TCTCGCTTTG GTGATGAAGG TGAATACCGC GTGCCTGCGG CAAAAATGCT GGCGATGGTG CTGCATGGCA TGCAGGGAAC GCCGTATATC TACCAGGGCG AAGAGATTGG CATGACCAAC CCGCATTTCA CGCGCATTAC TGACTATCGC GACGTGGAGA GCCTCAATAT GTTTGCCGAG CTGCGCAACG ATGGTCGTGA TGCCGACGAG TTATTGGCAA TCCTCGCCAG TAAATCCCGT GACAACAGCC GCACGCCCAT GCAATGGAGC AACGGCGATA ATGCCGGGTT TACGGCTGGC GAACCGTGGA TTGGCCTGGG CGATAACTAT CAACAAATCA ACGTAGAAGC CGCGCTGGCC GATGAGTCCT CGGTGTTTTA CACCTACCAA AAGTTAATCG CACTGCGTAA GCAAGAAGCC ATCCTGACAT GGGGCAATTA CCAGGATCTG CTGCCAAACA GCCCTGTATT GTGGTGCTAT CGCCGTGAAT GGAAGGGGCA AACCTTGCTG GTCATTGCCA ACCTTAGCCG TGAGATCCAA CCCTGGCAGC CAGGGCAAAT GCGCGGCAAC TGGCAGCTTG TGATGCATAA CTACGAAGAA GCCTCACCAC AACCCTGTGC CATGAATTTA CGGCCTTTTG AGGCTGTCTG GTGGTTACAG AAGTAA
|
Protein sequence | MTNLPHWWQN GVIYQIYPKS FQDTTGSGTG DLRGVIQRLG YLHKLGVDAI WLTPFYVSPQ VDNGYDVANY TAIDPTYGTL DDFDELVTQA KSRGIRIILD MVFNHTSTQH AWFREALNKE SPYRQFYIWR DGEPETPPNN WRSKFGGSAW RWHAESEQYY LHLFAPEQAD LNWENPAVRA ELKKVCEFWA DRGVDGLRLD VVNLISKDPR FPDDLDGDGR RFYTDGPRAH EFLHEMNRDV FTPRGLMTVG EMSSTSLEHC QRYAALTGSE LSMTFNFHHL KVDYPGGEKW TLAKPDFVAL KTLFRHWQQG MHNVAWNALF WCNHDQPRII SRFGDEGEYR VPAAKMLAMV LHGMQGTPYI YQGEEIGMTN PHFTRITDYR DVESLNMFAE LRNDGRDADE LLAILASKSR DNSRTPMQWS NGDNAGFTAG EPWIGLGDNY QQINVEAALA DESSVFYTYQ KLIALRKQEA ILTWGNYQDL LPNSPVLWCY RREWKGQTLL VIANLSREIQ PWQPGQMRGN WQLVMHNYEE ASPQPCAMNL RPFEAVWWLQ K
|
| |