Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4026 |
Symbol | |
ID | 5589700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4010077 |
End bp | 4011756 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640927647 |
Product | hypothetical protein |
Protein accession | YP_001465008 |
Protein GI | 157157989 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAT TTACACAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC CTTTCCGGCT GGAACTTCTA TTTTCTGGTT AAGTTCGGCC TGTTGTGGGC GGGATATCTT AACTTCCATC CGCTCCTCAA TTTGGTGTTT GCCGCGTTTC TGCTGATGCC CATTCCGCGC TACAGCCTGC ATCGCTTGCG CCACTGGATT GCCCTGCCGA TCGGCTTTGC TTTGTTCTGG CATGACACCT GGTTGCCTGG CCCGGAAAGC ATAATGAGCC AGGGTTCGCA GGTGGCGGGG TTCAGTACCG ATTATTTAAT CGACCTTGTC ACACGCTTTA TTAACTGGCA GATGATTGGG GCCATTTTTG TTTTATTAGT GGCCTGGTTA TTCCTGTCAC AATGGATTCG CATTACCGTT TTTGTGGTTG CCATACTGCT ATGGCTGAAC GTACTTACCC TGGCGGGACC AAGTTTCTCC TTGTGGCCAG CCGGACAACC GACGACCACT GTAACAACGA CGGGTGGTAA CGCAGCGGCA ACCGTTGCGG CGACGGGTGG CGCACCGGTA GTGGGTGATA TGCCCGCACA AACTGCACCG CCAACAACGG CGAACCTTAA CGCCTGGCTG AATAATTTCT ATAACGCGGA GGCGAAACGT AAATCGACCT TCCCGTCTTC GCTGCCCGCT GATGCTCAGC CATTTGAACT ACTGGTGATT AACATCTGTT CGCTTTCCTG GTCGGATATA GAAGCCGCCG GGTTGATGTC GCATCCACTG TGGTCGCATT TCGATATTGA GTTCAAGAAC TTTAACTCCG CCACCTCCTA CAGTGGCCCG GCGGCGATCC GTTTACTGCG CGCCAGCTGC GGGCAGACTT CGCACACTAA TCTGTATCAA CCGGCAAATA ACGACTGCTA TCTGTTTGAT AACCTTTCGA AACTGGGCTT TACCCAGCAC CTGATGATGG GGCATAACGG CCAGTTCGGC GGTTTTTTGA AAGAAGTTCG CGAAAATGGC GGCATGCAGA CTGAATTGAT GGATCAAACA AATCTGCCGG TTATTTTGCT GGGCTTTGAT GGTTCGCCGG TTTATGACGA TACCGCCGTG CTTAACCGCT GGCTGGACGT TACCGAAAAA GATAAAAATA GCCGTAGTGC CACGTTCTAC AACACGCTTC CACTGCATGA CGGCAACCAT TATCCGGGGG TCAGCAAAAC AGCGGATTAC AAAGCGCGGG CGCAGAAATT CTTTGATGAA CTGGACGCCT TCTTTACTGA ACTGGAGAAA TCGGGTCGTA AAGTGATGGT GGTCGTGGTG CCGGAACACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGG TATCTGGCCT ACGTGATATC CCTAGCCCGT CTATCACCGA CGTCCCCGTT GGGGTGAAAT TCTTCGGCAT GAAGGCACCA CATCAGGGGG CACCGATTGT CATTGACCAA CCGAGCAGCT TCCTGGCTAT CTCCGATCTG GTGGTTCGCG TTCTTGATGG CAAGATTTTC ACCGAAGACA ATGTTGACTG GAAAAAACTC ACCAGTGGGT TGCCACAAAC AGCACCGGTC TCCGAGAACT CAAATGCAGT AGTTATTCAA TACCAGGATA AACCGTACGT TCGCCTGAAC GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
|
Protein sequence | MTQFTQNTAM PSSLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF AAFLLMPIPR YSLHRLRHWI ALPIGFALFW HDTWLPGPES IMSQGSQVAG FSTDYLIDLV TRFINWQMIG AIFVLLVAWL FLSQWIRITV FVVAILLWLN VLTLAGPSFS LWPAGQPTTT VTTTGGNAAA TVAATGGAPV VGDMPAQTAP PTTANLNAWL NNFYNAEAKR KSTFPSSLPA DAQPFELLVI NICSLSWSDI EAAGLMSHPL WSHFDIEFKN FNSATSYSGP AAIRLLRASC GQTSHTNLYQ PANNDCYLFD NLSKLGFTQH LMMGHNGQFG GFLKEVRENG GMQTELMDQT NLPVILLGFD GSPVYDDTAV LNRWLDVTEK DKNSRSATFY NTLPLHDGNH YPGVSKTADY KARAQKFFDE LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITDVPV GVKFFGMKAP HQGAPIVIDQ PSSFLAISDL VVRVLDGKIF TEDNVDWKKL TSGLPQTAPV SENSNAVVIQ YQDKPYVRLN GGDWVPYPQ
|
| |