Gene Information |
Locus tag | EcE24377A_4024 |
Symbol | |
ID | 5586001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4008387 |
End bp | 4009892 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640927645 |
Product | hypothetical protein |
Protein accession | YP_001465006 |
Protein GI | 157156797 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
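The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the final codon of the CDS is a stop codon; neither is stated in the record, but both are consistent with the listed lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4008387
end_bp = 4009892
protein_length_aa = 501

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1506
print(remainder)            # 0
print(expected_protein_aa)  # 501
```

Under these assumptions the computed values agree with the Gene Length (1506 bp) and Protein Length (501 aa) fields.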
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCAG GCGGCGTCTG GTGGTTTAAC GTCGATCGCC ATGAAGATGC TATCAGTCTG GCGAATCAAA CAATTGCATC CCAGGCTGCA ACCGCACACG TCGCGGTCAT TAGCATGGAC AGCGATCCGG CGAAAATCTT TCAATTAGAT GATTCTCAAG GGCCGGAAAA AATAAAATTA TTTTCAATGC TAAATCATGA AAAAGGTCTA TACTATTTGA CCCGTGATTT GCAGTGTTCT ATTGATCCCC ATAATTACCT TTTTATTCTT GTTTGCGCAA ATAACGCATG GCAAAACATT CCTGCCGAGC GGCTTCGCTC ATGGTTGGAT AAAATGAATA AATGGAGCAG GTTAAACCAT TGTTCGCTTT TGGTAATTAA TCCCGGAAAT AATAACGATA AACAATTTTC ATTGTTGCTT GAGGAATACC GTTCACTTTT TGGTCTTGCC AGTTTGCGTT TTCAGGGTGA CCAACATTTG CTGGATATTG CCTTCTGGTG CAACGAAAAA GGGGTCAGCG CCCGTCAGCA GCTTAGCGTT CAGCAACAAA ATGGTATCTG GACATTAGTT CAAAGCGAAG AGGCGGAGAT CCAACCACGC AGCGACGAAA AACGCATTCT GAGTAATGTT GCTGTACTGG AAGGTGCGCC GCCGCTATCG GAACACTGGC AACTGTTCAA CAATAACGAA GTCCTGTTCA ATGAAGCCCG TACCGCTCAG GCGGCGACGG TGGTCTTTTC TTTACAGCAA AATGCGCAAA TCGAGCCACT GGCCCGCAGC ATTCATACCC TGCGTCGCCA GCGCGGTAGT GCGATGAAAA TCCTCGTGCG GGAAAATACC GCTAGCCTGC GCGCCACCGA TGAACGTTTG TTATTGGCCT GCGGTGCAAA TATGGTTATT CCGTGGAATG CGCCACTCTC CCGTTGTCTG ACGATGATCG AAAGCGTGCA AGGGCAGAAG TTTAGTCGCT ATGTGCCGGA AGATATCACT ACCTTGCTGT CAATGACCCA GCCGCTCAAA CTGCGTGGTT TCCAGAAGTG GGATGTGTTC TGTAATGCCG TCAACAACAT GATGAATAAC CCTCTCTTGC CTGCCCACGG TAAAGGCGTT CTGGTTGCCC TACGTCCGGT ACCGGGTATC CGCGTTGAAC AAGCCCTGAC GCTGTGTCGC CCTAACCGTA CCGGCGATAT CATGACCATT GGCGGTAATC GGCTGGTGCT GTTTCTCTCA TTCTGTCGGA TTAACGATCT GGATACCGCG TTGAATCATA TTTTCCCATT GCCTACTGGC GACATTTTCT CAAACCGTAT GGTCTGGTTT GAAGATGATC AAATCAGTGC CGAGCTGGTG CAGATGCGCT TGCTTGCCCC AGAACAATGG GGCATGCCGC TGCCTTTAAC GCAAAGTTCT AAACCGGTCA TCAATGCCGA GCACGATGGT CGCCACTGGC GACGAATACC AGAACCCATG CGACTGTTAG ATGATGCTGT GGAGCGCTCA TCATGA
|
Protein sequence | MPAGGVWWFN VDRHEDAISL ANQTIASQAA TAHVAVISMD SDPAKIFQLD DSQGPEKIKL FSMLNHEKGL YYLTRDLQCS IDPHNYLFIL VCANNAWQNI PAERLRSWLD KMNKWSRLNH CSLLVINPGN NNDKQFSLLL EEYRSLFGLA SLRFQGDQHL LDIAFWCNEK GVSARQQLSV QQQNGIWTLV QSEEAEIQPR SDEKRILSNV AVLEGAPPLS EHWQLFNNNE VLFNEARTAQ AATVVFSLQQ NAQIEPLARS IHTLRRQRGS AMKILVRENT ASLRATDERL LLACGANMVI PWNAPLSRCL TMIESVQGQK FSRYVPEDIT TLLSMTQPLK LRGFQKWDVF CNAVNNMMNN PLLPAHGKGV LVALRPVPGI RVEQALTLCR PNRTGDIMTI GGNRLVLFLS FCRINDLDTA LNHIFPLPTG DIFSNRMVWF EDDQISAELV QMRLLAPEQW GMPLPLTQSS KPVINAEHDG RHWRRIPEPM RLLDDAVERS S
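The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI amino-acid ordering, whose assignments are the same for translation table 11 and the standard code (the two differ only in permitted start codons, which this sketch ignores). The function name `translate` and the short test fragments are illustrative; the fragments are the first 30 and last 21 bases of the gene sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCAGCAG GCGGCGTCTG GTGGTTTAAC"))  # MPAGGVWWFN
# Last 21 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GCTGTGGAGCGCTCATCATGA"))  # AVERSS
```

Both fragments match the corresponding ends of the protein sequence listed in the record.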
|
| |