Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3845 |
Symbol | |
ID | 6142610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3921007 |
End bp | 3922566 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641618671 |
Product | hypothetical protein |
Protein accession | YP_001745811 |
Protein GI | 170682494 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.725838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCCTG TATTCTCTAT CGGTATCTCA TCATTATGGG ATGAGCTGCG ACATATGCCA GCAGGCGGCG TCTGGTGGTT TAACGTCGAT CGCCATGAAG ATGCTATCAG TCTGGCGAAT CAAACAATTG CATCCCAGGC TGAAACCGCA CACGTCGCGG TCATTAGCAT GGACAGCGAT CCAGCGAAAA TCTTTCAATT AGATGATTCT CAAGGGCCGG GAAAAATAAC ATTATTTTCA ATGCTAAATC ATGAAAAAGG TCTATACTAT TTGGCCCGTG ATTTGCAGTG TTCTATTGAT CCCCATAATT ACCTTTTTAT TCTTGTTTGC GCAAATAACG CATGGCAAAA CATTCCTGCC GAGCGGCTTC GCTCATGGTT GGATAAAATG AATAAATGGA GCCGGCTAAA CCATTGTTCG CTTTTGGTAA TTAATTCCGG AAATAATAAC GATAAACAAT TTTCATTGTT ACTTGAGGAA TACCGTTCAC TTTTTGGTCT TGCCAGTTTG CGTTTTCAGG GCGACCAACA TTTGCTGGAT ATTGCTTTCT GGTGCAACGA AAAAGGGGTC AGCGCCCGTC AGCAGCTTAG CGTTCAGCAA CAAAATGGTT GCTGGACATT AGTTCAACAC CAAGAGGCGG AAATCCAACC ACGCAGCGAC GAAAAACGCA TTCTGAGTAA TGTTTCTGTA CTTGAAGGTG CGCCGCCGCT ATCGGAACAC TGGCAACTGT TCAACAATAA CGAAGTCCTG TTTAATGAAG CCCGTACCGC TCAGGCGGCG ACGGTGGTCT TTTCTTTACA ACAAAATGCG CAAATCGAGC CACTGGCCCG CAGCATTCAT ACTCTGCGTC GCCAGCGCGG TAGTGCGATG AAAATCCTCG TACGGGAAAA TACCGCTAGC CTGCGCGCCA CCGATGAACG TTTGTTATTG GCCTGCGGTG CAAATATGGT TATCCCATGG AATGCCCCAC TCTCCCGCTG TCTGACGATG ATCGAAAGCG TGCAAGGGCA GAAGTTTAGT CGCTATGTGC CGGAAGATAT CACTACCTTG CTGTCAATGA CCCAGCCGCT CAAACTGCGT GGTTTCCAGA AGTGGGATGT GTTCTGTAAT GCCGTCAACA ACATGATGAA TAACCCTCTA TTACCTGCCC ACGGTAAAGG CGTTCTGGTT GCCCTACGTC CGGTACCGGG TATCCGCGTT GAGCAAGCCC TGACGCTATG TCGCCCTAAT CGCACTGGCG ATATCATGAC CATTGGCGGT AATCGGCTGG TGCTGTTTCT CTCATTCTGT CGGATTAACG ATCTGGATAC CGCGTTGAAT CATATTTTCC CATTGCCGAC TGGCGACATT TTCTCAAACC GTATGGTCTG GTTTGAAGAT GATCAAATCA GTGCCGAGCT GGTGCAGATG CGCCTGCTTG CCCCAGAACA ATGGGGCATG CCGCTGCCTT TAACGCAAAG TTCTAAACCG GTCATCAATG CCGAGCACGA TGGTCGCCAC TGGCGACGAA TACCAGAACC AATGCGACTG TTAGATGATG CTGTGGAGCG CTCATCATGA
|
Protein sequence | MDPVFSIGIS SLWDELRHMP AGGVWWFNVD RHEDAISLAN QTIASQAETA HVAVISMDSD PAKIFQLDDS QGPGKITLFS MLNHEKGLYY LARDLQCSID PHNYLFILVC ANNAWQNIPA ERLRSWLDKM NKWSRLNHCS LLVINSGNNN DKQFSLLLEE YRSLFGLASL RFQGDQHLLD IAFWCNEKGV SARQQLSVQQ QNGCWTLVQH QEAEIQPRSD EKRILSNVSV LEGAPPLSEH WQLFNNNEVL FNEARTAQAA TVVFSLQQNA QIEPLARSIH TLRRQRGSAM KILVRENTAS LRATDERLLL ACGANMVIPW NAPLSRCLTM IESVQGQKFS RYVPEDITTL LSMTQPLKLR GFQKWDVFCN AVNNMMNNPL LPAHGKGVLV ALRPVPGIRV EQALTLCRPN RTGDIMTIGG NRLVLFLSFC RINDLDTALN HIFPLPTGDI FSNRMVWFED DQISAELVQM RLLAPEQWGM PLPLTQSSKP VINAEHDGRH WRRIPEPMRL LDDAVERSS
|
| |