Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0180 |
Symbol | |
ID | 6064809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 196625 |
End bp | 198184 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641599582 |
Product | putative protease |
Protein accession | YP_001723189 |
Protein GI | 170018235 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCCTG TATTCTCTAT CGGTATCTCA TCATTATGGG ATGAGCTGCG ACATATGCCA GCAGGCGGCG TCTGGTGGTT TAACGTCGAT CGCCATGAAG ATGCTATCAG TCTGGCGAAT CAAACAATTG CATCCCAGGC TGAAACCGCA CACGTCGCGG TCATTAGCAT GGACAGCGAT CCGGCGAAAA TCTTTCAATT AGATGATTCT CAAGGGCCGG AAAAAATAAA ATTATTTTCA ATGCTAAATC ATGAAAAAGG TCTATACTAT TTGACCCGTG ATTTGCAGTG ATCTATTGAT CCCCATAATT ACCTTTTTAT TCTTGTTTGC GCAAATAACG CATGGCAAAA CATTCCTGCC GAGCGGCTTC GCTCATGGTT GGATAAAATG AATAAATGGA GCAGGTTAAA CCATTGTTCG CTTTTGGTAA TTAATCCCGG AAATAATAAC GATAAACAAT TTTCATTGTT GCTTGAGGAA TACCGTTCAC TTTTTGGTCT TGCCAGTTTG CGTTTTCAGG GTGACCAACA TTTGCTGGAT ATTGCCTTCT GGTGCAACGA AAAAGGGGTC AGCGCCCGTC AGCAGCTTAG CGTTCAGCAA CAAAATGGTA TCTGGACATT AGTTCAAAGC GAAGAGGCGG AGATCCAACC ACGCAGCGAC GAAAAACGCA TTCTGAGTAA TGTTGCTGTA CTGGAAGGTG CGCCGCCGCT ATCGGAACAC TGGCAACTGT TCAACAATAA CGAAGTCCTG TTCAATGAAG CCCGTACCGC TCAGGCGGCG ACGGTGGTCT TTTCTTTACA GCAAAATGCG CAAATCGAGC CACTGGCCCG CAGCATTCAT ACCCTGCGTC GCCAGCGCGG TAGTGCGATG AAAATCCTCG TGCGGGAAAA TACCGCTAGC CTGCGCGCCA CCGATGAACG TTTGTTATTG GCCTGCGGTG CAAATATGGT TATTCCGTGG AATGCGCCAC TCTCCCGTTG TCTGACGATG ATCGAAAGCG TGCAAGGGCA GAAGTTTAGT CGCTATGTGC CGGAAGATAT CACTACCTTG CTGTCAATGA CCCAGCCGCT CAAACTGCGT GGTTTCCAGA AGTGGGATGT GTTCTGTAAT GCCGTCAACA ACATGATGAA TAACCCTCTA TTACCTGCCC ACGGTAAAGG CGTTCTGGTT GCCCTACGTC CGGTACCGGG TATCCGCGTT GAACAAGCCC TGACGCTGTG TCGCCCTAAC CGTACCGGCG ATATCATGAC CATTGGCGGT AATCGGCTGG TGCTGTTTCT CTCATTCTGT CGGATTAACG ATCTGGATAC CGCGTTGAAT CATATTTTCC CATTGCCTAC TGGCGACATT TTCTCAAACC GTATGGTCTG GTTTGAAGAT GATCAAATCA GTGCCGAGCT GGTGCAGATG CGCTTGCTTG CCCCAGAACA ATGGGGCATG CCGCTGCCTT TAACGCAAAG TTCTAAACCG GTCATCAATG CCGAGCACGA TGGTCGCCAC TGGCGACGAA TACCAGAACC CATGCGACTG TTAGATGATG CTGTGGAGCG CTCATCATGA
|
Protein sequence | MDPVFSIGIS SLWDELRHMP AGGVWWFNVD RHEDAISLAN QTIASQAETA HVAVISMDSD PAKIFQLDDS QGPEKIKLFS MLNHEKGLYY LTRDLQUSID PHNYLFILVC ANNAWQNIPA ERLRSWLDKM NKWSRLNHCS LLVINPGNNN DKQFSLLLEE YRSLFGLASL RFQGDQHLLD IAFWCNEKGV SARQQLSVQQ QNGIWTLVQS EEAEIQPRSD EKRILSNVAV LEGAPPLSEH WQLFNNNEVL FNEARTAQAA TVVFSLQQNA QIEPLARSIH TLRRQRGSAM KILVRENTAS LRATDERLLL ACGANMVIPW NAPLSRCLTM IESVQGQKFS RYVPEDITTL LSMTQPLKLR GFQKWDVFCN AVNNMMNNPL LPAHGKGVLV ALRPVPGIRV EQALTLCRPN RTGDIMTIGG NRLVLFLSFC RINDLDTALN HIFPLPTGDI FSNRMVWFED DQISAELVQM RLLAPEQWGM PLPLTQSSKP VINAEHDGRH WRRIPEPMRL LDDAVERSS
|
| |