Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1866 |
Symbol | |
ID | 6066463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2065997 |
End bp | 2067853 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641601279 |
Product | protease 4 |
Protein accession | YP_001724841 |
Protein GI | 170019887 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.235233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000640053 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTCC TCGTACTGGT TGGTGTGGGG ATTTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG CGCACTGCTG CTGGACATTT CTGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA AGCCGCCAGC TGCTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC GTCAACACTA TTCGCCAGGC GAAGGACGAC CGCAATATCA CCGGTATTGT GATGGATCTG AAAAACTTCG CAGGCGGCGA CCAACCGTCT ATGCAGTACA TCGGCAAAGC TCTGAAAGAG TTTCGTGACA GCGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCTCCGC AAGGCGTGGT TGATCTGCAC GGCTTTGCCA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGATATG TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG AATACTGTTG CCGCTAACCG GCAGATCCCT GCTGAGCAGG TATTCCCTGG CGCGCAAGGG TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG CTGGTCGATG CACTGGCATC GAGTGCGGAA ATCGAAAAAG CACTGACCAA AGAATTCGGC TGGAGTAAGA CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTTGCTA ATGGCGCAAT TATGGATGGC GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCCTGTGGTT GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATCGGTA TCTTCGGCGT GATCACCACC GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACTGATG GTGTCTCAAC TTCACCGCTG GCGGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCGC AGCAGATGAT GCAATTAAGC ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG GAGCAAATTG ATAAAATCGC CCAGGGCCAC GTCTGGACCG GTCAGGATGC AAAAGCTAAC GGGCTGGTCG ATAGTCTCGG GGATTTCGAT GATGCGGTTG CCAAAGCAGC AGAGCTCGCA AAAGTGAAAC AGTGGCATCT GGAATACTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA CCTGCACCGC TTGCCTCGGT AGCCTCTACC GTTAAAAGTG AAAGCGACAA GCTGGCCGCG TTTAACGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACGT GCGTTAA
|
Protein sequence | MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL NTVAANRQIP AEQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG WSKTDKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA KVKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA FNDPQNRYAF CLTCANVR
|
| |