Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0191 |
Symbol | |
ID | 4808607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 230789 |
End bp | 232588 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640105602 |
Product | proteinase inhibitor I4, serpin |
Protein accession | YP_001036625 |
Protein GI | 125972715 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTTCCTG GTTTTGCATT GCTGTCTTCT TTTTTGCTTG TCGGCAACTC ATTTGCCGCA AATTGGTACA CTTATTACGA GTTATATCCT GATCCGGTTG TGACGGATAC CAGTGCTGTC ATTAGATTTA AGCTTACCAC GGAAAAAATT GACGATTATA TAACAATGCC GGGATATATT TATGAGTTTA GGTATTGGAA GGTTGATGAA CCTTCAAAAG TCAAGAGTGT AAGTATAGGA TTGTATGTTA ATGAAGTTCA AACAGTAGAG CTCACTGATT TGGAGCCGAA CACAGAGTAT GAATGTAAGG TTTGGGGACA AATTTATACT TCGAATAAAG AAGGTACTCC TAAAACTATC ACATTTAAGA CCCTTCCTAA AAAAGGTGAC CTGAACGGAG ACGCAAAAAT AAACAGCACA GACCTTAACA TGATGAAACG TTATCTGCTT CAAATGATAG ATAGATTTGG TGTAGATGAC GAGTCTTGTG CAGATTTGAA TGGCGATGGA AAAATAACCA GCTCAGACTA TAACTTGTTA AAAAGATATA TTCTTCATTT GATAGACAAA TTTCCAATAG GAAATGATGA AACAGATGAA GGTATAAATG ATGGCTTTAA TGATGAAACA GATGAAGATA TAAATGATAG CTTTATTGAA GCAAACAGCA AATTTGCATT TGATATTTTC AAACAGATAA GTAAAGACGA GCAGGGTAAA AATGTGTTCA TTTCACCTTT CGGAATTTCA ATGGCGTTGT CTATGGTTTA TCAGGGAGCG GAATCTGATA CCAGGGAAGA AATGGCAAAG GTTTTGGGCT ATGAAGGACT TGATATTGAA GAGGTTAACA AGAGTTATAA GTTATTATTG AAATATTTCA ATGAACTTAT TGGGAATGTT AAGCTTAAAA ACAGCAATTC AATTTGGAAA AATTCTTTAA AGGGCGATGT TATTAAAGAA GATTTTATAT CCGTTAATAA GGATGTTTTT AATGCATTGG TTGAGACAAG GGATTTTTCC GATGAAAGTG TTGTAGATAA AATAAACAAC TGGATTTCTG ATGCCACAGA AGGGAAAGTA AAGAAAGCGC TCAATGCTGT TAATCCGGAT GAATTATTAT ACATTATATC TGCATTATAT TTTAATGGAG CGTGGAAAGA AGAGTTTGAG TTTGATATTA ACGACACAAC CATGAGTACT TTTAAATCCG AAGACGGAAG CACTGACTAT GTTATGATGA TGAGAAAAAG CTATAATAAT TGGGTGGGAG TGATGGAATT TGGTAAGGGT GATGGATACA GCGCCATCAG ATTGCCCTAT GGTAATGGTG AAATGGCTAT GTACTGTATT CTTCCTGACG AAGATATATC AATAAATGAT TTTATTCAGA ATCTGGATGT ATCTCTGTGG AATGAAATTA AAAACAGCAT AAGGAAAACA CTACAGGGAT TGATTTGTTT ACCTCGTTTT AAAATTGAAT ATTTCAAGGA TGGAAACGGC AGCATAAAGG AAAGCTTGAA AGCTTTGGGT TTGGAGAAGG TATTCAGCCT GGCTGAAGCT GACTTGACCG GTATGTCTGA AACTAATGCT TATGTTTCTG ATGTTTTGCA TAAGGCGGTA GTGGAAGTAA ATGAAAAGGG AACAGAGGCA TCCTCCTCAG TAGTTGTAAT ACCGGTTCCG GGTTTTGGAA CGAGATCGGA GTTTATTGCA GACAGGCCAT TTGTATTCAT AATTGCGGAT GAAAAATATG ATACCATACT CTTTATGGGT AAATTAGCAA AAGGTGAGTT GATTAATTAA
|
Protein sequence | MLPGFALLSS FLLVGNSFAA NWYTYYELYP DPVVTDTSAV IRFKLTTEKI DDYITMPGYI YEFRYWKVDE PSKVKSVSIG LYVNEVQTVE LTDLEPNTEY ECKVWGQIYT SNKEGTPKTI TFKTLPKKGD LNGDAKINST DLNMMKRYLL QMIDRFGVDD ESCADLNGDG KITSSDYNLL KRYILHLIDK FPIGNDETDE GINDGFNDET DEDINDSFIE ANSKFAFDIF KQISKDEQGK NVFISPFGIS MALSMVYQGA ESDTREEMAK VLGYEGLDIE EVNKSYKLLL KYFNELIGNV KLKNSNSIWK NSLKGDVIKE DFISVNKDVF NALVETRDFS DESVVDKINN WISDATEGKV KKALNAVNPD ELLYIISALY FNGAWKEEFE FDINDTTMST FKSEDGSTDY VMMMRKSYNN WVGVMEFGKG DGYSAIRLPY GNGEMAMYCI LPDEDISIND FIQNLDVSLW NEIKNSIRKT LQGLICLPRF KIEYFKDGNG SIKESLKALG LEKVFSLAEA DLTGMSETNA YVSDVLHKAV VEVNEKGTEA SSSVVVIPVP GFGTRSEFIA DRPFVFIIAD EKYDTILFMG KLAKGELIN
|
| |