Gene Cthe_0191 details

Gene Information

Locus tag: Cthe_0191
Symbol:
ID: 4808607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 230789
End bp: 232588
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 36%
IMG OID: 640105602
Product: proteinase inhibitor I4, serpin
Protein accession: YP_001036625
Protein GI: 125972715
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4826] Serine protease inhibitor
TIGRFAM ID:
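
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return length_bp // 3 - 1


# Values taken from the Gene Information fields for Cthe_0191.
length = gene_length_bp(230789, 232588)
print(length)                              # 1800
print(expected_protein_length_aa(length))  # 599
```

Under those assumptions, both results agree with the reported Gene Length (1800 bp) and Protein Length (599 aa).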


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTTCCTG GTTTTGCATT GCTGTCTTCT TTTTTGCTTG TCGGCAACTC ATTTGCCGCA 
AATTGGTACA CTTATTACGA GTTATATCCT GATCCGGTTG TGACGGATAC CAGTGCTGTC
ATTAGATTTA AGCTTACCAC GGAAAAAATT GACGATTATA TAACAATGCC GGGATATATT
TATGAGTTTA GGTATTGGAA GGTTGATGAA CCTTCAAAAG TCAAGAGTGT AAGTATAGGA
TTGTATGTTA ATGAAGTTCA AACAGTAGAG CTCACTGATT TGGAGCCGAA CACAGAGTAT
GAATGTAAGG TTTGGGGACA AATTTATACT TCGAATAAAG AAGGTACTCC TAAAACTATC
ACATTTAAGA CCCTTCCTAA AAAAGGTGAC CTGAACGGAG ACGCAAAAAT AAACAGCACA
GACCTTAACA TGATGAAACG TTATCTGCTT CAAATGATAG ATAGATTTGG TGTAGATGAC
GAGTCTTGTG CAGATTTGAA TGGCGATGGA AAAATAACCA GCTCAGACTA TAACTTGTTA
AAAAGATATA TTCTTCATTT GATAGACAAA TTTCCAATAG GAAATGATGA AACAGATGAA
GGTATAAATG ATGGCTTTAA TGATGAAACA GATGAAGATA TAAATGATAG CTTTATTGAA
GCAAACAGCA AATTTGCATT TGATATTTTC AAACAGATAA GTAAAGACGA GCAGGGTAAA
AATGTGTTCA TTTCACCTTT CGGAATTTCA ATGGCGTTGT CTATGGTTTA TCAGGGAGCG
GAATCTGATA CCAGGGAAGA AATGGCAAAG GTTTTGGGCT ATGAAGGACT TGATATTGAA
GAGGTTAACA AGAGTTATAA GTTATTATTG AAATATTTCA ATGAACTTAT TGGGAATGTT
AAGCTTAAAA ACAGCAATTC AATTTGGAAA AATTCTTTAA AGGGCGATGT TATTAAAGAA
GATTTTATAT CCGTTAATAA GGATGTTTTT AATGCATTGG TTGAGACAAG GGATTTTTCC
GATGAAAGTG TTGTAGATAA AATAAACAAC TGGATTTCTG ATGCCACAGA AGGGAAAGTA
AAGAAAGCGC TCAATGCTGT TAATCCGGAT GAATTATTAT ACATTATATC TGCATTATAT
TTTAATGGAG CGTGGAAAGA AGAGTTTGAG TTTGATATTA ACGACACAAC CATGAGTACT
TTTAAATCCG AAGACGGAAG CACTGACTAT GTTATGATGA TGAGAAAAAG CTATAATAAT
TGGGTGGGAG TGATGGAATT TGGTAAGGGT GATGGATACA GCGCCATCAG ATTGCCCTAT
GGTAATGGTG AAATGGCTAT GTACTGTATT CTTCCTGACG AAGATATATC AATAAATGAT
TTTATTCAGA ATCTGGATGT ATCTCTGTGG AATGAAATTA AAAACAGCAT AAGGAAAACA
CTACAGGGAT TGATTTGTTT ACCTCGTTTT AAAATTGAAT ATTTCAAGGA TGGAAACGGC
AGCATAAAGG AAAGCTTGAA AGCTTTGGGT TTGGAGAAGG TATTCAGCCT GGCTGAAGCT
GACTTGACCG GTATGTCTGA AACTAATGCT TATGTTTCTG ATGTTTTGCA TAAGGCGGTA
GTGGAAGTAA ATGAAAAGGG AACAGAGGCA TCCTCCTCAG TAGTTGTAAT ACCGGTTCCG
GGTTTTGGAA CGAGATCGGA GTTTATTGCA GACAGGCCAT TTGTATTCAT AATTGCGGAT
GAAAAATATG ATACCATACT CTTTATGGGT AAATTAGCAA AAGGTGAGTT GATTAATTAA
 
Protein sequence
MLPGFALLSS FLLVGNSFAA NWYTYYELYP DPVVTDTSAV IRFKLTTEKI DDYITMPGYI 
YEFRYWKVDE PSKVKSVSIG LYVNEVQTVE LTDLEPNTEY ECKVWGQIYT SNKEGTPKTI
TFKTLPKKGD LNGDAKINST DLNMMKRYLL QMIDRFGVDD ESCADLNGDG KITSSDYNLL
KRYILHLIDK FPIGNDETDE GINDGFNDET DEDINDSFIE ANSKFAFDIF KQISKDEQGK
NVFISPFGIS MALSMVYQGA ESDTREEMAK VLGYEGLDIE EVNKSYKLLL KYFNELIGNV
KLKNSNSIWK NSLKGDVIKE DFISVNKDVF NALVETRDFS DESVVDKINN WISDATEGKV
KKALNAVNPD ELLYIISALY FNGAWKEEFE FDINDTTMST FKSEDGSTDY VMMMRKSYNN
WVGVMEFGKG DGYSAIRLPY GNGEMAMYCI LPDEDISIND FIQNLDVSLW NEIKNSIRKT
LQGLICLPRF KIEYFKDGNG SIKESLKALG LEKVFSLAEA DLTGMSETNA YVSDVLHKAV
VEVNEKGTEA SSSVVVIPVP GFGTRSEFIA DRPFVFIIAD EKYDTILFMG KLAKGELIN
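
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the tool used by the database. It assumes translation table 11 (the bacterial, archaeal and plant plastid code), which shares the standard codon assignments but accepts alternative start codons such as TTG, and it assumes an initiating codon is read as methionine. The names `translate_cds`, `CODON_TABLE` and `TABLE11_STARTS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in TABLE11_STARTS:
            # An initiating codon is read as methionine, whatever its usual meaning.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence, in the 10-base blocks used above.
first_row = "TTGCTTCCTG GTTTTGCATT GCTGTCTTCT TTTTTGCTTG TCGGCAACTC ATTTGCCGCA"
print(translate_cds(first_row))  # MLPGFALLSSFLLVGNSFAA
```

The output matches the first 20 residues of the protein sequence (MLPGFALLSS FLLVGNSFAA). The leading TTG codon, which normally encodes leucine, is translated as M because it is the start codon.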