Gene Information |
Locus tag | Cthe_0190 |
Symbol | |
ID | 4808606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 228556 |
End bp | 230358 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105601 |
Product | proteinase inhibitor I4, serpin |
Protein accession | YP_001036624 |
Protein GI | 125972714 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
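The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cthe_0190
length = gene_length_bp(228556, 230358)
print(length)                     # 1803
print(protein_length_aa(length))  # 600
```

Under these assumptions the results match the listed Gene Length (1803 bp) and Protein Length (600 aa).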
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.927493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAGA AATTAATTTG TTTCCTGGTT TTTGCATTGC TGTCTTCTTT TTTGTTTGTC GGCAACTCAT TTGCGGACGG AAACTGGAAA ACTTATTACG AGTTATATCC TGATCCGGTT GTGACGGATA CCAGTGCTGT CATTAGATTT AAGATTATCA AAGTAAAAAT TGACGACTAT ATAACAATGC CGGGATATGT GTACGATGTT GAATACTGGA AAACTGATGA ACCTTCAAAC GTTATGCATA GCTACAACAT CTTATGGGGG GAATCCAGCA AGCAGATCTC AGAAGGGATT AGTGAAGTAG TGTATAGGAC TGAGCTAACC GGTTTGGAGC CGAACACAGA GTATACATAT AAGATTTATG GACAAATGCC TATTCGGAAT AAAGAAGGTA CTCCTGAAAC TATCACATTT AAGACCCTTC CTAAAAAACT TGTTTATGGT GAGTTGAATG GAGACGGAAA AATAAACAGC TCGGATCTTA ACATGATGAA ACGTTATTTA CTCCGTTTGA TAGACGGATT AAATGACACT GCTTGTGCGG ATTTGAACGG AGATGGGAAA ATAAACAGTT CTGACTATAG TATTTTAAAA AGGTATCTTC TTCGTATGAT TGACAAATTT CCTGTAGAAA AGGAAGAAAA AAATGAAGGT GTAAGTGAAA ACTTTATTAG AGGAAACAGC AATTTTGCAT TTAACATTTT CAAAGAAATT AATAAAGATG AACAGGGCAA AAATGTGTTC ATTTCTCCTT TTGGCATTTC AACTGCACTT TCCATGGTAT ATCAGGGGGC AAAATCCGAT ACCAGGGAAG AAATGGCGAA GGTTTTGGGC TATGAAGGAC TTGATATTGA AGAGGTTAAC AAAAGCTACA AGTATTTATT GCAATATTTT AATGGTCTTG ATAATGATAC AAAAATTAAA AGCAGCAATT CAATTTGGAT GAATTCTTTA CACGGCAATG CTATTAAAGA AGATTTCATA TCCACAAACA AGGATGTGTT TGATGCGTTG GCTGAGACCC GCGACTTTTC TGATAAAGGT GTTGTGGATG AGATAAATGA TTGGATCAGC AAGGCTACAG AAGGTCAAAT AGACAAGATG CTCAGCGAAA TTGATATGGA TATGCTGGCG TATATTATAT CTGCATTGTA TTTTAAAGGT ACATGGACGG AAGAGTTTGA TATTGAGAAA ACGGTTAGTG TGCCTTTTGC ATCTGAGGAC GGCGGCGCTG ACCATGTTAT GATGATGAGA AAGGAACTCT GCACTATAGA ATTCGGTGAA GGTGATGGAT ACAAGGCTGT AAGATTGCCA TATGGTGATG GTGAAATGGC CATGTATTGT ATTCTTCCCG ATGAAGATAC ATCGATAAAT GATTTTATTC AGAAGTTGGA TTTGTCCATG TGGGAGAAAA TTAAAAACAG TATAACTAAA AGAGAAAACG GAACGATTTA TTTACCTCGC TTTAAAATGG AGTATGCCAA GGGCGAAAGC GGCAGTATAA TGGAAAGTTT GAAGGCTTTA GGTATGAAAA AGGCGTTTGA GGAAGACGCT GACTTGTCCG GTATGACTGA AGCCGACGCA TTTATTAGTG ATGTTTTGCA TAAGGCAGTA GTGGAAGTAA ATGAAAAAGG AACGGAAGCT TCCGGAGTGG TTGTAATACC TATAGCTCCG ACGAGTATAG CACCCGGACC CAAGTTTATT GCAAACAGAC CGTTTGCATT CGTAATTGCG GATGAAAAAT ATGACACAAT ACTCTTTATG GGTAAATTAT GTGACGGCGG GTTGATTAAT TAA
|
Protein sequence | MQKKLICFLV FALLSSFLFV GNSFADGNWK TYYELYPDPV VTDTSAVIRF KIIKVKIDDY ITMPGYVYDV EYWKTDEPSN VMHSYNILWG ESSKQISEGI SEVVYRTELT GLEPNTEYTY KIYGQMPIRN KEGTPETITF KTLPKKLVYG ELNGDGKINS SDLNMMKRYL LRLIDGLNDT ACADLNGDGK INSSDYSILK RYLLRMIDKF PVEKEEKNEG VSENFIRGNS NFAFNIFKEI NKDEQGKNVF ISPFGISTAL SMVYQGAKSD TREEMAKVLG YEGLDIEEVN KSYKYLLQYF NGLDNDTKIK SSNSIWMNSL HGNAIKEDFI STNKDVFDAL AETRDFSDKG VVDEINDWIS KATEGQIDKM LSEIDMDMLA YIISALYFKG TWTEEFDIEK TVSVPFASED GGADHVMMMR KELCTIEFGE GDGYKAVRLP YGDGEMAMYC ILPDEDTSIN DFIQKLDLSM WEKIKNSITK RENGTIYLPR FKMEYAKGES GSIMESLKAL GMKKAFEEDA DLSGMTEADA FISDVLHKAV VEVNEKGTEA SGVVVIPIAP TSIAPGPKFI ANRPFAFVIA DEKYDTILFM GKLCDGGLIN
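The relationship between the Gene sequence and Protein sequence fields can be sketched as follows. This is a minimal, illustrative example rather than the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built from the standard genetic code, whose codon-to-amino-acid assignments are shared by translation table 11 (the two tables differ in which codons may act as start codons). Only the first 30 bases of the record are used here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses
# the same amino-acid assignments for all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence for Cthe_0190
prefix = "ATGCAAAAGA AATTAATTTG TTTCCTGGTT"
print(translate(prefix))  # MQKKLICFLV
```

The output matches the first ten residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field, assuming that field was computed over the same span.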