Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1270 |
Symbol | |
ID | 4809775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1544378 |
End bp | 1545601 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106693 |
Product | proteinase inhibitor I4, serpin |
Protein accession | YP_001037695 |
Protein GI | 125973785 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA CAGCTTGCGC AGTTTTATGC ATATTTGTTT TAACATTTTT ATTTTCCGGG TGTTCAAGGG AAAAAAAGGT TTTTGACGAA TTAAAACTTG ACACCGAACT TGAAAAGAAA AATACAGAAT TTTGCTTTGA CATATTCTCA AAGCTGAACG AAGAGGACTA TAATAAAAAC ATTTTTATTT CTCCCTTGAG TATTTCAACC GTACTCTCCA TGACCGTGCA AGGTGCCGGA ACCACAACAA AAGACGGTAT GCTGAAAGCT TTGAAATATG ACGGTATGGA TCTCGATAAA ATTAACGAAT CCTACAGATA TATACTCGAC TATTTAAGCA AAACTGATAA AACAATTGAA CTTGAGATTA ATAATTCCAT CTGGATAAGG GAAGGAAAGC AAATCAAAAA AGATTTTATT GACATTAACA AAGATGTATA CAATGCATAC GTTACCGAGC TTGACTTTTC AAGCCCAAAT GCAGCAGACA GGATTAACAA ATGGATTTCC GACTCAACGA AGAAAAAAAT CACAGACATA ATTGATTCAC CAATACCTGA AAATACTGCA ATGTTTCTTA TAAACGCCAT TTACTTCAAG GGAGACTGGG CGGAAAAGTT TAAAAAACAA GATACGTTCA CCGCCAAGTT CCAATCGGGC AACGGCCAAA CAAAAGAAGT TATGATGATG GAAAGAAAAG ATACAATAGA ATACGGAGCC AAAGAGGATT TCAAGGTTGT AAGGCTTCCT TACGGAAAAG GCACAACATC AATGTATTGT GTTTTGCCTG CCAAAGACGT TTCAATAAAT GATTTCATAA AAACCCTTGA TGTCAATAAA TGGGAAGAGA TAAAAAACAG TATTTCTAAA GCTGAAAACG TAACCTTAAA TATTCCAAGG TTTAAAATAG CTTATGGAAC TAAAGAATTA AGAGACTGTC TTATTGCCAT GGGAATGGAA GAAGCATTCA CCGAGCGGGC TGATTTTTCC GGAATAAGTG AGGGCCTTCT CTTCATAGAC AGTGTAATTC ATAAAGCAAT AATTGAGGTT AATGAGGATG GAAGCACGGC GGCAGGCAGT ACAGTGGTCA GAATGATAGA TGGTGCTGCA ATAGGAGAAC CGCTTTCTTT CATTGCAGAC AGACCGTTTC TGTTTTTCAT AACCGAAGAT GTTACAGGTA CTATACTATT TATGGGCAAA TTGTATGATT GTGAAAAATA TTAA
|
Protein sequence | MKRTACAVLC IFVLTFLFSG CSREKKVFDE LKLDTELEKK NTEFCFDIFS KLNEEDYNKN IFISPLSIST VLSMTVQGAG TTTKDGMLKA LKYDGMDLDK INESYRYILD YLSKTDKTIE LEINNSIWIR EGKQIKKDFI DINKDVYNAY VTELDFSSPN AADRINKWIS DSTKKKITDI IDSPIPENTA MFLINAIYFK GDWAEKFKKQ DTFTAKFQSG NGQTKEVMMM ERKDTIEYGA KEDFKVVRLP YGKGTTSMYC VLPAKDVSIN DFIKTLDVNK WEEIKNSISK AENVTLNIPR FKIAYGTKEL RDCLIAMGME EAFTERADFS GISEGLLFID SVIHKAIIEV NEDGSTAAGS TVVRMIDGAA IGEPLSFIAD RPFLFFITED VTGTILFMGK LYDCEKY
|
| |