Gene Cthe_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0190 
Symbol 
ID4808606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp228556 
End bp230358 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content38% 
IMG OID640105601 
Productproteinase inhibitor I4, serpin 
Protein accessionYP_001036624 
Protein GI125972714 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4826] Serine protease inhibitor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.927493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGA AATTAATTTG TTTCCTGGTT TTTGCATTGC TGTCTTCTTT TTTGTTTGTC 
GGCAACTCAT TTGCGGACGG AAACTGGAAA ACTTATTACG AGTTATATCC TGATCCGGTT
GTGACGGATA CCAGTGCTGT CATTAGATTT AAGATTATCA AAGTAAAAAT TGACGACTAT
ATAACAATGC CGGGATATGT GTACGATGTT GAATACTGGA AAACTGATGA ACCTTCAAAC
GTTATGCATA GCTACAACAT CTTATGGGGG GAATCCAGCA AGCAGATCTC AGAAGGGATT
AGTGAAGTAG TGTATAGGAC TGAGCTAACC GGTTTGGAGC CGAACACAGA GTATACATAT
AAGATTTATG GACAAATGCC TATTCGGAAT AAAGAAGGTA CTCCTGAAAC TATCACATTT
AAGACCCTTC CTAAAAAACT TGTTTATGGT GAGTTGAATG GAGACGGAAA AATAAACAGC
TCGGATCTTA ACATGATGAA ACGTTATTTA CTCCGTTTGA TAGACGGATT AAATGACACT
GCTTGTGCGG ATTTGAACGG AGATGGGAAA ATAAACAGTT CTGACTATAG TATTTTAAAA
AGGTATCTTC TTCGTATGAT TGACAAATTT CCTGTAGAAA AGGAAGAAAA AAATGAAGGT
GTAAGTGAAA ACTTTATTAG AGGAAACAGC AATTTTGCAT TTAACATTTT CAAAGAAATT
AATAAAGATG AACAGGGCAA AAATGTGTTC ATTTCTCCTT TTGGCATTTC AACTGCACTT
TCCATGGTAT ATCAGGGGGC AAAATCCGAT ACCAGGGAAG AAATGGCGAA GGTTTTGGGC
TATGAAGGAC TTGATATTGA AGAGGTTAAC AAAAGCTACA AGTATTTATT GCAATATTTT
AATGGTCTTG ATAATGATAC AAAAATTAAA AGCAGCAATT CAATTTGGAT GAATTCTTTA
CACGGCAATG CTATTAAAGA AGATTTCATA TCCACAAACA AGGATGTGTT TGATGCGTTG
GCTGAGACCC GCGACTTTTC TGATAAAGGT GTTGTGGATG AGATAAATGA TTGGATCAGC
AAGGCTACAG AAGGTCAAAT AGACAAGATG CTCAGCGAAA TTGATATGGA TATGCTGGCG
TATATTATAT CTGCATTGTA TTTTAAAGGT ACATGGACGG AAGAGTTTGA TATTGAGAAA
ACGGTTAGTG TGCCTTTTGC ATCTGAGGAC GGCGGCGCTG ACCATGTTAT GATGATGAGA
AAGGAACTCT GCACTATAGA ATTCGGTGAA GGTGATGGAT ACAAGGCTGT AAGATTGCCA
TATGGTGATG GTGAAATGGC CATGTATTGT ATTCTTCCCG ATGAAGATAC ATCGATAAAT
GATTTTATTC AGAAGTTGGA TTTGTCCATG TGGGAGAAAA TTAAAAACAG TATAACTAAA
AGAGAAAACG GAACGATTTA TTTACCTCGC TTTAAAATGG AGTATGCCAA GGGCGAAAGC
GGCAGTATAA TGGAAAGTTT GAAGGCTTTA GGTATGAAAA AGGCGTTTGA GGAAGACGCT
GACTTGTCCG GTATGACTGA AGCCGACGCA TTTATTAGTG ATGTTTTGCA TAAGGCAGTA
GTGGAAGTAA ATGAAAAAGG AACGGAAGCT TCCGGAGTGG TTGTAATACC TATAGCTCCG
ACGAGTATAG CACCCGGACC CAAGTTTATT GCAAACAGAC CGTTTGCATT CGTAATTGCG
GATGAAAAAT ATGACACAAT ACTCTTTATG GGTAAATTAT GTGACGGCGG GTTGATTAAT
TAA
 
Protein sequence
MQKKLICFLV FALLSSFLFV GNSFADGNWK TYYELYPDPV VTDTSAVIRF KIIKVKIDDY 
ITMPGYVYDV EYWKTDEPSN VMHSYNILWG ESSKQISEGI SEVVYRTELT GLEPNTEYTY
KIYGQMPIRN KEGTPETITF KTLPKKLVYG ELNGDGKINS SDLNMMKRYL LRLIDGLNDT
ACADLNGDGK INSSDYSILK RYLLRMIDKF PVEKEEKNEG VSENFIRGNS NFAFNIFKEI
NKDEQGKNVF ISPFGISTAL SMVYQGAKSD TREEMAKVLG YEGLDIEEVN KSYKYLLQYF
NGLDNDTKIK SSNSIWMNSL HGNAIKEDFI STNKDVFDAL AETRDFSDKG VVDEINDWIS
KATEGQIDKM LSEIDMDMLA YIISALYFKG TWTEEFDIEK TVSVPFASED GGADHVMMMR
KELCTIEFGE GDGYKAVRLP YGDGEMAMYC ILPDEDTSIN DFIQKLDLSM WEKIKNSITK
RENGTIYLPR FKMEYAKGES GSIMESLKAL GMKKAFEEDA DLSGMTEADA FISDVLHKAV
VEVNEKGTEA SGVVVIPIAP TSIAPGPKFI ANRPFAFVIA DEKYDTILFM GKLCDGGLIN