Gene Cthe_1270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1270 
Symbol 
ID4809775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1544378 
End bp1545601 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content36% 
IMG OID640106693 
Productproteinase inhibitor I4, serpin 
Protein accessionYP_001037695 
Protein GI125973785 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4826] Serine protease inhibitor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA CAGCTTGCGC AGTTTTATGC ATATTTGTTT TAACATTTTT ATTTTCCGGG 
TGTTCAAGGG AAAAAAAGGT TTTTGACGAA TTAAAACTTG ACACCGAACT TGAAAAGAAA
AATACAGAAT TTTGCTTTGA CATATTCTCA AAGCTGAACG AAGAGGACTA TAATAAAAAC
ATTTTTATTT CTCCCTTGAG TATTTCAACC GTACTCTCCA TGACCGTGCA AGGTGCCGGA
ACCACAACAA AAGACGGTAT GCTGAAAGCT TTGAAATATG ACGGTATGGA TCTCGATAAA
ATTAACGAAT CCTACAGATA TATACTCGAC TATTTAAGCA AAACTGATAA AACAATTGAA
CTTGAGATTA ATAATTCCAT CTGGATAAGG GAAGGAAAGC AAATCAAAAA AGATTTTATT
GACATTAACA AAGATGTATA CAATGCATAC GTTACCGAGC TTGACTTTTC AAGCCCAAAT
GCAGCAGACA GGATTAACAA ATGGATTTCC GACTCAACGA AGAAAAAAAT CACAGACATA
ATTGATTCAC CAATACCTGA AAATACTGCA ATGTTTCTTA TAAACGCCAT TTACTTCAAG
GGAGACTGGG CGGAAAAGTT TAAAAAACAA GATACGTTCA CCGCCAAGTT CCAATCGGGC
AACGGCCAAA CAAAAGAAGT TATGATGATG GAAAGAAAAG ATACAATAGA ATACGGAGCC
AAAGAGGATT TCAAGGTTGT AAGGCTTCCT TACGGAAAAG GCACAACATC AATGTATTGT
GTTTTGCCTG CCAAAGACGT TTCAATAAAT GATTTCATAA AAACCCTTGA TGTCAATAAA
TGGGAAGAGA TAAAAAACAG TATTTCTAAA GCTGAAAACG TAACCTTAAA TATTCCAAGG
TTTAAAATAG CTTATGGAAC TAAAGAATTA AGAGACTGTC TTATTGCCAT GGGAATGGAA
GAAGCATTCA CCGAGCGGGC TGATTTTTCC GGAATAAGTG AGGGCCTTCT CTTCATAGAC
AGTGTAATTC ATAAAGCAAT AATTGAGGTT AATGAGGATG GAAGCACGGC GGCAGGCAGT
ACAGTGGTCA GAATGATAGA TGGTGCTGCA ATAGGAGAAC CGCTTTCTTT CATTGCAGAC
AGACCGTTTC TGTTTTTCAT AACCGAAGAT GTTACAGGTA CTATACTATT TATGGGCAAA
TTGTATGATT GTGAAAAATA TTAA
 
Protein sequence
MKRTACAVLC IFVLTFLFSG CSREKKVFDE LKLDTELEKK NTEFCFDIFS KLNEEDYNKN 
IFISPLSIST VLSMTVQGAG TTTKDGMLKA LKYDGMDLDK INESYRYILD YLSKTDKTIE
LEINNSIWIR EGKQIKKDFI DINKDVYNAY VTELDFSSPN AADRINKWIS DSTKKKITDI
IDSPIPENTA MFLINAIYFK GDWAEKFKKQ DTFTAKFQSG NGQTKEVMMM ERKDTIEYGA
KEDFKVVRLP YGKGTTSMYC VLPAKDVSIN DFIKTLDVNK WEEIKNSISK AENVTLNIPR
FKIAYGTKEL RDCLIAMGME EAFTERADFS GISEGLLFID SVIHKAIIEV NEDGSTAAGS
TVVRMIDGAA IGEPLSFIAD RPFLFFITED VTGTILFMGK LYDCEKY