Gene Cthe_1144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1144 
Symbol 
ID4810812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1360911 
End bp1362332 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content27% 
IMG OID640106566 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001037569 
Protein GI125973659 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAATA ATGTGTTGGT ATGGAAGCAA CCAAATATCT CTTGGATTCA TCCTATTAAT 
ATAGATAATA GTATTAATGC CAATAACTTT AATTTTGACT ATCTAGATAC ATTAAATAAG
TTAAGAAGTA GCAATTTAAA AATTTTAGAA TTAAGAGATA TAGCAGATAA AATATCAGAT
GGACCGTTTG GCAGTCAATT GAAGGTAGAA GAATATAAGG AACAAGGATT TCCAGTATAT
AGAGTTAAAA ATATTATTGA TACTCAAATT TTGGATGATG ATATTGTATA TATTGATGCT
AAAAAGCAAC AACAATTAAA GAGAAGTGAA GTATTACCTG GGGATGTATT AATAACTAAA
GCAGGCAGAA TAGGTTCTGC TGCTGTTGTA CCAAGTAAAT TTGGAAATGG GAACATAACT
TCACATTTAG TGTTAGTTAG ATTAAAAAAA ACAATCAATA ACTATTATTT GGTTGCTTAT
TTAGAATGTA AGTATGGTAA AGTTATTACA GGTCGAGAGA GTTATAAGTC AACAAGACCT
GAATTGACAA AAAATGAAAT AGGAAATGTT ATAATCCCCA TCCCATCTCC TGAAATTCAA
AAATACATAG GAGATAAGGT TAGAAAAGCA GAAGAGTTGA GAGAAGAAGC GAAAAGGTTG
AAGAAAGAGG CTGAAACATT TCTTTATGAA ATGATTCAAC TTAAACCATT AAATGATTTT
GATAAAGATA TGTTTTCATT TGTCAATAGT AATTATATTG ATTCTGAAAG ATTAGATTCA
GAGTATTATA AAACAAAATA TATTACATTA GAGAAACTCT TAAAAAGTAA AAAAGTTACT
TCTTTTAAGG ATATTATAAT CGAAAGTAAG TATGGAGCAT CTGTACCAGC AGATTACACA
ATGGTTGGTA TACCTTTTAT TAGAGGAAAT AATTTAACTG ATAATGAAAT TAATATTGAT
GATATTGTAT ATTTAAATAA AAAATTAAAA GATGAAGTTA AAGACCATCA TGTAAATACT
GGAGATATTT TGATAACAAG AAGTGGAACT GTTGGTATTA GTGCAGTTGT TGATGAAAAA
TGCGATGGGT TCTCATTTGG TTCATTTATG ATAAAACTAC GTATTGATAT GAGAATATGG
AACCCTTATT ATATAGCAGC ATTCTTAAAT TCATTTTGGG GAAAATGGCA AATTGAAAGG
TTACAAAATG GTGCTGTTCA GCAAAATATT AATTTACAAG AAATTGGTAG AATTATAATA
CCTATTATTT CAAAAGAAAA TCAAGATAAA ATTGAAGAAT TAATCAAAAA TTATATTAAT
AAAAAAAGAC AATCAAAACA ACTAATTCAA GAAGCAAAAC AGGACGTAGA AGACCTTATA
GAAGGCAACT TTGATATGTC AAAAGTAAAA GCAAATAGTT AA
 
Protein sequence
MINNVLVWKQ PNISWIHPIN IDNSINANNF NFDYLDTLNK LRSSNLKILE LRDIADKISD 
GPFGSQLKVE EYKEQGFPVY RVKNIIDTQI LDDDIVYIDA KKQQQLKRSE VLPGDVLITK
AGRIGSAAVV PSKFGNGNIT SHLVLVRLKK TINNYYLVAY LECKYGKVIT GRESYKSTRP
ELTKNEIGNV IIPIPSPEIQ KYIGDKVRKA EELREEAKRL KKEAETFLYE MIQLKPLNDF
DKDMFSFVNS NYIDSERLDS EYYKTKYITL EKLLKSKKVT SFKDIIIESK YGASVPADYT
MVGIPFIRGN NLTDNEINID DIVYLNKKLK DEVKDHHVNT GDILITRSGT VGISAVVDEK
CDGFSFGSFM IKLRIDMRIW NPYYIAAFLN SFWGKWQIER LQNGAVQQNI NLQEIGRIII
PIISKENQDK IEELIKNYIN KKRQSKQLIQ EAKQDVEDLI EGNFDMSKVK ANS