Gene Cthe_0744 details


Gene Information

Locus tag: Cthe_0744
Symbol:
ID: 4810362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 906953
End bp: 909805
Gene Length: 2853 bp
Protein Length: 950 aa
Translation table: 11
GC content: 35%
IMG OID: 640106161
Product: copper amine oxidase-like protein
Protein accession: YP_001037172
Protein GI: 125973262
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
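
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, though both agree with the numbers shown. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 906953
END_BP = 909805
GENE_LENGTH_BP = 2853
PROTEIN_LENGTH_AA = 950


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```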


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0284695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TCAAACAAAT GAAAGTGGCG TTATTTTTGT TTTCTTTTTT AAGTGTAATT 
GTTTTTATGT CTGTAAGTTC TTTGGCTTTG CCTGATGTAC CGAAAAAAAG TTCAAACACT
TCGGGGAATA TGGCCAATAA CGGTCGTGTT TTAAAATACG ACGGCTGGGT TTATTATTCT
TTCGATGAAA GCGGACTGTA TAGAATGAAA GAGGATGGAA GCCAGAAGAA AAAGATATGT
GACGGAATGT ATGACAACCT GACCGCATAT GACGGCTATA TTTACGGCTA TTGCAGATAT
ACAAAAACAA ATAACCCTGA GGAAACAGGA TTGTTCAGGT TGAAGCCTGA TGGAACGGAA
AGAGTGAAGA TTAGCGATAA ATCCATGCTT TTTGTTACGA TATATGATGG ATGGATATAT
TATACATCGT TTGATGATAA TTTCAAACCT TATAAAATGA AACTTGACGC TACCGATGAC
CAAAAACTGA GTGATTATTC TGCTAGTTAC ATAAATGTTG ATAATCAATG GGTATATTTC
CAGAATGATG CGAATGGTGG GCGTATATAT AAAGTGAAAC ATGACGGCAG TCAAATTACT
GAAGTTAGCG ATCATGGTAA TATGTATACG TATTTGAACA TTGACGGTGA ATGGATTTAC
TTTTCCGGTC ATTATTTCCT TTATAAAATG AAGGTTGATG GAACTGAATT AACTCCGCTG
TTTGAGGAAC TTATAAATAA TGTAAATTCT AAGGATAGTT GGATTTACTT TTCTGTTTTT
GAAGAAGGGA TATACAGAAT TAAAACGGAC GGAACGCAAT TACAGAAACT GCGTGATGTG
GAGGATTTTG TTTCAGGTAT AAGTCTTACG GATGAATGGC TGTATTATGA GGTATATGAT
GTTAAGGATT CCAGTACTCG TGTTTACAGA ATGAGGTTGG ACGGTTCTAG TCACCAGAAG
TTTAAAATTA CCGAGGATCA TGTTCCCGAT GAAGTGGAAA ATGTAAAAAT CAGAATTGAC
GGCAAGTTCG GTGAATACAG CAATGTACCG TTAAATCTCT ATGGAAGGAT TTTGCTTCCT
TTCAGGGAGA TTCTGAAAAA TCTTGGAGTT CCCGACGATG ATAAACATAT TATCTGGGAC
GGAAAAAACA GGACTGTTAC CGTAAAAAAG GACAACATTA CGATTCTTCT GACAATAGGG
AAAAACACGG CATTGGTAAA CGGAAAAGAA TATGTACTGG ATGTTGCACC CATTATATAC
AATGACCGTA CGTATATTCC GACAAGGTTT ATCGCAGAAA GCTTGAACAA GAAGGTTTTA
TGGGACGGGG AAAAGCAAAT TGTGTCTATT TGTGAGCCTG CAGAGTTTGA AAAGGTAAAA
GATATACTTG CAAAAACCAA TGATGCAATG GTGAACAGTG TTGAAAGATA TAGTGTTGAC
CAGAAAAATA GTATGAAATA TAACAATGAC TATTTGGATT ATGAGATTCA GATGACAGCG
AAACTTGAAA TTGATTCGAA AAAGCGGTTG TGCAATATTG TTGGCGAAAG AAAAATATTG
GAAACGGGTA ATATTACAAA TTTTAATTCT GAAGAAACAT CAAATATATA TTGCTGGTGT
GAAAATGAGA AGATTTATTT GAAAGATGAG AATTATGATT TGTGGTATGA GTCGGAATTA
ACAGAAGATG AATGGGAAAA TTTTAAACGC AGGGTTTTAA GCAACTATAG CTATATAAAC
GCTGATGATT TAATCTGTTC AGGTTTTACT GTGGACGAAA CCGACGAACA TTATATATTA
AAGGGTGAGC ATATTTTTGA TGAGTTGATA TCCGGCGCAT TGTATGAACT GGATATAGTA
AATGAAATAA TCAGTGGTAC AAACACCGAA TTGTTAATCA ATAAGAAAAC TTATTATTTG
GAAAAAATAA ATACTACAGT AACCGGTAAA GAGAAAAGTT CTTATGGAGG AAATTTCAGC
ATTGCTGTTA GCCTGGAAAA TACGAATTTT AACAGCGAAG CACGGGTTAC AGCTCCAGAA
GGATTCGATC CTGATAAACT TATAAATAAA AAAGCTGTTG AATTGGTGAA TTTTATATCC
CAGGCATACC TTTATTTTAA TGAAATAGAT AACTTTGAAG AAAAGAGTAA AGAATTTATA
ACAAAGAAGG ACTTTAATTT TGATGATGTA AAACAGTATA TTGAAGCTAT TAAAGCAGAT
GATGATGTTT TTACCGTATG CGTGGATGAA AATAGTCTTG AATATGGATA TAAAGAAGAA
CAGATTGAGA CAAAAGACTT GGGGAAAGAC GCTGTCTATA TAAAAATAAA AAGCTTTACG
GAGGATGTCG GGGATAAATT TATTGAGGAA GCGGACAAGA TTGAGAATTC CGAGGATAAA
ACTCTTGTAA TTGATTTAAG AGATAACGGA GGGGGATTTA TAATATCCGC AAATGATATT
CTTGACTATT TGCTGCCAAG GTGCCTGATG AATTATTATA TCAGCCGGAG CGGAGATATG
TTGCCTGTTT ATTCTAATGA TGATTATAAA GAGTTTAAAC AAATACTCAT TCTTGTTAAC
GAGAATACTG CAAGCAGTGC GGAACTTCTT GCATTGGGAC TTAAAAAGCA CTTGAAAAAT
ACCACTGTTA TAGGACGGAC AACCCTTGGA AAAGGTGTCG GACAACTTGT ATACAGAGAT
GATGATAAAA AGTTTTCGGT ATACCTTGTA AGTTTTTACT GGAATGTAAA AGAACAGAAC
GTAATGAAAA GCGGCATTAC ACCGGATATA GTTGTTAACA GTGATTCTGA TTATTTGAAG
GAAGTGGAAA AATTATTGAA GTCAAAACGT TGA
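
The gene sequence can be related to the protein sequence and to the GC content field with a short script. This is a minimal sketch, not the method used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the script assumes the sequence contains only A, C, G, and T.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


# First 30 bases of the gene sequence listed above.
first_bases = "ATGAAAAAAA TCAAACAAAT GAAAGTGGCG"
print(translate(first_bases))  # MKKIKQMKVA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in this record, and `gc_fraction` on the same input can be compared with the GC content field.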
 
Protein sequence
MKKIKQMKVA LFLFSFLSVI VFMSVSSLAL PDVPKKSSNT SGNMANNGRV LKYDGWVYYS 
FDESGLYRMK EDGSQKKKIC DGMYDNLTAY DGYIYGYCRY TKTNNPEETG LFRLKPDGTE
RVKISDKSML FVTIYDGWIY YTSFDDNFKP YKMKLDATDD QKLSDYSASY INVDNQWVYF
QNDANGGRIY KVKHDGSQIT EVSDHGNMYT YLNIDGEWIY FSGHYFLYKM KVDGTELTPL
FEELINNVNS KDSWIYFSVF EEGIYRIKTD GTQLQKLRDV EDFVSGISLT DEWLYYEVYD
VKDSSTRVYR MRLDGSSHQK FKITEDHVPD EVENVKIRID GKFGEYSNVP LNLYGRILLP
FREILKNLGV PDDDKHIIWD GKNRTVTVKK DNITILLTIG KNTALVNGKE YVLDVAPIIY
NDRTYIPTRF IAESLNKKVL WDGEKQIVSI CEPAEFEKVK DILAKTNDAM VNSVERYSVD
QKNSMKYNND YLDYEIQMTA KLEIDSKKRL CNIVGERKIL ETGNITNFNS EETSNIYCWC
ENEKIYLKDE NYDLWYESEL TEDEWENFKR RVLSNYSYIN ADDLICSGFT VDETDEHYIL
KGEHIFDELI SGALYELDIV NEIISGTNTE LLINKKTYYL EKINTTVTGK EKSSYGGNFS
IAVSLENTNF NSEARVTAPE GFDPDKLINK KAVELVNFIS QAYLYFNEID NFEEKSKEFI
TKKDFNFDDV KQYIEAIKAD DDVFTVCVDE NSLEYGYKEE QIETKDLGKD AVYIKIKSFT
EDVGDKFIEE ADKIENSEDK TLVIDLRDNG GGFIISANDI LDYLLPRCLM NYYISRSGDM
LPVYSNDDYK EFKQILILVN ENTASSAELL ALGLKKHLKN TTVIGRTTLG KGVGQLVYRD
DDKKFSVYLV SFYWNVKEQN VMKSGITPDI VVNSDSDYLK EVEKLLKSKR