Gene Cthe_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1302 
Symbol 
ID4809554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1579458 
End bp1581125 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content42% 
IMG OID640106725 
Producthypothetical protein 
Protein accessionYP_001037727 
Protein GI125973817 
COG category[R] General function prediction only 
COG ID[COG0595] Predicted hydrolase of the metallo-beta-lactamase superfamily 
TIGRFAM ID[TIGR00649] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAAAAA AGAAGAGAAA GTTAAAAGTC ATACCCCTTG GCGGATTGGG GGAGATAGGA 
AAAAACATTA CTGTTTTTGA ATATGGCGAT GATATATTTG TGGTTGACTG CGGTATTGCT
TTCCCGGAAG ACGATATGCT GGGAATAGAT CTTGTTATAC CGGATATATC ATATCTGACC
AAGAACAGGG AAAAGGTGAG AGGTATAGTT CTTACCCACG GACATGAGGA TCATATTGGT
GCATTGCCTT ATGTTCTGAA GGATTTGAAC GTACCCGTAT ACGGCACAAA GCTTACTTTG
GGACTTTTGG AGCAAAAACT TGAAGAGCAT GGGCTTTTAA ACAATGTGGT TCTCAATGTT
GTCAAACATT CCGATGTGAT AGAACTGGGA TGTTTCAAGG TTGAATTTAT CCGGTCAACT
CACAGTATAG CAGACTCAAC GGCTTTGGCT ATTTTTACCC CTGTGGGTAC AATTTTTCAT
ACCGGAGATT TCAAAATTGA TTACACGCCC ATAGAGGGTG AGCCCATTGA TCTGGCAAGG
CTTGCTGAAC TTGGGAAAAA AGGTGTGCTG CTTCTTATGT GTGACAGCAC CAACGTTGAA
AGAGAAGGCT ATACAATGTC GGAGAAAACC GTTGGAGAAA CCTTTGATGA GATTTTCATG
AATGCAAAGA ACAGGATACT TGTAGCAACC TTTGCGTCCA ATGTTCATCG AATTCAGCAA
ATTGTCAATG CTGCAATCAA ATTCGGAAGA AAAATCGCCA TATGCGGAAG AAGCATGGTC
AATGTCGTAA ATGTTGCCAT GGAACTTGGC TATATGAATG TACCCGAAGG GCTGATTATT
GATATAGACC ACATAAACAA ATATCCGCCT GAAAAGATAG TGATAATCAC TACGGGAAGC
CAGGGAGAAC CAATGTCAGC CCTGACGCGA ATGGCTTCCG GTGACCATAA GAAGGTTGAA
ATCATACCAG GCGACCTTGT TATTATTTCC GCAAATCCCA TACCCGGAAA TGAAAAACTT
GTTTCAAGAG TGGTAAATGA CCTTTTCAAA AAGGGTGCGG AAGTTATATA CGAATCTTTG
GCAGATATTC ATGTTTCAGG TCATGCGAGC CAGGAAGAGT TAAAGCTTAT CCACAGACTG
ATAAGGCCAA AGTACTTTAT GCCGGTGCAT GGTGAGTACA GGCATTTGAA GCGCCATGCA
AATCTTGCCG TTGAGCTGGG AATGTCGCCC GAAAACATTT TTATCATGGA TATTGGAAAA
GTCCTGGAGC TTACCAATGA CTCTGCGAAG ATAAACGGCA GTGTGAATGC CGGAAGAGTG
CTGGTTGACG GTCTTGGAGT GGGAGATGTG GGAAATATAG TCTTAAGGGA CAGAAAACAT
TTGTCTCAGG ACGGACTTAT AGTTGTGGTT ATTACCATAG AAGGAGATAC CGGCAATGTA
ATTGCAGGAC CTGATGTGAT ATCCAGAGGT TTTGTATATG TGCGGGAATC CGAAGACCTC
ATGGAAGAAA TAAGAGAAGT GTGCAAAGCT GCGCTTCAAA AATGCAATGA CAAGAAAAAG
AATGACTGGT CTACGAAGAA AAGCATTATA AGAGATGCCT TAAGAGACTT TCTCTATGAG
AGAACCAAGA GAAGGCCGAT GATTCTGCCA ATAATCATGG AAGTGTAA
 
Protein sequence
MAKKKRKLKV IPLGGLGEIG KNITVFEYGD DIFVVDCGIA FPEDDMLGID LVIPDISYLT 
KNREKVRGIV LTHGHEDHIG ALPYVLKDLN VPVYGTKLTL GLLEQKLEEH GLLNNVVLNV
VKHSDVIELG CFKVEFIRST HSIADSTALA IFTPVGTIFH TGDFKIDYTP IEGEPIDLAR
LAELGKKGVL LLMCDSTNVE REGYTMSEKT VGETFDEIFM NAKNRILVAT FASNVHRIQQ
IVNAAIKFGR KIAICGRSMV NVVNVAMELG YMNVPEGLII DIDHINKYPP EKIVIITTGS
QGEPMSALTR MASGDHKKVE IIPGDLVIIS ANPIPGNEKL VSRVVNDLFK KGAEVIYESL
ADIHVSGHAS QEELKLIHRL IRPKYFMPVH GEYRHLKRHA NLAVELGMSP ENIFIMDIGK
VLELTNDSAK INGSVNAGRV LVDGLGVGDV GNIVLRDRKH LSQDGLIVVV ITIEGDTGNV
IAGPDVISRG FVYVRESEDL MEEIREVCKA ALQKCNDKKK NDWSTKKSII RDALRDFLYE
RTKRRPMILP IIMEV