Gene Cthe_0667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0667 
Symbol 
ID4810284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp821833 
End bp823602 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content40% 
IMG OID640106083 
ProductGerA spore germination protein 
Protein accessionYP_001037095 
Protein GI125973185 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGACT TAAAAAAACT GATTAAAGAC ATGTTTGTTT TCAAAGAGCC CGAACCTCGG 
GAAGAATTCA TATTGGAGGA AAAAGAGTCT GAAAAAATTG TTAATGTTAC GGACGCACAT
GATGAAACTT CGGAAAAAAA GAAAAAAGGT TCGTTAAGAA ACCTTTTTTC AAAATCAGCC
AATAGTGAAA AGAAAGACAG CGAGTCCGGC AAAGAAAGCA ACAACACGGA AAATATGACA
TGTGTAAGCA AAAATCTTAA AGAAAATGTC GAGTATTTAA AGAAAAGGTT TTCAATTCCT
ATTAATGGAG ATGTTGTTTT AAGAGAGTTT GATATTGTAA TAAAAGACAG GAAAATTCCC
GCATGCCTTA TATTTTATGA CGGAATGGTT AACGGCATGC TCATAAATCT TAATATTCTG
CAGCCCCTTA TGCTTCTTTC CAATTTGGAT GTAAAGGGCA AGAACGGGGA AAAAGACATT
GCCGAGTATA TTCACAAAAG TCTGGTTACT CACAATCAGG TAAAAGTATC CCATGAGTTT
GATGAAATTG TTGGAGAGAT TAATTTTGGC GGTTGCGGGG TTTTCATCGA CGGAATAGAT
GTTGCCTATG CCTGTGATGT AAAGGGATGG CAGCACAGGG GAGTGGACAG GCCAAACAAC
GAGATTGTTA TAAGAGGTCC GCAGGAAAGC TTCAATGAGA TACTCAGGGT AAACACCGCC
CTTGTAAGGA AAATCTTAAA AGATGAAGAT CTTGTGGCGG AAAGCATAGA AATAGGAAAA
AGAAGTAAAA CTCCTTGTTC ACTTTTGTAT ATAAAGGATA TTGCAAACGA GTCTTTGGTA
AATGAAGTAA GAAGAAGGCT TCAAAACATA AAAACGGATT ACATATTTGA CACCGGTGAG
CTGGAGCAGT ATATAGAGGA CAATACCCTA ATGTCCACTC CGCAAATAGT GGCCACGGAA
AGACCCGACA GAGTGGCATC CATGCTTGCT GAGGGGAAAG TGGCTGTGAT TATGTCGGGA
AGCCCATTTG CTCTGGTAAT GCCTACTACC AACAATGATT TCCTTCAATC GGCGGAAGAT
GCTTATGTAA GGTTTCCATA TGCAAACCTG CTTCGGATAA TGAGGGTTAT AGCCATATTT
ATGTCACTGC TTTTACCTGG GCTGTATGTG GCAATAACAA ATTATCATCA CGAAATGATT
CCAACAGACC TTCTTTTTGC CATAGAAGCT TCCAGGGAAA GAGTACCGTT TCCATCGGTT
GTGGAAATAA TCATAATGGA ATTTGCCTTC GAGCTTATTC GTGAAGCGGG TCTCAGAGTT
CCAAGCCCCA TAGGTCCCAC CCTTGGCATA ATTGGGGCTC TGATACTCGG GCAGGCGGCA
GTTGCGGCAA ATATTGTAAG CCCGATTCTT ATAATTGTAG TTGCGGTTAC CGGTATTGGG
TCTTTTGCAA TACCAAACTT TTCGTTGGGA TTTTCTTTCA GGATTTTAAG ATTTGCCTAC
GTTTTTCTAG CTGCTATGGC TGGCTTTTTG GGCATTACTT TTGGTTTGTT TGTGCAAAGC
ATTATTTTGT GCAATGCAAA ATCTTTTGGA GTACCTTTTA TGGCGCCTTT TGGACCGAAA
ACCAAGAGCC GTTTCCAGGA TCAGTTCTTC AGGTCGCCTA TTTGGAAGCA GGAAAAAAGG
CCGGATTTTT TAAATACCAA AGATACTCAA AAGCAGCCTA AGATATCGCG GCAATGGAGA
AAAAGCGAAA AGAAGAAAGG TAAGCAATAA
 
Protein sequence
MADLKKLIKD MFVFKEPEPR EEFILEEKES EKIVNVTDAH DETSEKKKKG SLRNLFSKSA 
NSEKKDSESG KESNNTENMT CVSKNLKENV EYLKKRFSIP INGDVVLREF DIVIKDRKIP
ACLIFYDGMV NGMLINLNIL QPLMLLSNLD VKGKNGEKDI AEYIHKSLVT HNQVKVSHEF
DEIVGEINFG GCGVFIDGID VAYACDVKGW QHRGVDRPNN EIVIRGPQES FNEILRVNTA
LVRKILKDED LVAESIEIGK RSKTPCSLLY IKDIANESLV NEVRRRLQNI KTDYIFDTGE
LEQYIEDNTL MSTPQIVATE RPDRVASMLA EGKVAVIMSG SPFALVMPTT NNDFLQSAED
AYVRFPYANL LRIMRVIAIF MSLLLPGLYV AITNYHHEMI PTDLLFAIEA SRERVPFPSV
VEIIIMEFAF ELIREAGLRV PSPIGPTLGI IGALILGQAA VAANIVSPIL IIVVAVTGIG
SFAIPNFSLG FSFRILRFAY VFLAAMAGFL GITFGLFVQS IILCNAKSFG VPFMAPFGPK
TKSRFQDQFF RSPIWKQEKR PDFLNTKDTQ KQPKISRQWR KSEKKKGKQ