Gene Cthe_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2064 
Symbol 
ID4810662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2455560 
End bp2456831 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content38% 
IMG OID640107471 
Productspore germination B3 GerAC like 
Protein accessionYP_001038464 
Protein GI125974554 
COG category 
COG ID 
TIGRFAM ID[TIGR02887] germination protein, Ger(x)C family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0147375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTTAA AAAATAAAAA AACAGCAAAA ATGTTAATTA CAGTTTTGAT TATAATACCA 
AGCCTGACCA TTTTGCTTAC CGGGTGCTGG GATTCCATAG ACATAGAAGA CCGTGCGTAT
GTAATCGGCA TTGCCATTGA CGAGTACCCT CAACTTCCTC AAGGCATCAA AAATAAAGAA
AACATTCCAG AAAATGAACA GGAAAGAATG TTTGAATCCA GTACGGAAGT TGACACGGGA
GTTCCTTCTT ATGCCATGAC CATACAAATT CCTATTATAA AACATGCCTC ACTCCCCAAC
ATTTTGTCCG GAGGAACTTC AGAGCCCAAT ACGCTGAAAA CCTGGGACAT CACCCAGGTG
GGCAACTCGT TTATGGAAAT AAACAGATCC ATTACAACAA GAATGAACTT GATACCCAAT
TACGAACATC TTCAGGTTAT CATTATCTCG GAAAAAGTTG CAAGAAAAGG CCTTAGAAAT
GTTCTTGACC TTTTTATAAG GGATCATGAA ATGAGAAGCA GGACAAAATT GTTCATAACT
GACGGAGATG CAAAAAAGGC CCTTGATGTC ATTCCAAGAA TTGAAGACTA TGCTTCAATA
TATCTTACCA AAATGCCAAG AAGTGCCAGA GTAAACGGAG AAATACTGCA CTGGATGGAT
CTCGGTCAGG CCGTTCAGGC CATCTATTCC GGTGAGGACT TCGAACTTCC GGCTTTGGAA
GTAACCGAGT ACGAGGTAAT GAACAAAGGC GCAGCTTTGT TTAAAAATGA CAAAATGGTC
GGATGGGCTG ACGGCAAAGA TGTGGAAATT ATTAAAATCA TGCATAATGT GCTTTTAGGC
GGCATCTTTA CTTCAAAATT TGTTTCAGAT GAACATGATT CCGAAAATGG CGTAATGAGC
CTTGAAATAA TCAAATCAAA GACCAAAATC ACACCCGTAA TCCAGGACGA TGATATAACT
TTCAAAATAA ATGTGGACAT TAAAGGGAAT TATTCGGATA GTGTAAATCA TCCTCTCACC
GAAAAAATTG ACAAAGATTT TATAGAAAAA GCTGAAGAAG CCTTTGAGGA GTCAATAAAA
GAACAGTGTA TCAAAACAAT TAAAAAAATG CAGGACCTGG GTGTGGATAC TTTTCATTTT
GGAACCGTTA TAAGAAGCAA GAAGCCCTCC CATTGGTCAA AAATTAAAGA CAGATGGGAC
GAAATTTTTC CTGAAGTTAA AACTGAAGTA AATGTAAAGG TAAATATAAG GCAAATAGGA
AACATCCACT AA
 
Protein sequence
MRLKNKKTAK MLITVLIIIP SLTILLTGCW DSIDIEDRAY VIGIAIDEYP QLPQGIKNKE 
NIPENEQERM FESSTEVDTG VPSYAMTIQI PIIKHASLPN ILSGGTSEPN TLKTWDITQV
GNSFMEINRS ITTRMNLIPN YEHLQVIIIS EKVARKGLRN VLDLFIRDHE MRSRTKLFIT
DGDAKKALDV IPRIEDYASI YLTKMPRSAR VNGEILHWMD LGQAVQAIYS GEDFELPALE
VTEYEVMNKG AALFKNDKMV GWADGKDVEI IKIMHNVLLG GIFTSKFVSD EHDSENGVMS
LEIIKSKTKI TPVIQDDDIT FKINVDIKGN YSDSVNHPLT EKIDKDFIEK AEEAFEESIK
EQCIKTIKKM QDLGVDTFHF GTVIRSKKPS HWSKIKDRWD EIFPEVKTEV NVKVNIRQIG
NIH