Gene Cthe_1513 details

Gene Information

Locus tag: Cthe_1513
Symbol:
ID: 4810551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1837328
End bp: 1838269
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 35%
IMG OID: 640106933
Product: DNA adenine methylase
Protein accession: YP_001037934
Protein GI: 125974024
COG category: [L] Replication, recombination and repair
COG ID: [COG0338] Site-specific DNA methylase
TIGRFAM ID: [TIGR00571] DNA adenine methylase (dam)
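
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1837328
end_bp = 1838269

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 942 313
```

Under these assumptions the result agrees with the reported gene length of 942 bp and protein length of 313 aa.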


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000971855
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAATTTG CAATTAGCAT TAATAATATT AATTCTACTG TATGCGCAAA ACCTTTTGTA 
AAGTGGGCCG GAGGAAAAGG GCAATTGCTT GACACTTTTA GACAATATTA TCCTTCTACG
CTTATTAAAG GCTATATAAG ACGTTATATT GAGCCTTTTG TCGGCGGTGG GGCGGTTTTA
TTTGAAATTT TGCAGAAATA TAAAGTTGAG GAAGCTTTTA TATTTGATAT AAATGAGGAC
TTAATTAACA CTTATGTAGT GATTAAAAAT GATGTGCACA ACCTCGTGGA ATATCTTTCA
GATTTAGAGT GCAAGTATTT AAATTTGGAT GAAAAATCTC GCAAAGACAT GTATTATGAT
ATAAGAGATG CATACAATTC ACGAGCTTTA AAGAACAATC AGCCGGATGT TGAAAGAGCT
GCACAGTTTA TTTTTCTAAA TCGTACATGT TTTAACGGGC TTTATCGTGT TAATCGTGCG
GGACATTTTA ATGTGCCGTC CGGAGATTAT AAAAATCCAA CCATTTGTGA TGAGAAGAAT
TTGTATGCAG TAAGTTCTTT GCTTCAAAGG GTGCATATAT TTGTCGGTGA TTATAGAGAA
TGTGCCGGAT ATGTAGACAA GGATAGTTTT GTTTATTTTG ACCCTCCGTA CAGGCCGCTT
AATGTTACAT CCAGTTTTAC ATCTTATAGT AAATTTGATT TTACGGATGA AGATCAAATA
CAGCTGGCAA AATTCTTTTC AGAAATGAAT GATACAGGTG CTTTGCTTAT GCTGAGCAAT
TCCGACCCTA AAAATGAAAA CCCTGATGAT AATTTTTTTG ATGAATTGTA TAAGGAGTTT
TTCATTCACA GGATAAAGGC TAAGCGGGCG ATTAATTCAA ACGGCAGTCG GAGAGGATTA
ATTAGTGAAC TTCTTGTTAC GAACTATGAA GTAAAAGACT AG
 
Protein sequence
MEFAISINNI NSTVCAKPFV KWAGGKGQLL DTFRQYYPST LIKGYIRRYI EPFVGGGAVL 
FEILQKYKVE EAFIFDINED LINTYVVIKN DVHNLVEYLS DLECKYLNLD EKSRKDMYYD
IRDAYNSRAL KNNQPDVERA AQFIFLNRTC FNGLYRVNRA GHFNVPSGDY KNPTICDEKN
LYAVSSLLQR VHIFVGDYRE CAGYVDKDSF VYFDPPYRPL NVTSSFTSYS KFDFTDEDQI
QLAKFFSEMN DTGALLMLSN SDPKNENPDD NFFDELYKEF FIHRIKAKRA INSNGSRRGL
ISELLVTNYE VKD
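
As a quick consistency check between the two sequences, the sketch below translates the first 30 bases of the gene sequence. It uses the standard codon assignments, which translation table 11 shares for elongation codons, and reads an initiator codon such as TTG as methionine, as table 11 allows. The helper name `translate` and the constants are illustrative and not part of the record. This is a minimal sketch, not a full implementation of the genetic code tables.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Alternative initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a DNA string, ignoring spaces and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "TTGGAATTTG CAATTAGCAT TAATAATATT"
print(translate(prefix))  # prints: MEFAISINNI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function is the natural next step for checking the remaining residues.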