Gene Cthe_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2037 
Symbol 
ID4811007 
Type 
Is gene splicedNo 
Is pseudo geneYes 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2419749 
End bp2421277 
Gene Length1529 bp 
Protein Length 
Translation table 
GC content35% 
IMG OID640278310 
Product 
Protein accession 
Protein GI 
COG category[L] Replication, recombination and repair 
COG ID[COG3328] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.620147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAACAT TATCAAAAGA GCAAGTGAAA GAAATAATTA AGGGCAATAA TTTCCAAAGT 
GTTGCCGATG TAAGTGCATA CCTAAGAGAT ATCTTTAAGG ACATTATTCA AGAACTTCTT
GAAGCAGAAC TTGAAGCTAA ATTGGGATAT GCAAAAGATG ATGTAGAAAA CAAAAATACA
GATAATAGCC GAAACGGATA TTCACCAAAG ACCATAAAAA GTGAATTTCG AGAAGTTGAA
ATCCAAGTAC CAAGGGATCG CAAAGGAGAG TTTAAACCCC AAATTATACC TAAGTATCAG
AGGAATGTTT CCGGAATTGA AGAAAAAGTT ATTGCTCTGT ATGCCAGAGG AATGTCCACC
CGGGATATTA GTGAACAAAT TGAAGAACTT TACGGCTTTA GTTTGTCGCC CGAAATGGTT
AGTAAGATTA CAGACAGAAT AGCTCCAGAA ATCAAGGAAT GGCAACAAAG ACCGCTGGAA
CCTATATACA CGTTTGTTTT TATGGATGAA ATTCACTAGA CAACGAATCA TTGAAATTTT
GGCTTGGAGT ATTAAATGAT CTTAAGAACA GAGGAGTACA GGATATGCTA ATATTTTGCG
TTGATGGACT GACAGGTATA AAAGAAGCGA TTAATGCGGC ATATCCAAAG GCCGAAGTAC
AACGCTGCAT AATACATCAA CTTCGAAATT CCTTTAAATA TGCACCATTC AAGGACATAA
AAGCTTTTAG CAATGACTTT AAGGAAGTAT ATCGGGCAAT TAACGAAGAC AGACATGCTA
TAGTTAATTA TTTGAAAAGC AGGGCTGATG TAATAGTTGT AACAAGGACA GTGTTAATGG
AATTGACTGC CAATTCTTTT GAGTTGCATC CGGTTCAAAT CAAATATTTC AAGGAACTTA
ATAGCAGTAG TTTTAAAGTT GTTCTGTTTG ATGAAGAAAC GGTATATGAT TGTTTAAAAG
AAGTTTTGAA TATTAGTACC GAGGAAGCAA ACAGATTGCT GGGATATGCT GTTAAAGAAG
TATGCAGATA CAAGGCAAAA ACAAGTGAAA TTATTGAAAA CATGGACAAG CATAGATCTC
TAAAACTTAA AAGTACAAAT CCGGGAAAGA GGGAACTTTT CAGTACTTTC TTTCGGTATG
CCAGAACACG GAAAAGTGAG GGGAACAGTA TTGCAGAGGA GTTAATTCTT ATATGCATAA
TAGTACTTAC AATAATTCCC ATGGGGCGGT ACATTTTAAT AAGTGATGAT ATGAGGATAA
GGCCTCAGGT AATAAGCGTA AATGATTATA TTTTAAGACA TCACGGGAGA AAAGAGCCTT
ACCAGCTGAC AACATCGGCA TTTGTGTATA AAATGTATAA GGATAATGTG CTTACCAACA
GAGAAGACAT GATTGAAATC ATGAAAGCAG CTTTTAAAGA AAATGTTAGA GTCTTTTTTG
TTGGTGAATA TGACATTCAG CAAAGATATG AGCCTTTTAA GTGTGAAGAT TTGATTGACA
GGCTTTTAAA TGAGAGGATT TTAAGATAA
 
Protein sequence