Gene Cthe_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2749 
Symbol 
ID4810252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3243871 
End bp3245376 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content37% 
IMG OID640108169 
Producthypothetical protein 
Protein accessionYP_001039141 
Protein GI125975231 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.596289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACTA TGACTGATAT AAAGTATATC AAAGATTTAT TCGAAAAGAA AGGCCTATCT 
CTTAGGGAAA TTACAAGAGT AACCGGACAT AATTTTAGAA CAGTCCGGAA ATATATTGAT
AAAGAGGATT GGTCACAACC TCTGGTTAAT AGAACAAGGG AATCTTTGAT TAATAAATAT
AAAGCAGATA TTGATGAATG GCTGGAGAGT GACGTTGATG CACCAAGAAA ACAGAGACAT
ACGGCAAAAA GAATTTTTAA CAAACTGAAG CATAAATACA ATAATGAATT TAACTTGTCC
TACCGAACTG TTGCAAGGTA TGTAAGCCTT AAAAAGAAAG CTTTGTATCA AGACACTGAT
GGATATATAC CTTTGGAACA CCCTACTGGT GAGGCACAGG TTGATTTTGG CAGAGCTGCT
TTTTTTGAAA ACGGTATTAG GTATGAAGGG TATTATGTTA CCATGTCGTT TCCATACAGC
AATGGAGGGT ATATACAGCT TTTCAAAGGT GCTAATATAG AATGCCTATT ACAAGGAATG
AAAAAGATTT TTGAACACAT GGGAAAAGTA CCGACATGTA TCTGGTTTGA CAATGATAAA
ACAATTGTCA AAAAAATATT TGCTAATGGA GAAAGAAAAG TTACTGAAGC TTTTGCACGA
TTCCGCATGC ATTACGGCTT TGAAAGTAAT TTTTGTAATC CAAGCAGTGG GCACGAAAAA
GGTCATGTTG AAAACAAGGT TGGATATTCA AGAAGAAATA TGCTAGTGCC CATACCGAAG
TTTAAAGATA TTGTAGAATT TAACCGTCAA TTGCTTATCC AATGTGATGA AGATATGCAG
AGGGAACACT ACAAAAAGAA TGTATTCATC AATATTTTAT TTGAAGAGGA CAAAAAAGCG
ATGCGGGATA TTCCAAAAGC TGAATATGAA ATATACCGCA TCGAAAAGTT AAAGTCGGAT
AAATATGGCA AACTGAACTT TGACAACAGA AAATACTCTT CCGGGCCTCA ATATGCCCAG
AGAGAATTAA TGATAAAAGC AGATGCCTTC TCGGTTGCAA TTATGGATGA ACAGTACAAT
ACAGTTCAGG TACATAAACG TTTGTATGGA GAAGAGAAAG AGTCAATGAA GTGGGGGCCA
TATCTTGAGC TAATGAGCCG TAGACCAACT GCTTTAAAGT ATACCGGTTT CTTCCGGGAG
TTGCCGCAAA CTCTTCAGGA TTATCTTACA GTATGTGACT ATGAACAGAA AAAAGGTGCT
TTGCGACTAT TAGTGAAGAT GTTGGAACAA AGCGAACTTG ATATAGCCAT TGAAGCCTTT
AGATTCTGTA TCGAGAGGGG AATAAAAGAT TTAGACAGCA TATGGGCAAA ATATTACACT
ATGGTCTGTA CACACATACA AGTTCAAGAT GTTTTACTCA ACACTAAGAC TCCTGATGTT
GTGCCTTACA CTGTGGATAA TAGCATATAT GATAACCTGC TGGCTGGGGG TGTCCAATAT
GTATGA
 
Protein sequence
MLTMTDIKYI KDLFEKKGLS LREITRVTGH NFRTVRKYID KEDWSQPLVN RTRESLINKY 
KADIDEWLES DVDAPRKQRH TAKRIFNKLK HKYNNEFNLS YRTVARYVSL KKKALYQDTD
GYIPLEHPTG EAQVDFGRAA FFENGIRYEG YYVTMSFPYS NGGYIQLFKG ANIECLLQGM
KKIFEHMGKV PTCIWFDNDK TIVKKIFANG ERKVTEAFAR FRMHYGFESN FCNPSSGHEK
GHVENKVGYS RRNMLVPIPK FKDIVEFNRQ LLIQCDEDMQ REHYKKNVFI NILFEEDKKA
MRDIPKAEYE IYRIEKLKSD KYGKLNFDNR KYSSGPQYAQ RELMIKADAF SVAIMDEQYN
TVQVHKRLYG EEKESMKWGP YLELMSRRPT ALKYTGFFRE LPQTLQDYLT VCDYEQKKGA
LRLLVKMLEQ SELDIAIEAF RFCIERGIKD LDSIWAKYYT MVCTHIQVQD VLLNTKTPDV
VPYTVDNSIY DNLLAGGVQY V