Gene Cthe_1756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1756 
Symbol 
ID4810186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2075853 
End bp2077019 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content46% 
IMG OID640107169 
Productputative virion core protein (lumpy skin disease virus)-like protein 
Protein accessionYP_001038170 
Protein GI125974260 
COG category[S] Function unknown 
COG ID[COG4260] Putative virion core protein (lumpy skin disease virus) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000281872 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTTGA TAAACTTTAT CAAAGGACAA TTCATTGAAG TTATCGAATG GACGGATTCT 
TCTGCAGATA CCATAGTTTA CCGCTTTCCT GTAGCAAACA AGGAAATAAA AATGGGAGCC
CAGCTAACTG TGAGAGAATC GCAGGTGGCT ATATTCATAA ACGAAGGTCA GTTGGCGGAC
GTTTTTGAAC CTGGAAGATA CGAACTTACG ACTGAAAACT TGCCGATTCT GACAAAGCTG
AAATCATGGA AGTACGGATT TAATTCACCT TTTAAAGCAG AAGTGTATTT TATAAACACC
AGACTGTTTA CCGAACAGAC TTGGGGTACC CAGTCACCCC TTCCGATGTT GGATCCTATG
TTCGGCCCAA TTGAGGTCGG AGCAAGGGGA ACATATGCCT TCAGAGTGTC AGACCCGGTA
AAATTCTTAA AAGACGTGTC AGGAACAAGG GGTACCATGG CAACTCAGGA TTTGACCAGG
ACTTTAAGGT CATATATTAT GACGTACCTT AAAGATACGG TTGCAGAATC CAAAAAGTCC
TTCTTTGAGA TGCAGAGCAA CATGCCTGAG TTTGCCGAAA TGGTTAAGGT GCATGCAAAG
AGCAAATTTG AAGCTCTTGG GCTTGAACTG GTTGAATTTA CGATTGAATC GCTGATTCTT
CCGGAAGAGC TCAGAAAAGC TTACCAGGAA GGCGCACAGA TAAATTTGAT GGGCGGTATG
GACACTTACG CCAAAAAGAG AGCTCTCGAC GCAATGAACT CAGCCGCGTC AAACCAGGGG
GGTGGCTCTT TTGCAAGCAT GGGTGCCGGA ATGGGAGCCG GAGCTGCGAT AGGCAATATA
ATGGGACAGG TTTTTGGCGG AGGTTTTCAG CAGAATCCCC AATACTATCA GCCTCAGCCA
CAGCCTCAGG TGCAGCCTCC GCAGCAGAGT GTGGTGTGTC CTTCGTGTAA AACCGGTGTG
CCCGCCGGGA CAAAATTCTG TCCAAACTGC GGCAAGTCTC TTGTGGAAGA GAAGGACAGG
TGTATTAAAT GCAATCATGA GATCAGCAAA GGAGCAAAGT TCTGTCCTGA ATGCGGAGAA
AAGCAGGAGG TAGTGTGCAA CAACTGCGGT GCGAAGCTTT CACCGGGAAC AAAATTCTGT
TCCGAATGCG GAACAAAGGT AGAGTAA
 
Protein sequence
MGLINFIKGQ FIEVIEWTDS SADTIVYRFP VANKEIKMGA QLTVRESQVA IFINEGQLAD 
VFEPGRYELT TENLPILTKL KSWKYGFNSP FKAEVYFINT RLFTEQTWGT QSPLPMLDPM
FGPIEVGARG TYAFRVSDPV KFLKDVSGTR GTMATQDLTR TLRSYIMTYL KDTVAESKKS
FFEMQSNMPE FAEMVKVHAK SKFEALGLEL VEFTIESLIL PEELRKAYQE GAQINLMGGM
DTYAKKRALD AMNSAASNQG GGSFASMGAG MGAGAAIGNI MGQVFGGGFQ QNPQYYQPQP
QPQVQPPQQS VVCPSCKTGV PAGTKFCPNC GKSLVEEKDR CIKCNHEISK GAKFCPECGE
KQEVVCNNCG AKLSPGTKFC SECGTKVE