Gene Cthe_0975 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0975 
Symbol 
ID4811269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1164781 
End bp1165932 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content40% 
IMG OID640106393 
Productstage V sporulation protein E 
Protein accessionYP_001037400 
Protein GI125973490 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0772] Bacterial cell division membrane protein 
TIGRFAM ID[TIGR02210] rod shape-determining protein RodA
[TIGR02614] cell division protein FtsW
[TIGR02615] stage V sporulation protein E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000101202 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCTG CAGCAATAAA ATCTTCAACA ATGAAATCTG CAATTACAAC TAAAAAGCCG 
TTTGATTTTT TAATATTTCT GACTGTGTTG ATAATGCTTA CCATTGGTTC TATAATGGTA
TTTAGTTCCA GTGCTCCCCA TGCATATAAC TATATGAAAG GTGATTCGTA TCATTTTTTA
AAGAAGCAGT TGCTGTATGT TCCTGTGGGA CTTTTTGCGA TGTTTGTCAC AATGAATATT
GACTACAGAA AACTTGGTAA ATTGTCGCCC ATTATCATGC TTGTAAGTTT AGGGATGCTT
TCTGTTGTGT GGATTGACGG AATCGGTGCC ACCCGTAACA ATGCAACCCG GTGGTTTGAC
CTTGGATTTG TTGATTTTCA GCCTTCTGAA TTTGCCAAGC TGGCTATGAT ACTGTTTTTG
TCTTACAGTC TTTCTAAAAG GCAGGATAGC CTGAAGTACT TTTTCAGGGG TCTTGTTCCA
TACCTGATAC TGATAGGTAT TCATGCATTG CTGCTGCTTT TGGAACCCCA CATGAGTGCG
ACAATTATTA TAGGTTTGGT ATCGTGTGTA ATTCTTTTTT GCGCAGGAGC AAAGATAAAA
CATTTTGTAT TAATGGGAGT GCCTGCTGTT GCGGCGGTAA GTTATTTGAT TTTTACTTCC
GAATACAGGA TGAAAAGAGT TTTATCCTTT TTAAATCCGT GGGAAGACCC AAAAGGAGCA
GGATGGCAGG TTATACAATC CCTTTATGCC ATTGGTTCCG GCGGATTGTT TGGAAGAGGA
TTGGGAAACA GTCTTCAGAA GTTCCTTTAT ATTCCTGAAC CGTATAATGA CTTTATTCTG
GCGGTATTGG CCGAAGAATT GGGATTTATA GGAGTTGCCC TGGTACTTCT TTTGTTCCTT
ATCTTTATAT GGCGTGGAGT TAAAGTTTCC ATGAACGCGC CTGACGTTTT TGGAAGTCTG
GTCGCCATAG GAATAACTTC GCTGATTGCT TTTCAGGCGA TTATAAATGT TGCGGTTGTT
ACATCTTCCA TGCCGGTTAC GGGAATGCCC CTTCCGTTCT TCAGCTACGG AGGAACCTCT
CTTATTTTCC TGATGGCAGG AGTGGGCATA CTTCTTAATA TTTCCAAATA TGCAAATTAT
GAAAGAATCT GA
 
Protein sequence
MKAAAIKSST MKSAITTKKP FDFLIFLTVL IMLTIGSIMV FSSSAPHAYN YMKGDSYHFL 
KKQLLYVPVG LFAMFVTMNI DYRKLGKLSP IIMLVSLGML SVVWIDGIGA TRNNATRWFD
LGFVDFQPSE FAKLAMILFL SYSLSKRQDS LKYFFRGLVP YLILIGIHAL LLLLEPHMSA
TIIIGLVSCV ILFCAGAKIK HFVLMGVPAV AAVSYLIFTS EYRMKRVLSF LNPWEDPKGA
GWQVIQSLYA IGSGGLFGRG LGNSLQKFLY IPEPYNDFIL AVLAEELGFI GVALVLLLFL
IFIWRGVKVS MNAPDVFGSL VAIGITSLIA FQAIINVAVV TSSMPVTGMP LPFFSYGGTS
LIFLMAGVGI LLNISKYANY ERI