Gene Cthe_1072 details

Gene Information

Locus tag: Cthe_1072
Symbol:
ID: 4811370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1279894
End bp: 1281090
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 40%
IMG OID: 640106494
Product: putative stage IV sporulation YqfD
Protein accession: YP_001037497
Protein GI: 125973587
COG category:
COG ID:
TIGRFAM ID: [TIGR02876] sporulation protein YqfD
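
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption, although it matches the listed values) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1279894, 1281090)
print(length)                  # 1197, matching "Gene Length"
print(protein_length(length))  # 398, matching "Protein Length"
```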


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0129367
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGATAT TTAGGCTATG GAATTATATA AGAGGATATG TTATTATATT TGTTGAAGGA 
TATTTCCTGG AGAAGTTTGT GAATATATGT ACCAGAAGAC AAATTTTGCT GTGGGATATC
CAAAGGGACA GAAACAGCAA AATGACGCTC AAAGTCAGCA TCCGGGGCTT TAAGATGTTA
AAGCCCGTGG CAAAAAAGAC GGGCTGCAGG GTAAAGATAC TTGAAAAAAG AGGCTTGCCT
TTTTTGCTAA ACAGATACCG GCACAGGAAA ACTTTTTTAC TTGGTGCCGC AGTATTTGTT
GTGTTATTTT ATATAATGAC ATCCTTTGTG TGGAGTGTTG AAGTTGTCGG TAATAAAAAG
ATTGAAACGG ACGGTATTTT AAAATGCCTT GAAAAATACG GGGTAAAGCC CGGAGTGCTT
AAATACAGGA TAAACCCCGA GGAAGTTGCA AACGGTGTGA TTTTGGACAT AGACGGGCTT
TCCTATGTGA ATGTGCTGGT AAGAGGTACA AAAGTAAAAG TGGAAGTGGC CGAGGGTGTC
AAGCGTCCTT CGATTATACC TTTGAATGTG CCCTGCGATA TTGTGGCCAA GAAGGACGGC
GTAATAAAGT CCGTCATTGT CAAGATTGGC CAGGCGCAGG TCAAGGAGGG AGACACGGTA
AAAAAGGGAC AGCTTCTTGT ATCGGGAAGC ATACCGATAA AGGGAGCTGA AGACAACCCA
AAAAGAGTGC ATGCGATGGC GGAAGTTCTT GCCAGGACAT GGTATGAAGG AAGGCAGCCG
GTAGAGCTTA AAGCCGTTGA AAAAATAAGG ACCGGCAGAA AAAAGGACAA TGTAACTTTG
GTTTTGTTTT CGAAAAAAAT TAATTTGTTT CATAAAGAGA TAGATTTTAA AGATTTTGAA
AAGGTGGAAA TAAAAAAGAA TCTTTCAATA GGTGAAGAAT TTGTTCTGCC CTTTGGGCTT
GTTATTGAAA GATATTATGA AAATGATTTG GTGGAGGCCG ATATTTCTTT GGAAGATGCA
AAAGAGAATG CCGCAGGCAT TGCATACAGG AAAGCCGCGG AAAATATCCC CGAAGGTGCC
ACGATAGTTG ACAAAAGGGT TAATTTTATT GAGAATGAAA ATGGGGAAAT TATTGCGGAT
GTTATTATAG AATGCCTGGA GGATATTGGA GTAGCAAAAG AGAATGGAGG AGAATGA
 
Protein sequence
MLIFRLWNYI RGYVIIFVEG YFLEKFVNIC TRRQILLWDI QRDRNSKMTL KVSIRGFKML 
KPVAKKTGCR VKILEKRGLP FLLNRYRHRK TFLLGAAVFV VLFYIMTSFV WSVEVVGNKK
IETDGILKCL EKYGVKPGVL KYRINPEEVA NGVILDIDGL SYVNVLVRGT KVKVEVAEGV
KRPSIIPLNV PCDIVAKKDG VIKSVIVKIG QAQVKEGDTV KKGQLLVSGS IPIKGAEDNP
KRVHAMAEVL ARTWYEGRQP VELKAVEKIR TGRKKDNVTL VLFSKKINLF HKEIDFKDFE
KVEIKKNLSI GEEFVLPFGL VIERYYENDL VEADISLEDA KENAAGIAYR KAAENIPEGA
TIVDKRVNFI ENENGEIIAD VIIECLEDIG VAKENGGE
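
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the first line of the gene sequence so it can be compared with the start of the protein sequence. It uses only the Python standard library, and the helper names are hypothetical. The codon table follows the standard NCBI amino-acid ordering, which translation table 11 shares with the standard code for sense and stop codons. The sketch does not handle the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO))


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq_text.split()).upper()


def gc_content(seq_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq_text)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = "ATGTTGATAT TTAGGCTATG GAATTATATA AGAGGATATG TTATTATATT TGTTGAAGGA"
print(translate(first_line))  # MLIFRLWNYIRGYVIIFVEG
```

The printed residues match the first two blocks of the protein sequence (MLIFRLWNYI RGYVIIFVEG). Passing the full gene sequence to `gc_content` would give a figure to compare against the GC content field.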