Gene Cthe_0813 details


Gene Information

Locus tag: Cthe_0813
Symbol:
ID: 4810431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 982146
End bp: 983486
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 40%
IMG OID: 640106230
Product: SpoIVB peptidase
Protein accession: YP_001037241
Protein GI: 125973331
COG category:
COG ID:
TIGRFAM ID: [TIGR02860] stage IV sporulation protein B
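
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 982146
end_bp = 983486

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # prints: 1341 446
```

Under these assumptions the computed values agree with the listed Gene Length (1341 bp) and Protein Length (446 aa).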


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAATTTA TAAAACCCGC TAAAAAAAAA CTTATAATTT TTTTTAGCGC TTGTGCCGCA 
ATTTTGAGCA TAGCATATAT AAGATTAATT TCCGTCTTTC CAAACCAGCT GACATTGTTT
GAAAACCAGG AATATATTTA TAAATTCAAA AGCCCGCTTC TTGTAAACCT TGTAAACTTT
AAATCAGATA ATAATGACAT ATTGATATTT GACAGCGAAG ACATAAAAAT CAATAATGAT
GATTATAAGG ATACCAAAAA CAAAGTAATG TTCAAAGCCA GCAAACTTGG CAGAACAAGC
CTTAGCTTAA AGCTTTTGGG ACTCATTCCC TTAAAAACCA TGTATGTTGA TGTTGTCCCG
TATAAAGAGG TTGTGGCTTG CGGAAATACA GTAGGGGTGA AAATAAAGGT GGACGGTATA
CTCGTCATTG GATTGTCGGA TGTTGAAACC CCGGATGGAA GAAGGTTGAT TCCGGCCAGA
GATTCCGGAC TGAAACCCGG TGACCTTATT GTTGAAGTGA ACAACAACAA GGTTGATACT
GCATATGATT TAATGAATGA AGTAGAAAAT AGCATGGGCG AAAACATATG GGTAAAATAT
AAGAGAGGAA ACAGCTACAA CAATACAAAA GTAACGCCGG TCAAATCGGC TGAAGACAAT
AAGTACCGTG TCGGAATGTG GGTGAGAGAC AGCACGGCAG GAATCGGAAC GTTGACATTT
TACGACCCTG TGACTAAAGG CTTTGGGGCT TTGGGACACG GAATCACCGA TATTGATACG
GGAGCCATTA TGCCCGTTCA AAGAGGTGAG CTTGTCGAAT CAAATATTTT GACCGTAAAA
AAAGGTACCA AAGGCAATCC CGGGGAACTT AAAGGTGTAT TGATAGAAGA CAGCGGTGTT
CTTGGAACAA TAGTGAAAAA CAGCCATTAT GGCATATATG GTACTTTGAA TGACGCGGCG
TTGGATAAAT TTCCGAACGT AAAATATCCT ATAGCTTTGA GAAACGATAT AAAGGTGGGA
CCTGCCACCA TACTTGCCAA TATAGACGGC AAAAAAGTTG AAGAATACAG TATTGAAATT
GAAAAAGTTT CAAGAAAAAG CGCCAATGGT TTGAAGGGAA TGGTTATCAG AGTGACTGAT
GACAGACTTC TTGAGGCAAC GGGAGGTATT GTTCAGGGAA TGTCAGGCAG CCCTATTTTA
CAGGATGGCA AACTGGTGGG AGCTGTAACC CATGTACTGG TGAATGACCC TGCAAGAGGC
TATGGCATAC TGATTGAGTG GATGATTAAA AACATGACTG ATGCAAATTT GCAAAATGTT
GAAATGGCCA ATGCTTCTTA A
 
Protein sequence
MKFIKPAKKK LIIFFSACAA ILSIAYIRLI SVFPNQLTLF ENQEYIYKFK SPLLVNLVNF 
KSDNNDILIF DSEDIKINND DYKDTKNKVM FKASKLGRTS LSLKLLGLIP LKTMYVDVVP
YKEVVACGNT VGVKIKVDGI LVIGLSDVET PDGRRLIPAR DSGLKPGDLI VEVNNNKVDT
AYDLMNEVEN SMGENIWVKY KRGNSYNNTK VTPVKSAEDN KYRVGMWVRD STAGIGTLTF
YDPVTKGFGA LGHGITDIDT GAIMPVQRGE LVESNILTVK KGTKGNPGEL KGVLIEDSGV
LGTIVKNSHY GIYGTLNDAA LDKFPNVKYP IALRNDIKVG PATILANIDG KKVEEYSIEI
EKVSRKSANG LKGMVIRVTD DRLLEATGGI VQGMSGSPIL QDGKLVGAVT HVLVNDPARG
YGILIEWMIK NMTDANLQNV EMANAS
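
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, which accepts TTG as an alternative start codon and reads an initiator codon as methionine. The sketch below illustrates this on the first 60 bases and the last 21 bases of the gene sequence. The `translate` helper and its `first_is_start` parameter are hypothetical names written for this illustration, not part of any database tool.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in NCBI order (bases T, C, A, G; third position varies fastest).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product("TCAG", repeat=3), AMINO)}

# Start codons permitted by table 11; an initiator codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, first_is_start=True):
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")  # initiator codon, regardless of usual meaning
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases and last 21 bases of the gene sequence in this record.
head = "TTGAAATTTA TAAAACCCGC TAAAAAAAAA CTTATAATTT TTTTTAGCGC TTGTGCCGCA"
tail = "GAAATGGCCA ATGCTTCTTA A"

print(translate(head))                        # MKFIKPAKKKLIIFFSACAA
print(translate(tail, first_is_start=False))  # EMANAS
```

The two outputs match the first 20 and last 6 residues of the protein sequence listed above.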