Gene Cthe_0296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0296 
Symbol 
ID4808514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp370827 
End bp372302 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content39% 
IMG OID640105707 
Producthypothetical protein 
Protein accessionYP_001036727 
Protein GI125972817 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000491065 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAATA CTTTTAGATG GACATTGAAC AAACTGGTTG TTACAGGAGC ATGTGCGTAT 
GTTCTTTTAA TTCTGCTGTT TATGTTAAAT TCCGGTTTGG TTCGGGTGTC CGGGAATGTT
TTGCTGAATA AGTCTTTGGA TGTGTTGTTT TTCACGGTGG CGGCAGTTGT AGGTTACGGA
GTGCTACTGC TCTTTGAAAA AGTTTTAAAT TTGCGTATGT TGAATAAGGA GTATGCAAGG
ATACTGATTG TGGTTTTAAT GACCCTGATT ACGAGGCTTG TTTGGATACA TATTGTAGAT
ATTACTCCCA AAAGTGATTT TGAGCTGTAC AATACTTTGG CGGAGGCATT CTCGCGGGGA
GAGGCCGCCG GAGGCAAGTA TGTGGCTCTT TTTCCCCATA CATTTGGTTA TCCCTTTATA
CTGGCAACGG TATATCGAAT TTTTTCTCCT GACAAGTATT TTGCTTTGCT TTTAAATATT
TTGTTTGAAG CAGGCACGGG TGTTGTTTTA TATTATCTTG GGAAGATGGT CTCAAACTGG
AAAACAGGTT TCTTTGCAGG CATTATATGT GCTTTATGGC CTTCCCATGT GTTTTATTCT
TCAATTGTTT GCACCGAGCC GCTGTATACA TTATTAATGG CGCTTTTGAT ATTTGTATAT
TTTAAGGTGT CAGTCAAAAA TAAAAGCTTA TTGCATTCCT GCGTTTTGTA TCTTTTGCTC
GGGTTTTTAT GTGCGGCGGC CAATGCGATC CGTCCCATGG GTACTCTGCT TGTTGCAGTT
CTGGGAATAA CTGAGGTTGT GAGAATAATT AAGAAAAAGG AGGGGTTAAA ACAAAGTTTT
GCAGGATTTG TGCCCTTTGC CGTATTCTTA ATAGCATATT TCTCTTTTGT TAATTTGACA
GGCATGTATG TTTCCTATAA AATTGGCTAC AATACGGCTA AAAATCCCAT AGGTTTTAAT
ACTTATGTCG GCGCCAATAT TAATTCCAGC GGGATGTGGA ACCAGAGTGA TGCCAATGTG
CTAATGGATT TTATGAAGCA GGAGCCTTTT GATGCCCAAA AAATACACGA ACAGTTGCTC
AATCTGGCAA TTCAAAGGGT GAAAAGCCAG GGAACGGGTA ATTTGAAGCT TGTAATAAAA
AAGAATATGA TCATGTGGGG CAGGGACGAT GAAGTTGTAA CCTATATGAT TGCCGGAAGC
GGTGATAAAA CCTCATCGTT ATTGGAAGTA AAGAACTCTG AAGGTCTTTT GAGGTATATT
TGCAACTTCT ATTACTACAT GATAGTTATA TTGGCATTTG GTGGCCTTTT GAAGCAGTGT
GCTAAAGAGG ATAATCCAAT CCTTATGGCT TTACTGCTGC TGTTTCTTGG AATTGGTGCT
ATACATACCG TTGTTGAGGT ACATGGCAGG TATCATTATT CATCTATGGC TGTTTTTGCC
ATACTTGCCG GAATAAACAA TTTTAAAATT AAATAG
 
Protein sequence
MKNTFRWTLN KLVVTGACAY VLLILLFMLN SGLVRVSGNV LLNKSLDVLF FTVAAVVGYG 
VLLLFEKVLN LRMLNKEYAR ILIVVLMTLI TRLVWIHIVD ITPKSDFELY NTLAEAFSRG
EAAGGKYVAL FPHTFGYPFI LATVYRIFSP DKYFALLLNI LFEAGTGVVL YYLGKMVSNW
KTGFFAGIIC ALWPSHVFYS SIVCTEPLYT LLMALLIFVY FKVSVKNKSL LHSCVLYLLL
GFLCAAANAI RPMGTLLVAV LGITEVVRII KKKEGLKQSF AGFVPFAVFL IAYFSFVNLT
GMYVSYKIGY NTAKNPIGFN TYVGANINSS GMWNQSDANV LMDFMKQEPF DAQKIHEQLL
NLAIQRVKSQ GTGNLKLVIK KNMIMWGRDD EVVTYMIAGS GDKTSSLLEV KNSEGLLRYI
CNFYYYMIVI LAFGGLLKQC AKEDNPILMA LLLLFLGIGA IHTVVEVHGR YHYSSMAVFA
ILAGINNFKI K