Gene Cthe_0404 details


Gene Information

Locus tag: Cthe_0404
Symbol:
ID: 4808407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 504225
End bp: 505793
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 41%
IMG OID: 640105818
Product: type 3a, cellulose-binding
Protein accession: YP_001036835
Protein GI: 125972925
COG category:
COG ID:
TIGRFAM ID:
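
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(504225, 505793)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result is 1569 bp and 522 aa, matching the Gene Length and Protein Length fields.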


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0894635
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTTG GAGTGGTAAT AAAAATAAAA AGGAAGAAGG CCATAATTGT TACGGAAACC 
GGCGAATTTA AAGCTGTAAA TGCCAGAAAC GGTATGTTTT TGGGACAAAA GATTTTATTT
GATCAGCAAG ATGTTATTGA AAATAACAGA AATGGCATTG GTCTTGCATA TTCTGCAGCT
ATAGCGGGAA TGGTTGCTGT TTTTGTATTC ATGTTTACAT ATTTCGGCTT GCATAATTTT
AATGGCACTT TTGCATATGT TGACGTGGAT ATAAATCCAA GTGTCGAATT TGCGGTAAAC
AGGGACGGTA TTGTTGTAAA TGCCGAACCG CTTAATGATG ATGGGAGAAA AGTACTGGAA
GAGTTGATAT ATAAAGATGC TTTGCTGGAA GATGTGATTT TGGATCTGGT TGACAAGTCG
AGAAAGTACG GATTTATAGA AGATAATGAT AGGAAGAATA TCATATTGAT TTCGGCAGCG
TTAAACAGTG ATGAGCAGGA ACAAAGAAAT GACTTTGAAA AGAAGCTGGT TGACAATTTA
ATGCCGGAAC TTGAGAATTT GGATGTAAAT ATTGAAATGA GGTTTGTCAT TGCCTCAAAA
GAGCAAAGGA AGAAGGCACA GGAAAACAAA GTGTCCATGG GTAAGTATAT GATTTATGAA
ATGGCGAGAC GGCAAGGTGA AAAACTGACT TTGGAGTCAA TTATGTCAGA AACATTGGAA
AATTTACTTT TGGGTCAGGA CTTTGGTGTA ATTGAAACTG AGAAAACACC TGTGAATACA
CCGGTTAAAT CTACTGCTAC TCCGACGAAG GCGCTGGCTG CCGAGATTAC TCCCACAAAG
ACACCGGAAC AGGTTGTGAT GACGCCTGCA AATACGCCGG CTAAGCCTAC AGCTGCTCCA
ACAAAGGCAC CGGCTGCTGT GGCTGTGACC TCGGCAAAAA CACCGGAAAG AGCTACGACA
GTGCCTGTGA ATACACCGGT TAAACCTACG GATGCTCCGA CAAAATCACC GGCCACTGCC
ACAGCAACTG CAACCAGGGC ACCTGTAAAA GCTACAGCAA CACCTGCGAA GACACTCAAA
CCATCAGACA CTCCTGTAAA GACCCCGGAT GGTGAGCAGA GTGTCAAAGT GAGGTTCTAC
AACAATAACA CTTTGTCTGA AACCGGTGTA ATTTACATGA GAATAAATGT TATTAACACC
GGAAATGCAC CTTTGGACCT TTCGGATTTA AAACTAAGAT ATTATTACAC TATTGACAGT
GAGAGTGAAC AGAGATTCAA CTGTGATTGG TCGTCCATTG GAGCTCACAA TGTAACGGGA
AGTTTCGGAA AGGTAAATCC ATCTCGAAAC GGAGCGGATA CTTATGTTGA AATAGGATTT
ACAAAAGAAG CTGGAATGCT TCAACCGGGC GAAAGCGTTG AACTTAATGC GCGCTTTTCA
AAAACTGACA ATACACAGTA TAATAAAGCA GATGATTATT CATTTAATTC CCATTATTAC
GAATATGTAG ACTGGGACAG AATTACAGCG TATATTTCCG GCATTTTAAA ATGGGGAAGA
GAACCATGA
 
Protein sequence
MNLGVVIKIK RKKAIIVTET GEFKAVNARN GMFLGQKILF DQQDVIENNR NGIGLAYSAA 
IAGMVAVFVF MFTYFGLHNF NGTFAYVDVD INPSVEFAVN RDGIVVNAEP LNDDGRKVLE
ELIYKDALLE DVILDLVDKS RKYGFIEDND RKNIILISAA LNSDEQEQRN DFEKKLVDNL
MPELENLDVN IEMRFVIASK EQRKKAQENK VSMGKYMIYE MARRQGEKLT LESIMSETLE
NLLLGQDFGV IETEKTPVNT PVKSTATPTK ALAAEITPTK TPEQVVMTPA NTPAKPTAAP
TKAPAAVAVT SAKTPERATT VPVNTPVKPT DAPTKSPATA TATATRAPVK ATATPAKTLK
PSDTPVKTPD GEQSVKVRFY NNNTLSETGV IYMRINVINT GNAPLDLSDL KLRYYYTIDS
ESEQRFNCDW SSIGAHNVTG SFGKVNPSRN GADTYVEIGF TKEAGMLQPG ESVELNARFS
KTDNTQYNKA DDYSFNSHYY EYVDWDRITA YISGILKWGR EP
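
Both sequences are displayed in blocks of 10 characters separated by spaces, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It relies on the fact that translation table 11 shares its codon-to-amino-acid assignments with the standard genetic code (the tables differ in the permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G
# (first base varies slowest, third base fastest); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGAATCTTG GAGTGGTAAT A"))  # MNLGVVI
print(translate("GAACCATGA"))                # EP
```

The two fragments translate to the first seven and last two residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.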