Gene Cthe_0052 details


Gene Information

Locus tag: Cthe_0052
Symbol:
ID: 4808747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 67772
End bp: 69031
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 43%
IMG OID: 640105461
Product: hypothetical protein
Protein accession: YP_001036486
Protein GI: 125972576
COG category: [S] Function unknown
COG ID: [COG1306] Uncharacterized conserved protein
TIGRFAM ID:
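
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(67772, 69031)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```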


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.115032
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGCCA AAAATGCACA AACAATTTTA AGCAACAAAT GTTTTTCAGT AATTTTAGTG 
GTATTGGTTT TAGTTTTTTC TTTAGCAGCC TGCAAAGATA CAGGTGTCCA GGTAAACAAT
CCGGCACCTA CTGCACAAGC GACCGATAAT GTAGAAGCAA ACAATGGCGG TAATACGGAC
GGCGAAGCTC CTGAAACTTC AGAACCGAGT GAAGAAAGTC CGGTACCGAA CAATTCCGGC
GAGGATGCCA AACCTGTAAA GAAGGATATC AAGGTAAAAG CATTGTATCT TACAGGATGG
ACTGTGGGAA GTGATGAAAG ACTGCAGCAT TATGTTGATC TTGCAAACAG GACGGAGATT
AATGCCTATG TTGTTGACAT CAAGGATGAT GACGGATATG TCGGTTATGA GTCGAATATA
CCGGCAGTGA GAGAAATAGG CGCATGGAAG AGTAAGTACA ATGTGGACAA AGTATTGAAA
ACCTTCCATG ATAACAATAT TCATGTTATT GGAAGATTGG TGTGCTTTAA GGACCCTGTT
TTATCTTCCA AAAAGCCGGA GTTGGCGGTT AAGAGCGTAA ATGGAGGTTC CTGGCGTGAC
AATCATAATC TTACATGGCT GGACCCCTAT AACAAGGATT CATGGCCTTA TTTGATTGAG
ATAGCCAAGG AAGCGGTTGA AAAAGGTTTT GATGAAATAC AGTTTGACTA TATCAGATTT
CCCAATGACG GAAGCAAAAA GAGCATGAGC TTTAATACCG GCGGCAAGGA AAAGCACGAA
ATAATAAATG AGTTTTTGGC TTATGCCAGA GAGCAGCTTC CGGGAGTTGT CCTGTCCGCG
GATGTGTTTG GGATAATACT GGAGAGTCCG GCAGATACCG AAGATATCGG TCAATATCTT
GAAAAGATAG TAAAAGATGT GGATTATATT TCGCCGATGG TCTATCCGTC CCACTATGCC
GTTGGACAGA TAGTTAACGG TGTTCAGTTC ATGAAGCCGG ATCTTGATCC TTATGGAGTG
GTGTATCAAA GTCTTGTTAA GTGCAATAAC AGACTGGCAC AGGTGGAAGG CTACAAAGCC
GATGTAAGGC CATATATCCA GGACTTTACT GCATCGTGGC TTGGCAAGGG TTATTACCAG
AGTTACGGAC CCGAGCAGCT AAGACAGCAA ATTCAGGCCG TTTATGATGC AGGCTATGAG
GAATGGATCT GCTGGGATGC GAACAATACG TACTCGGAAG AAGCATTCTT GAAAGAGTAA
 
Protein sequence
MSAKNAQTIL SNKCFSVILV VLVLVFSLAA CKDTGVQVNN PAPTAQATDN VEANNGGNTD 
GEAPETSEPS EESPVPNNSG EDAKPVKKDI KVKALYLTGW TVGSDERLQH YVDLANRTEI
NAYVVDIKDD DGYVGYESNI PAVREIGAWK SKYNVDKVLK TFHDNNIHVI GRLVCFKDPV
LSSKKPELAV KSVNGGSWRD NHNLTWLDPY NKDSWPYLIE IAKEAVEKGF DEIQFDYIRF
PNDGSKKSMS FNTGGKEKHE IINEFLAYAR EQLPGVVLSA DVFGIILESP ADTEDIGQYL
EKIVKDVDYI SPMVYPSHYA VGQIVNGVQF MKPDLDPYGV VYQSLVKCNN RLAQVEGYKA
DVRPYIQDFT ASWLGKGYYQ SYGPEQLRQQ IQAVYDAGYE EWICWDANNT YSEEAFLKE
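
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the Python below strips the spacing, translates codons until the first stop, and computes a GC fraction. The codon table is built from the standard NCBI base ordering (TCAG); translation table 11 assigns the same amino acids as the standard table and differs only in which codons may act as start codons, which this sketch does not handle (the gene here starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGTCAGCCA AAAATGCACA AACAATTTTA AGCAACAAAT GTTTTTCAGT AATTTTAGTG"
print(translate(first_line))  # MSAKNAQTILSNKCFSVILV
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein length and GC content to be checked in the same way.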