Gene Cthe_2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2091 
Symbol 
ID4810951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2486389 
End bp2487687 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content41% 
IMG OID640107498 
Producthypothetical protein 
Protein accessionYP_001038491 
Protein GI125974581 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000589193 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAATA GATTTAAATA TCACAGGTCA TTTTTAATAT TTACCCACGA AGATTCAAGA 
AACGGAGAAG GGAGAGAACC GTCAGGATAT GTAAAAATAG AAATCAGAGA CGGCAGGGGA
AAGCTTTGCT GTCAGGTGTC AAACCTTAGG GAAAGCAACG ACGCAGTTTA TAAACTGTAT
TTAATAAATG TGGATGAAGC GGGACTTAAG GCGGCTTGCC CGGGAGTCAT TGATTTGGCG
AAAGGAAAAG GTGAATTGAT ATGGAATTTT GACCCCGCAA ATATTGACGG AACGGGAATG
AGTGTCTGTG ACATCAATGT TGCGGCAGTT ATCCTTGAGA ACAATAATGA ACGGTACATA
GACAATATTC TCTGCCCCCT TGCGGCGTAT AAAAACGGGA AAATTGAGTG GAGACAGAAA
ATGAAAAAGT TTTTGAACAA GCAAAGGGCG CAAGAAGCGG AAGAAACGGA AGAAAGACAA
GATTTTGAAG AAGCACAGGA AACCGGGGAA AGTCAAAGGA CCCAGGATAT CCCAAAAAAA
GAAGAGATGC AAAAAACGGA ACGTGCGCAA AGAACGAAAG ATGATGCAAT GCCGAAAACG
GGTTCTGAAA AGAGTGAAGA TGGCGGCAAA ATAGGGGAAG ATATTGCAAG TGGAAATGAA
AATATTGAGG ATAAAAATAA AAACATGATT GAAAGCGTTC AGAGAAAAGA GGACAATAAG
AGCGGTGCTG ATACCATTGT GGAAGAAAAC GGTCGTGGCG GGGTTTCTGA CGATGATGAG
AGAAAAACAA AAATTGAGGC TGAGAATGAA GTTGAAGATA AGAATGAAAA AGAAAATGAA
AATGAAACTG AAAGTAAAGA TGAGTTTGGG AATAAAAGAA AAGCCGGCAG TGAAATAAAT
TTTGAAGAAT TGGTGGCAAA ATTTGATAGA TGCTTTGAAA AGTGTAATCC GTTTATGTCC
GGCAGAAAAG ATTACAGGTG GTGGAAGATA GCAAGTCCGG TTCATTTGAA CAACATACTG
TACCAAATGA AAGTAGATGT ACCCATCCTT TTTAATCCCC TGGTGCTTAT GGCCCACTTT
AAATACAGGC ATTTGATTGT GGGTACTTAT GAAGACAAGG CCAGAAATCT TCGTTATATT
GTCTGCGGTG TTCCCGGAGT GTATTGGGTT GACGAGAAGC CTTTCGGCAA AATATGCAGG
TGGGCCCAGG TTGACGGAAA CGTGCCGAAA TACGGTGCAT TTGGATATTG GCTTGTTTAT
ATAAATCCCA ATACAGGCGA GATATTGAAC GTTGGCTAG
 
Protein sequence
MDNRFKYHRS FLIFTHEDSR NGEGREPSGY VKIEIRDGRG KLCCQVSNLR ESNDAVYKLY 
LINVDEAGLK AACPGVIDLA KGKGELIWNF DPANIDGTGM SVCDINVAAV ILENNNERYI
DNILCPLAAY KNGKIEWRQK MKKFLNKQRA QEAEETEERQ DFEEAQETGE SQRTQDIPKK
EEMQKTERAQ RTKDDAMPKT GSEKSEDGGK IGEDIASGNE NIEDKNKNMI ESVQRKEDNK
SGADTIVEEN GRGGVSDDDE RKTKIEAENE VEDKNEKENE NETESKDEFG NKRKAGSEIN
FEELVAKFDR CFEKCNPFMS GRKDYRWWKI ASPVHLNNIL YQMKVDVPIL FNPLVLMAHF
KYRHLIVGTY EDKARNLRYI VCGVPGVYWV DEKPFGKICR WAQVDGNVPK YGAFGYWLVY
INPNTGEILN VG