Gene Cthe_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2093 
Symbol 
ID4810953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2488992 
End bp2490050 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content44% 
IMG OID640107500 
Producthypothetical protein 
Protein accessionYP_001038493 
Protein GI125974583 
COG category[S] Function unknown 
COG ID[COG3583] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00901653 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCGCAGC TTGTTGAAAG TATTAAAAGG CGTGTTTCGT TCAAACTGTT GGCTTGTTTG 
GCTGTTGCAT TCGTTGTTGC GGGAATTGCA GGATGGGGAA CATATTACAG CTGTCAAAAA
GAAGTAGTGA TAAACCTGGA TGGCGAGCGG CTTGTTGTAA AAACAGTAAA ATCCACCGTG
AAAGAAGTAT TGAAGCAAAG CGGGATAAAT ATAACCGAGG ATGACTATGT TAGTGTTCCG
CTTGATACGA AACTTAAGAG TAAAAAAGGC AACGTGATAG ATATAAAAAA AGCAGTACCT
GTTACAGTAA TTGCCGATGG ACAGGAATTC AAGCTTATGA CTTCCAAGAA GACAGTCCGG
GAAGCGTTGG AAGGAGAACC GGTCAATCTT GGACATTTGG ACCGGGTTGA AGGGGCCGGA
CTTGATGACG AAATAGTTGG AGGCATGAAG CTCAAAGTCG TCAGAGTAAA GAAGAAACTT
GTCAGTGAAA ACGAAATTAT CCCTTACAAC GTGATAAAAA GGGAAAACGG CAGCATGGAC
AAGGGAGATT ACAGAGTAAT CAAAGAAGGA AAAGAAGGAG TAAGAGAAAA GGTCTATGTG
GTTTCGTATG AGGACGGCAA AGAAGTGGGG AAACAGCTTG TAAGCTCGAC CGTTGTTTCA
GAACCGGAAA CCAGGATTGT GGAATACGGT ACTGTTCCGG TCTATATGAC GGCCAGAGGA
GAGAAATTCA GATACAAAAA GGTGCTGACT ATGAAAGCCA CAGCTTATAC CGCATCTTAT
GAGGACACAG GGAAAACCCC CGATCATCCG GAGTTTGGAA TCACTTATAC CGGAATTAGG
GCAAAGAAAG GTGTTGTTGC GGTGGATCCA AAGGTTATAC CTTTGGGAAC AAAGTTGTAC
ATAGAAGGCA TAGGAGGGAC GCCCGACTAC GGATTTGCCG TTGCCGCGGA CATTGGAAGT
GCTGTAAAGG GTAATGTCAT CGACCTTTAT ATGGACAGCA GGCAAGCCGT TAAACAGTGG
GGAGTAAAAA AGGTAAGAGT CTACATACTT TACGATTAA
 
Protein sequence
MSQLVESIKR RVSFKLLACL AVAFVVAGIA GWGTYYSCQK EVVINLDGER LVVKTVKSTV 
KEVLKQSGIN ITEDDYVSVP LDTKLKSKKG NVIDIKKAVP VTVIADGQEF KLMTSKKTVR
EALEGEPVNL GHLDRVEGAG LDDEIVGGMK LKVVRVKKKL VSENEIIPYN VIKRENGSMD
KGDYRVIKEG KEGVREKVYV VSYEDGKEVG KQLVSSTVVS EPETRIVEYG TVPVYMTARG
EKFRYKKVLT MKATAYTASY EDTGKTPDHP EFGITYTGIR AKKGVVAVDP KVIPLGTKLY
IEGIGGTPDY GFAVAADIGS AVKGNVIDLY MDSRQAVKQW GVKKVRVYIL YD