Gene Cthe_1168 details


Gene Information

Locus tag: Cthe_1168
Symbol:
ID: 4810120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1392636
End bp: 1394024
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 43%
IMG OID: 640106590
Product: hypothetical protein
Protein accession: YP_001037593
Protein GI: 125973683
COG category: [S] Function unknown
COG ID: [COG2078] Uncharacterized conserved protein
        [COG3885] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00296] uncharacterized protein, PH0010 family
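
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1392636
end_bp = 1394024

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1389
print(protein_length)   # 462
```

Both results agree with the Gene Length (1389 bp) and Protein Length (462 aa) fields.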


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.773328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAGAA TAATAAGTTC TTATATTTTT CCTCATCCTC CTTTGATTGT ACCTGAGATT 
GGCAAGGGAG ATGAGAAGGG CGCAATTAAA ACTATAGAGG CATGTGAAAA AGCGGCTGAA
CAAATAAGAA AAGAGAAGCC TTCAACCATT ATTCTTACGA CTTCCCACGC GCCTTTGTTT
GAGGATTATA TTTTCATTAA TGACCATAAA ACGCTGAAAG GCAACTTTTC AAGATTTGGA
GCCCGTAAGG TGGAGCTTGG TTTTGAGAAT AATTTAAAAA TGGTGGAGTC AATTATTGAG
TTTGCGAAAA AAGAAGGCTT TGATGCCGGA GGAATCAGCG AAGGTATAGG CAGAAGGTAC
GGGATTTCCG GGGAACTGGA CCATGGAGCG CTGGTGCCTC TTTATTATAT AAGCCGGGTG
TATTCGGATT TTAAACTTGT TCATGTTGCA ATGTCCACAC TTACTTTGGA GGAACATTAC
AAGTTTGGTA TGTGCATAGG CGAAGCCGTC AGAAATTCAG ATGAAGACGT GGTATTTGTC
GCAAGTGGAG ATTTGGCACA CCGCCTTACC AGTGACGGAC CCTATGGCTA CAACAAGCAT
GCCCCGGAAT TTGATGAGCT TCTGGTTAAA AGCATCGAAA AGGACGATAT TGACAGGATT
CTTGATATAG ATGACAAGCT TCGGGATGAA GCCGCAGAGT GCGGATTAAG ATCCTTTGTA
ATAATGCTGG GAGCTTTGGA CGGATACAGT GTGGTTCCTG AAGTTTACTC TTATGAAGGT
CCTTTTGGAG TGGGATATAT GGTGGCAAGA ATCGGAGTCG GAGCTATGGA TTCTTCCCGA
AGGATAATTG AAAACAGGAG AAACAAAAGA AAAAAGAGTA CCGATCCGTA TGTTTCTCTT
GCCAAAAGAG CCCTGGAGGC TTATGTAACG GAAGGCAGGG TTTTGGATGA TTACAGCGGT
CTTCCGGAGG AGATGCTGAA TAGTAGAGCC GGAACTTTTG TTTCAATAAA GAAAAAGGGT
GAACTTAGGG GCTGTATCGG TACTATCGGG CCGACAAGGA AAAATATAGC AAGTGAGATA
GTTCATAATG CAATAAGCGC GGGTACTTCC GATCCCCGGT TCTATCCTGT GAAGCCCTAT
GAGCTGGATG AGCTTGAATA TTCCGTTGAT GTTTTAATGG AGCCCGAAGA GATTAATTCC
ATGGATGAAC TGGATGTAGT AAAATATGGG GTGATTGTAA GAGCCGGAAG AAGGACGGGC
CTTTTGCTTC CAAACCTTGA AAACGTTAAT ACTGTAGAGC AGCAGGTATC AATTGCGCTT
CAAAAGGCAG GCATAAGTCC AAACGAAAAA TACACAATGG AAAGGTTTGA GGTTATAAGG
CACAAATGA
 
Protein sequence
MGRIISSYIF PHPPLIVPEI GKGDEKGAIK TIEACEKAAE QIRKEKPSTI ILTTSHAPLF 
EDYIFINDHK TLKGNFSRFG ARKVELGFEN NLKMVESIIE FAKKEGFDAG GISEGIGRRY
GISGELDHGA LVPLYYISRV YSDFKLVHVA MSTLTLEEHY KFGMCIGEAV RNSDEDVVFV
ASGDLAHRLT SDGPYGYNKH APEFDELLVK SIEKDDIDRI LDIDDKLRDE AAECGLRSFV
IMLGALDGYS VVPEVYSYEG PFGVGYMVAR IGVGAMDSSR RIIENRRNKR KKSTDPYVSL
AKRALEAYVT EGRVLDDYSG LPEEMLNSRA GTFVSIKKKG ELRGCIGTIG PTRKNIASEI
VHNAISAGTS DPRFYPVKPY ELDELEYSVD VLMEPEEINS MDELDVVKYG VIVRAGRRTG
LLLPNLENVN TVEQQVSIAL QKAGISPNEK YTMERFEVIR HK
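
The relationship between the two sequences can be illustrated with a short translation sketch. It is a minimal example, not the method the database itself uses: it relies on the fact that NCBI translation table 11 shares the amino-acid assignments of the standard genetic code (the tables differ in which codons may act as starts), and the helper names `clean` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used, so the block stays short.

```python
# Compact codon table in TCAG order. These are the standard-code
# amino-acid assignments, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon: end of the coding region
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGGGAAGAA TAATAAGTTC TTATATTTTT CCTCATCCTC CTTTGATTGT ACCTGAGATT"
last_line = "CACAAATGA"

print(translate(first_line))  # MGRIISSYIFPHPPLIVPEI
print(translate(last_line))   # HK
```

The first 60 bases give the first 20 residues of the protein sequence, and the final nine bases give the last two residues (HK) followed by the TGA stop codon. Passing the full gene sequence to `translate` in the same way should reproduce the complete protein.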