Gene Cthe_0760 details


Gene Information

Locus tag: Cthe_0760
Symbol:
ID: 4810378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 925091
End bp: 926725
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 36%
IMG OID: 640106177
Product: hypothetical protein
Protein accession: YP_001037188
Protein GI: 125973278
COG category:
COG ID:
TIGRFAM ID:
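
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 925091
end_bp = 926725

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under those assumptions the computed values agree with the listed Gene Length (1635 bp) and Protein Length (544 aa).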


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGAATAG AAAGCTCGGA AAATGTATTA ATTCAACAGA CCGGAGCTTT GGACATACTT 
TCGCGGCTTG ATACGGGAGA TACCTTGAGA GCAAGGGTAG TTGACATTAC TGCAAATGAG
CTGCTCCTAA AGCTGTTTGA CGGAACATTG ATAAATGCCG GTACTATGAC TCCCATAGAT
GCCAAAAAGG GAGAGTTGCT TGATTTTATT GTAAAAAACA AAGTGAACAA TCAATTGTTT
TTGGAAATCA TGAAGGATGG CGTTCAAAAT GCCGCTCAGC CCAATGTTGA GGACGAAATT
AAAAACAAGC TTGCACAGCT TGGCATAAAG CCTGACAGAA GAAATATGGA GACGGCGGCT
GAACTTCGTG CCAATGGAAC TCAGCTTAAT GCTGAAAACA TTACCAAAGT TGTTGATGCG
GTGCTTAGGT TTAAAAACCT TGGCATTGCA AAAGCAGCCT ATCTTGTTTC AAACAATATT
ATTCCCGAAG AAAAAAGTAT AACAAGTCTA AATAGATTTG TCGAGGGCAG AGTGAGATTG
AGTTCTGAGC TTTTAGATCT GGCTTCAAGT CTTGCAGAGA TTCAGGATAA AGATGTGGCT
TTTGCCATCT TAAAAAAGCT TAATGCCATG GATTATCCTT TTCAAAAGAG CGGAGAAACT
TCTGATGTCA GCACTTCCCC TATTCATTAT GAAAACGGAA AAATTTATAA CAGCCAAAAG
AATTGGCATA TTGAAGGGAA TGTGGCTCAA AAAAGTGAAA AGGAGCATTG GAGCCGTAAG
CCTGAATTGT CCCGAAATGG CAATATAGGT ACAGAAAATT TGGATAAAAC AGTGAAAAAT
GTTGACATAG AGGGAAATGA GATATTACAA TTAAAGAAAA AAACTGCTGA TTTTGTTAAT
TACAAAGAAA ACATAGAGAA ATCTGAAAAA GAGCTTGCGG ACTTTTTAAA TGTGATTCTG
GCTTCACAAG GAAGCAGAAG CGCCGGCAAA GGCCGTTCTG AAACTAATAA TGTGGCTCAA
GTTATAAAAA AATCCTTTGA AAAGATGTTT GCAAAAATTA ATGAGGAAGT TGAAGGCAGA
GATATTAATG TAAAGGAATT TTACAGAGAT ATTTATAAAA AGCTTGAAAT AGTCCGCAAA
GTTTTAGAAG AGACCGACAT CCCAGGTAAA CAAGAGATTT TAAACAAAGT TGACAACATT
AAGAGCGATA TAAATTTTTT AAATGAGTTA AACAAACATA CCGTATATTT TCAGATACCT
CTGAAAATAT TTGACAAGAA TACTAACGGT GAGCTTTATA TATTGAAAAG AAACAACGGA
AGAAAAAGAA TTGATCCGCA AAATGCCACT GTGTTTTTGT CATTGGATAC GGAAAATTTG
GGACAAGTGG ACTCTCTTAT CAGTGTAAAC AAAAAGAATG TCAGCCTCAA TTTTAGGCTT
GAAAAAAATG AAATCATAGA TTATATCAAA GAAAACTATA TTCAGCTTTA TGAAGGATTG
GCCAAAAAAG GCTATAAACT TGTGGATATC AAATACAGGC TTATAGATGA AAAGGTAAAT
CTGTTAAATG CCCGGGAGGT TTTGGAAAAA GAAATAGAAA GAACAAGAAA CAGAGGGTTT
GACTGCAAAA TTTGA
 
Protein sequence
MRIESSENVL IQQTGALDIL SRLDTGDTLR ARVVDITANE LLLKLFDGTL INAGTMTPID 
AKKGELLDFI VKNKVNNQLF LEIMKDGVQN AAQPNVEDEI KNKLAQLGIK PDRRNMETAA
ELRANGTQLN AENITKVVDA VLRFKNLGIA KAAYLVSNNI IPEEKSITSL NRFVEGRVRL
SSELLDLASS LAEIQDKDVA FAILKKLNAM DYPFQKSGET SDVSTSPIHY ENGKIYNSQK
NWHIEGNVAQ KSEKEHWSRK PELSRNGNIG TENLDKTVKN VDIEGNEILQ LKKKTADFVN
YKENIEKSEK ELADFLNVIL ASQGSRSAGK GRSETNNVAQ VIKKSFEKMF AKINEEVEGR
DINVKEFYRD IYKKLEIVRK VLEETDIPGK QEILNKVDNI KSDINFLNEL NKHTVYFQIP
LKIFDKNTNG ELYILKRNNG RKRIDPQNAT VFLSLDTENL GQVDSLISVN KKNVSLNFRL
EKNEIIDYIK ENYIQLYEGL AKKGYKLVDI KYRLIDEKVN LLNAREVLEK EIERTRNRGF
DCKI
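
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as initiators, which is why the leading GTG is read as M rather than V. The function name `translate` and the rule "always emit M for the first codon" are simplifications introduced here for illustration; this is not the tool that produced the record.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator, so it is emitted
    as M regardless of its usual meaning (GTG would otherwise be V).
    """
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_codons = "GTGAGAATAG AAAGCTCGGA AAATGTATTA"
print(translate(first_codons))
```

Applied to the first 30 bases, this yields MRIESSENVL, which matches the first ten residues of the protein sequence above.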