Gene Cthe_0861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0861 
Symbol 
ID4810479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1035988 
End bp1037124 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content38% 
IMG OID640106277 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_001037288 
Protein GI125973378 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCAAA AAAAAATATT TTCAAAGAAA AAGAATTTAA CTTTTAAAAA AGCCCTGCTC 
ATTTGGATGG TTAGTTTGAT AGTTTTAGGG GGAATGATTT TTCTTTTGGG AAAATTCGTT
CTAAGTGACA GCAACGGCAG TTCAGACGTG GACAATGACC TTACATATCC ACCTGTTAAA
AGTTACGAAC CGATTAGTTT TCCTACACCC GGCAAAATCG GCAATGACAA GCCTTCCGAT
GAGGAAAACT CCGATAAATT TCAGGGTCTC ACAAACGGAA TGAAATATCA TGAAAAGCTG
GTTGACGAAA GCAGTCGAAA CATATTGTTC ATCGGGCAGG ACAGAATTTC CGGCCTGTAT
GATACCATAG GTATATTAAG CATTGACAGA AAAAACAAAA AACTGAAAAT AATAATGATT
CCCCGCGACC TGTATATCGA TTACAGTTCC AGAGTCAAAC ACTATTTAGA AGAAAACGGC
AGATTAAACG ATCCCGACTT TTATAAAATT AATGCCGCGC ACCTTATAGG GCCGTATATG
AAATATGAGG GAAAATTTGG TGCATATTCA ATGAATTTTC TGGCCGAACT GATCAAAGAA
ATTTTTGACA TTGAAGTTCA TGACTACGTA AGGGTTAACA CCGAAGGCTT CGTCCAAATT
GTCGACCTTT TCGGCGGTGT CGACATTAAC GTTCCGTACG ACATGCATTA CGATGACATA
TATCAGGATC TCCACATCCA TATAAACAAA GGATGGAACC ATCTTGACGG AAAAAAAGCC
GAAGGATTCG TCCGTTACAG GCAAAGCAAC GACGAAATGG GAAACATCAC CCATTCGATC
GGCGATTATG AAAGAAAGAA AAACCAGATT AATTTTATAA AAGCATTTAT TGAGCAGCAT
GGGACAATGT CCAATATCGA CAAGCTCCCC AGTCTGATTT CTACGTTAAA CAAATACATG
AAACACAGCA TCGGAGTCGG AGACGTTCTG ACAACTTACA TTGGCTATGC AAAAGATGTC
GTTCTATACA AATACCCGAT AGAAACTTAT ACAGTTACCG GAAAAGACAA ATATATGAAC
CAAAGATATT ATATTGTAAT TGAAAATGAC AACAAGACCA GCAAAGTTGT TGACTAA
 
Protein sequence
MGQKKIFSKK KNLTFKKALL IWMVSLIVLG GMIFLLGKFV LSDSNGSSDV DNDLTYPPVK 
SYEPISFPTP GKIGNDKPSD EENSDKFQGL TNGMKYHEKL VDESSRNILF IGQDRISGLY
DTIGILSIDR KNKKLKIIMI PRDLYIDYSS RVKHYLEENG RLNDPDFYKI NAAHLIGPYM
KYEGKFGAYS MNFLAELIKE IFDIEVHDYV RVNTEGFVQI VDLFGGVDIN VPYDMHYDDI
YQDLHIHINK GWNHLDGKKA EGFVRYRQSN DEMGNITHSI GDYERKKNQI NFIKAFIEQH
GTMSNIDKLP SLISTLNKYM KHSIGVGDVL TTYIGYAKDV VLYKYPIETY TVTGKDKYMN
QRYYIVIEND NKTSKVVD