Gene Cthe_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0994 
Symbol 
ID4811288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1187769 
End bp1188983 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content41% 
IMG OID640106412 
ProductNusA antitermination factor 
Protein accessionYP_001037419 
Protein GI125973509 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000558995 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCGG ATTTAATTCA TGCATTGGAA CAATTGGAAA AAGAGAAGGG TATTGACAAG 
GATACACTGA TTGAGGCTAT TGAGGCTGCT CTTATTTCAG CTTACAAGAG AAGCTTTGGC
TCATCTCAGA ATGTCAAGGT GTCGATTGAT CGTGAGACTG GGGATTTTAA GGTTTATGCC
TTAAAAAAGG TTACTGCCAA TCCTCAAAAT GAGTTGTTGG AAATCTCTAT TGAAAATGCC
AAAAAAATAA ATCCGGACTT TGAAGAAAAT GATATTGTTG AAATAGAGGT AACACCAAGA
AAGTTTGGAA GGATTGCCGC TCAGACTGCC AAACAGGTGG TTGTGCAGCG TATCAGGGAA
GCGGAAAGAG GAATTATTTA TAACGAGTTC TCGAATAAAG AGGGAGAAAT TGTTACCGGA
GTTGTTCAAA GGATAGAGAG AAAAAACGTA ATTATTGACC TTGGAAAAGC GGAGGCTATA
CTGGCTCCTT CCGAACAAAT ACCGGGAGAG GAATATAAGT TTAATGACAG AATAAAAACG
TATATAATTG AAGTGAAAAA GACCACCAAA GGGCCTCAAA TTTTGGTATC AAGGACACAT
CCCGGAATTA TTAAAAGGCT GTTTGAACTT GAGGTGCCTG AAATATACGA AGGAATTGTT
GAAATAAAGA GTATTGCCAG AGAACCGGGT TCGAGAACCA AGATTGCCGT TTATTCGAAA
GATGAAAATG TGGACCCGGT GGGAGCATGT GTCGGTCAAA AAGGTACCAG AGTCCAGGCC
GTTGTTGATG AACTTCGGGG TGAAAAAATA GATATAATAA AATGGAGCAG CAATCCTGAA
GAATATATTT CCAGCAGCCT TAGTCCCGCC AAGGTTATAA GGGTGGACAT AAATGAAGAA
GAAAAGAGTG CCAAGGTCAC TGTTCCTGAT TTTCAGCTTT CTCTGGCAAT AGGCAAAGAG
GGACAGAATG CCAGACTTGC TGCAAAACTG ACCGGCTGGA AAATTGATAT AAAGAGCGAA
TCACAGCTTC GGGCTGCAAT TGAGCAGCAA CTTTTGAATT TTAACGGGAG CTACAGTACT
GAAAAGCAAG ATACGCCAAT TACACCGGAT CAACTTTTTA AGACAAATGT CGAAGAAAGC
AGCAATGATA CGGTGGATGA TAATGCTGAA AACGCAGCTG ACGATGTGGT GGAAGATACC
ATGGATGATG TATAA
 
Protein sequence
MSADLIHALE QLEKEKGIDK DTLIEAIEAA LISAYKRSFG SSQNVKVSID RETGDFKVYA 
LKKVTANPQN ELLEISIENA KKINPDFEEN DIVEIEVTPR KFGRIAAQTA KQVVVQRIRE
AERGIIYNEF SNKEGEIVTG VVQRIERKNV IIDLGKAEAI LAPSEQIPGE EYKFNDRIKT
YIIEVKKTTK GPQILVSRTH PGIIKRLFEL EVPEIYEGIV EIKSIAREPG SRTKIAVYSK
DENVDPVGAC VGQKGTRVQA VVDELRGEKI DIIKWSSNPE EYISSSLSPA KVIRVDINEE
EKSAKVTVPD FQLSLAIGKE GQNARLAAKL TGWKIDIKSE SQLRAAIEQQ LLNFNGSYST
EKQDTPITPD QLFKTNVEES SNDTVDDNAE NAADDVVEDT MDDV