Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0994 |
Symbol | |
ID | 4811288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1187769 |
End bp | 1188983 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106412 |
Product | NusA antitermination factor |
Protein accession | YP_001037419 |
Protein GI | 125973509 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000558995 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGG ATTTAATTCA TGCATTGGAA CAATTGGAAA AAGAGAAGGG TATTGACAAG GATACACTGA TTGAGGCTAT TGAGGCTGCT CTTATTTCAG CTTACAAGAG AAGCTTTGGC TCATCTCAGA ATGTCAAGGT GTCGATTGAT CGTGAGACTG GGGATTTTAA GGTTTATGCC TTAAAAAAGG TTACTGCCAA TCCTCAAAAT GAGTTGTTGG AAATCTCTAT TGAAAATGCC AAAAAAATAA ATCCGGACTT TGAAGAAAAT GATATTGTTG AAATAGAGGT AACACCAAGA AAGTTTGGAA GGATTGCCGC TCAGACTGCC AAACAGGTGG TTGTGCAGCG TATCAGGGAA GCGGAAAGAG GAATTATTTA TAACGAGTTC TCGAATAAAG AGGGAGAAAT TGTTACCGGA GTTGTTCAAA GGATAGAGAG AAAAAACGTA ATTATTGACC TTGGAAAAGC GGAGGCTATA CTGGCTCCTT CCGAACAAAT ACCGGGAGAG GAATATAAGT TTAATGACAG AATAAAAACG TATATAATTG AAGTGAAAAA GACCACCAAA GGGCCTCAAA TTTTGGTATC AAGGACACAT CCCGGAATTA TTAAAAGGCT GTTTGAACTT GAGGTGCCTG AAATATACGA AGGAATTGTT GAAATAAAGA GTATTGCCAG AGAACCGGGT TCGAGAACCA AGATTGCCGT TTATTCGAAA GATGAAAATG TGGACCCGGT GGGAGCATGT GTCGGTCAAA AAGGTACCAG AGTCCAGGCC GTTGTTGATG AACTTCGGGG TGAAAAAATA GATATAATAA AATGGAGCAG CAATCCTGAA GAATATATTT CCAGCAGCCT TAGTCCCGCC AAGGTTATAA GGGTGGACAT AAATGAAGAA GAAAAGAGTG CCAAGGTCAC TGTTCCTGAT TTTCAGCTTT CTCTGGCAAT AGGCAAAGAG GGACAGAATG CCAGACTTGC TGCAAAACTG ACCGGCTGGA AAATTGATAT AAAGAGCGAA TCACAGCTTC GGGCTGCAAT TGAGCAGCAA CTTTTGAATT TTAACGGGAG CTACAGTACT GAAAAGCAAG ATACGCCAAT TACACCGGAT CAACTTTTTA AGACAAATGT CGAAGAAAGC AGCAATGATA CGGTGGATGA TAATGCTGAA AACGCAGCTG ACGATGTGGT GGAAGATACC ATGGATGATG TATAA
|
Protein sequence | MSADLIHALE QLEKEKGIDK DTLIEAIEAA LISAYKRSFG SSQNVKVSID RETGDFKVYA LKKVTANPQN ELLEISIENA KKINPDFEEN DIVEIEVTPR KFGRIAAQTA KQVVVQRIRE AERGIIYNEF SNKEGEIVTG VVQRIERKNV IIDLGKAEAI LAPSEQIPGE EYKFNDRIKT YIIEVKKTTK GPQILVSRTH PGIIKRLFEL EVPEIYEGIV EIKSIAREPG SRTKIAVYSK DENVDPVGAC VGQKGTRVQA VVDELRGEKI DIIKWSSNPE EYISSSLSPA KVIRVDINEE EKSAKVTVPD FQLSLAIGKE GQNARLAAKL TGWKIDIKSE SQLRAAIEQQ LLNFNGSYST EKQDTPITPD QLFKTNVEES SNDTVDDNAE NAADDVVEDT MDDV
|
| |