Gene Information |
Locus tag | Ccel_3376 |
Symbol | rho |
ID | 7311939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3913774 |
End bp | 3915846 |
Gene Length | 2073 bp |
Protein Length | 690 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643610279 |
Product | transcription termination factor Rho |
Protein accession | YP_002507645 |
Protein GI | 220930736 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
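
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the unspliced CDS is assumed to end in a single stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3913774, 3915846)
print(length, protein_length(length))  # prints: 2073 690
```

Both values match the Gene Length (2073 bp) and Protein Length (690 aa) fields in the record.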
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000138614 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCAAA TTAATTTACA ATCAAAAACG TTAGAAGATT TGAGATATAT TGCAAAGATG CTTGGGATTA AAAGTATATC CAAATATAAG AAGAGTGAGT TGGTAAAATT GCTTAGCGAA AACGCTGAAA AAATTAAAGC CGACAATATT GTGGAAACGG TTGTTTCTGA AGAACCTAAA AAGGCTGAAC CAGCTGCTAA TACAATAGAT ACTCACGTAA GCGAAGAAAG GTCCAAAGAA GCATCGGAGG AGATTCCTGT TTTAAGAAAA TCCCGAAGAG GAAGGCCGAG CAAGGCATCC AAATTAATGG ACAAGCCGGA AAATAATGGT GTAGCAGATC CAATCACTAG TGTTCCAGTT CAGGAAACAG TCATTCCAGG AAATAATCTA AATATCGTAC CTGAGCAAAA TGGCAAACCC GATAGATTGT CTGACACATT AAGGCATCTG GAAACAAAGA ATATTGAGAA GAAGGCCCAT GGCAGAGGTC GCAAAAAGGA TGTTCAACCT TCTATTGAAT CCCCAAATAC TGAAAATAAG GCAGAAACCA ATACATCAGC AGTTTCAGCA GGGGCTACAG TGAATGATTC TATCGTTAAC CTGCCTCAAA GGAAAAATGG CAGGGCAAAG CCGGAGCAAA CGGTAGAACA GCCAAAAAAC GTTCCTGATG AGAAGATACC CGTTAAAGTT GAAAAGGATT CTAAACCTAA AATATACAAG AAGGAAGAAC AACCAACAGC AAGGCAAGAT ACTAGGCAAT TGCATAGGCA AGTTAAACAG CATACAAATA CAAACAACAA GTCTGACAAT GCACCCGAAC CTCAGCGTAT ACCGCTAAAT CAGCAGCCAC AGCAGGTTCA GCCTCAACAA GTGGTGCCGC CCCAGTCTCA GGTTATTTCA CCTCAAATTC AGCAGCCGCA AGGTAGTGAA AAAATTGAAA GTGATGACCC TGTAGAAGGA GTTCTTGAAG TGTTACCAGA CGGATATGGT TTTTTAAGAA GTGAGAATTA TCTGTCCGGC CCCAAAGATG TTTATGTTTC ACCTTCACAA ATAAGACGTT TTGGACTTAA AACAGGGGAT AAGCTCAGAG GTAAAGGCAG AATTCCGAAA GAAGGCGAAA AATTTCAGGC ATTATTATAT GTACAGTCCA TAAACGGGGA CACTCCTGAT GTTGCATCAA AGAGAGTAGC GTTTGAGTAC TTAACGCCTA TATATCCTGA TAACAGAATT ACGCTCGAAA CTTCACCAAG GGAATTTTCA ACCAGACTTA TAGATCTCAT AGCTCCAATC GGAAAAGGCC AGAGAGGAAT GATAGTATCT CCCCCTAAAG CCGGTAAAAC AATACTTTTA AAGAAAATTG CAAACGCAAT TACAATCAAT TACCCCGAAG CAGAGTTAAT AGTACTTCTT ATCGATGAAA GACCTGAAGA AGTAACCGAT ATGCAGCGTT CTATAAAGGG AGAGGTTATA TATTCTACAT TTGACGAGGT TCCCGAACAC CATATAAAGG TTGCCGAGAT GGTTCTGGAG AGAGCACAGC GTCTTGTTGA GCAAAAGAAG GATGTTGTTA TTCTTCTGGA TAGTATAACA AGGCTGGCAA GGGCTTATAA CCTTACTATT CCTCCGACAG GAAGAACTTT ATCCGGCGGT TTGGATCCGG GTGCGCTTCA TAAACCGAAA AGATTTTTTG GAGCAGCAAG GAATATTGAA TACGGAGGCA GTTTGACAAT TATGGCCACC GCTCTCATAG AAACAGGAAG CAGAATGGAT GATGTAATCT TTGAGGAATT CAAGGGAACC 
GGTAACATGG AACTTCATCT GGATAGAAAG CTTTCTGAAA AGAGAATATT CCCTGCAATC GATATTAATA AGTCGGGAAC CAGACGTGAG GAGCTTCTTC TTAGCCAAAA AGAGCTTGAG AGCGTATGGG CTATAAGAAA AGCAATGAGT AATATGGGAA CAGCAGAGGT TACAGAGATT TTAATTAACA AGCTAATGCA AACTAGAACA AACGAAGACT TTGTAAATAG TATAAAAATA TCGTTTTTAG ATAAAAATTC TCAGGATAGA TAA
|
Protein sequence | MDQINLQSKT LEDLRYIAKM LGIKSISKYK KSELVKLLSE NAEKIKADNI VETVVSEEPK KAEPAANTID THVSEERSKE ASEEIPVLRK SRRGRPSKAS KLMDKPENNG VADPITSVPV QETVIPGNNL NIVPEQNGKP DRLSDTLRHL ETKNIEKKAH GRGRKKDVQP SIESPNTENK AETNTSAVSA GATVNDSIVN LPQRKNGRAK PEQTVEQPKN VPDEKIPVKV EKDSKPKIYK KEEQPTARQD TRQLHRQVKQ HTNTNNKSDN APEPQRIPLN QQPQQVQPQQ VVPPQSQVIS PQIQQPQGSE KIESDDPVEG VLEVLPDGYG FLRSENYLSG PKDVYVSPSQ IRRFGLKTGD KLRGKGRIPK EGEKFQALLY VQSINGDTPD VASKRVAFEY LTPIYPDNRI TLETSPREFS TRLIDLIAPI GKGQRGMIVS PPKAGKTILL KKIANAITIN YPEAELIVLL IDERPEEVTD MQRSIKGEVI YSTFDEVPEH HIKVAEMVLE RAQRLVEQKK DVVILLDSIT RLARAYNLTI PPTGRTLSGG LDPGALHKPK RFFGAARNIE YGGSLTIMAT ALIETGSRMD DVIFEEFKGT GNMELHLDRK LSEKRIFPAI DINKSGTRRE ELLLSQKELE SVWAIRKAMS NMGTAEVTEI LINKLMQTRT NEDFVNSIKI SFLDKNSQDR
|
| |
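
The gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), so it can be translated directly. The sketch below is one way to check the first ten codons against the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the tables differ in permitted start codons). `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed valid for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGGACCAAA TTAATTTACA ATCAAAAACG"))  # prints: MDQINLQSKT
```

The output matches the first block of the protein sequence (MDQINLQSKT). Passing the full gene sequence to the same function should reproduce the 690-residue protein, provided the two sequence lines are joined first.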