Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0410 |
Symbol | rho |
ID | 5745170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 520544 |
End bp | 522496 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641291522 |
Product | transcription termination factor Rho |
Protein accession | YP_001557536 |
Protein GI | 160878568 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000426411 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATT TGTATGAAAC AAAGTCACTT TCTGAATTGC GTGATATTGC GAAAGAAAAG GGTTTGAAAA GTATTAGCGC ATTACGTAAG CAGGAACTGA TAGATGCTTT GAATGCATTG GAGAAAGGAC AAGGATCAGT TGCCTCCATA AATAAAAGTA CGGATGATAA ACCTATAAAG CTTGGAACAG AAGAAGTTAA ATTGGCAACA GAGGATACAA AGCAGAGTGT AGAGAATCGT GGTATGGAAT CTTCTCATAT GCAAGGAAAT CGAAATAACG TGATACGCAG GCTACCAGAG CAGAATCGTC AACATGAATT GGTAAAAGCA CCTGATCAAT CTAAGGGAAA TGATAATAGC CGAATGAGTG AGAGCTATCG CAGTAATGAA GGTCGGAACA TGGATAATCG TTTAGAGAAC AGAAACAACG ATAATAAAAA TAACGATAAT AGAAATAATG AAAATCGTAA TAATGAAAAT AGAAACTATG AAAATAGTAG AAATTATGAA AATAGAAGTA ATGAAAATAG AAACAATGAA AATAGAAACA ATGAAAATAG AAACAATGAA AATAGAAGCA ATGAGATGCT AAGAAATTAT GATAATAACC GCCGTACGAT GGATAATCGC AGACCAATGG ATAACCGCAG ATTAAATGAA AATCGCGGAT ATAATGAATC GGATCGTGTA GTCAGAATAA ATCCAACAGG GGATATTTCA GGTAATCAAG CTACTGAGTT AGGCTATAAT ACAACTTCCT CATCGGATCG TTTTCTTTCA CAAGCAAATC AACAGGAAGA AAAATTGCCA ATTGATATGG AACAATTGGA TTCTGGTGAA ACAAAAGAAG GAATTTTAGA GGTATTATCC GATGGATACG GATTTATTCG TTGTGATAAT TTTTTACCTG GGGAAAATGA TGTTTATGTT TCTCCGGCAC AGATTAGAAG ATTTAATTTA AAAACTGGTG ATATCATTGT AGGAAATACT AGAATACGTA ATCAAAACGA AAAGTTCAGC GCTCTTCTTT ATATCAAGCT TATTAATGGA TTACATCCTT CTGAAGCAGT AAAACGGAAG AAATTCGAAG ATTTAACACC GATCTTCCCA AATGAAAGAA TTCATCTTGA AACGCCTGGC TGTCAGGTAG CTATGCGAAT GGTAGACTTA ATTTCACCAA TTGGTAAGGG ACAAAGAGGT ATGATTGTAT CCCAGCCAAA AACAGGTAAG ACTACCTTGT TAAAACAAGT TGCTAAGGCT ATTACAAGAA ACCATCCTGA GATGCATTTA ATCATTTTAT TAATTGATGA GCGACCAGAA GAAGTTACCG ATATTAAAGA GTCTATAGAA GGTGGTAATG TAGAAGTAAT TTACTCCACT TTTGATGAAT TACCAGAGAA TCATAAGCGA GTTTCAGAGA TGGTTATAGA ACGTGCTAAG CGTCTGGTAG AGCATAAAAA AGATGTTGTT ATCTTATTAG ATAGTATCAC AAGACTTGCA AGAGCCTATA ACTTAACGGT ACAAGCTAGT GGACGTACCT TATCCGGTGG TCTTGACCCT GCAGCACTTC ACATGCCGAA AAAGTTTTTC GGAGCAGCAA GAAATATGAG AGAGGGCGGA AGCCTCACTA TTTTGGCAAC AGCGTTAGTT GAAACCGGTA GCCGTATGGA TGATGTTGTA TTTGAGGAAT TTAAGGGAAC CGGTAATATG GAACTTGTAT TGGATCGTAA TTTATCAGAG AAACGAATCT TCCCAGCAAT TGACCTTCCA AAGTCTAGTA CACGTCGTGA TGATTTATTA CTAAATTCCG CTGAGGTGGA AGCAAATTAT CTTATGAGGA AGGCGTTAAA CGGTCTTAAA TCAGAAGATG CAGTAGAACG AATTATTCAA ATGTTTGTTA ATACAAAAAA CAATGCAGAA TTTGTCGAAA TGATTAAGAA AACAAAGATA TAA
|
Protein sequence | MSNLYETKSL SELRDIAKEK GLKSISALRK QELIDALNAL EKGQGSVASI NKSTDDKPIK LGTEEVKLAT EDTKQSVENR GMESSHMQGN RNNVIRRLPE QNRQHELVKA PDQSKGNDNS RMSESYRSNE GRNMDNRLEN RNNDNKNNDN RNNENRNNEN RNYENSRNYE NRSNENRNNE NRNNENRNNE NRSNEMLRNY DNNRRTMDNR RPMDNRRLNE NRGYNESDRV VRINPTGDIS GNQATELGYN TTSSSDRFLS QANQQEEKLP IDMEQLDSGE TKEGILEVLS DGYGFIRCDN FLPGENDVYV SPAQIRRFNL KTGDIIVGNT RIRNQNEKFS ALLYIKLING LHPSEAVKRK KFEDLTPIFP NERIHLETPG CQVAMRMVDL ISPIGKGQRG MIVSQPKTGK TTLLKQVAKA ITRNHPEMHL IILLIDERPE EVTDIKESIE GGNVEVIYST FDELPENHKR VSEMVIERAK RLVEHKKDVV ILLDSITRLA RAYNLTVQAS GRTLSGGLDP AALHMPKKFF GAARNMREGG SLTILATALV ETGSRMDDVV FEEFKGTGNM ELVLDRNLSE KRIFPAIDLP KSSTRRDDLL LNSAEVEANY LMRKALNGLK SEDAVERIIQ MFVNTKNNAE FVEMIKKTKI
|
| |