Gene Information |
Locus tag | GWCH70_3329 |
Symbol | rho |
ID | 7977092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3352876 |
End bp | 3354150 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644800096 |
Product | transcription termination factor Rho |
Protein accession | YP_002951235 |
Protein GI | 239828611 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000488877 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGAGTTAA CACTTTCTAC ATTAGAAAAT ATGAAACTGA AAGAGCTCTA TGAGCTTGCT CGCCAATATA AAATTTCTTA TTACAGTAAG CTAACAAAAA AAGAGCTTAT TTTTGCCATT TTGAAAGCGC GTGCGGAACA AGATGGATTG TTTTTTATGG AGGGCGTGCT TGAAATCATT CAATCAGAAG GATTCGGCTT TTTACGTCCG ATCAACTATT CCCCAAGCTC CGAAGATATT TATATTTCCG CGTCACAAAT TCGCCGATTT GATTTGCGAA ACGGGGATAA AGTGTCTGGA AAAGTGCGTC CTCCAAAAGA AAATGAACGC TATTTCGGAT TGCTTCATGT CGAAGCAGTC AATGGCGAAG ACCCAGAAGT CGCAAAGGAA CGCGTGCACT TCCCGGCATT AACACCTTTA TACCCAAATC GACAAATGAA ATTAGAAACA ACCCCGGACA AGCTATCAAC GAGAATTATC GACTTAATTG CCCCGGTCGG ATTCGGGCAG CGCGGATTGA TTGTAGCTCC TCCAAAAGCG GGAAAAACGA TGTTATTAAA AGAAATCGCC AACAGCATCA CGACCAATCA TCCAGAAGTA GAACTCATTG TTCTCCTCAT TGACGAACGT CCAGAGGAAG TAACAGACAT CGAGCGGTCG GTCAATGGTG ACGTTGTCAG CTCGACGTTT GACGAAGTGC CGGAAAATCA TATTAAAGTA GCGGAATTAG TGTTAGAAAG AGCGATGCGT CTTGTCGAAC ATAAACGAGA TGTCGTTATT CTTATGGATA GCATCACTCG TTTAGCACGC GCTTATAACT TAGTCATCCC GCCAAGCGGC CGCACGCTTT CAGGGGGGAT TGATCCGGCT GCGTTCCACC GTCCGAAACG GTTTTTCGGG GCGGCCCGCA ATATTGAAGA AGGCGGCAGT CTAACGATTT TAGCTACTGC TCTTGTTGAT ACAGGTTCGC GTATGGATGA TGTTATATAC GAGGAATTTA AAGGCACCGG AAATATGGAA CTCCATCTTG ACCGATCGCT GGCGGAGCGG CGTATTTTCC CAGCCATCGA TATTCGTCGC TCAGGTACGC GTAAAGAAGA GTTGCTCATT CCGAAAGAGC ATCTTGAAAA ATTATGGGCG ATTCGAAAAA CAATGTCTGA TTCCCCTGAT TTCATTGAGC GGTTCTTAAG CAAACTCCGT CAAACAAAAT CAAACGAAGA ATTTTTTGCG ATGCTTGATG AAGAATGGAA AAGTAATGGA GCCGTGCGAA TTTAA
|
Protein sequence | MELTLSTLEN MKLKELYELA RQYKISYYSK LTKKELIFAI LKARAEQDGL FFMEGVLEII QSEGFGFLRP INYSPSSEDI YISASQIRRF DLRNGDKVSG KVRPPKENER YFGLLHVEAV NGEDPEVAKE RVHFPALTPL YPNRQMKLET TPDKLSTRII DLIAPVGFGQ RGLIVAPPKA GKTMLLKEIA NSITTNHPEV ELIVLLIDER PEEVTDIERS VNGDVVSSTF DEVPENHIKV AELVLERAMR LVEHKRDVVI LMDSITRLAR AYNLVIPPSG RTLSGGIDPA AFHRPKRFFG AARNIEEGGS LTILATALVD TGSRMDDVIY EEFKGTGNME LHLDRSLAER RIFPAIDIRR SGTRKEELLI PKEHLEKLWA IRKTMSDSPD FIERFLSKLR QTKSNEEFFA MLDEEWKSNG AVRI
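The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database: the function names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(3352876, 3354150)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values should match the Gene Length and Protein Length fields listed above.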