Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK5032 |
Symbol | rho |
ID | 3023332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | - |
Start bp | 5129453 |
End bp | 5130724 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637549264 |
Product | transcription termination factor Rho |
Protein accession | YP_086601 |
Protein GI | 52140230 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00629962 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTGT CAATTGCAGC ATTAGAAAAC ATGAAATTAA AAGAGTTATA CGAGCTTGCG AAAGAATTTA AGATTTCGTA TTATAGCAAA TTAACGAAAA AAGAGTTAAT CTTCGCCATT TTAAAAGCTC GAGCAGAAAA AGAAGGTTTC TTCTTCATGG AAGGCGTATT AGAAATTATT CAATCAGAAG GATTTGGATT CCTACGTCCT ATCAACTACT CTCCAAGCTC AGAAGATATT TATATCTCAG CTTCGCAAAT TCGTCGTTTT GATTTACGTA ATGGAGATAA AGTTTCTGGT AAAGTACGAC CTCCGAAAGA AAATGAACGC TACTTTGGAT TATTACAAGT TGAAGCTGTA AACGGAGATG ATCCAGAGTC AGCAAAAGAG CGTGTGCATT TCCCTGCATT AACACCATTA TACCCAGATC GCCAAATGAA ATTGGAAACG GAACCGAAAA AGTTACCGAC ACGCATCATG GATTTAATTG CACCAGTTGG ATTTGGACAA CGTGGTTTAA TTGTCGCGCC TCCAAAGGCT GGTAAAACAA GTCTATTAAA AGAAATCGCG CACAGTGTTA CAACAAATCA TCCGGAAGCT GAATTAATTG TACTTTTAAT TGATGAGCGT CCAGAGGAAG TAACAGACAT TGAACGTTCT GTTAAAGGAG ATGTTGTAAG CTCTACTTTT GATGAAGTAC CAGAAAATCA TATTAAAGTA GCGGAACTTG TGTTAGAACG TGCAATGCGT CTTGTAGAGC ACAAAAAAGA TGTTATCATT TTAATGGATA GTATTACCCG TTTAGCGCGA GCTTACAACC TTGTTATTCC GCCAAGTGGT AGAACATTAT CGGGTGGTAT CGACCCAGCT GCCTTCCATA GACCGAAGCG CTTCTTTGGA GCTGCGCGTA ATATTGAAGA AGGCGGTAGC TTAACTATTT TAGCAACAGC GCTTGTTGAT ACAGGATCTC GTATGGACGA TGTAATTTAC GAAGAATTTA AAGGAACTGG AAATATGGAA CTTCACTTAG ATCGTTCATT AGCTGAGCGT CGTATCTTCC CAGCAATTGA TATTCGCCGT TCTGGTACAC GTAAAGAGGA TCTATTAATT CCGAAAGAAC ATTTAGACAA GCTGTGGGGT ATTCGTAAAA CAATGCGTGA TACACCAGAC TTTGTTGAAA GTTTCTTACG TAAACTTCGT CAAACAAAGA CAAATGAAGA ATTTTTACAA AACATTGTTG CAGATTCGAA AAGATATGTA ACAACTAAGT AA
|
Protein sequence | MNLSIAALEN MKLKELYELA KEFKISYYSK LTKKELIFAI LKARAEKEGF FFMEGVLEII QSEGFGFLRP INYSPSSEDI YISASQIRRF DLRNGDKVSG KVRPPKENER YFGLLQVEAV NGDDPESAKE RVHFPALTPL YPDRQMKLET EPKKLPTRIM DLIAPVGFGQ RGLIVAPPKA GKTSLLKEIA HSVTTNHPEA ELIVLLIDER PEEVTDIERS VKGDVVSSTF DEVPENHIKV AELVLERAMR LVEHKKDVII LMDSITRLAR AYNLVIPPSG RTLSGGIDPA AFHRPKRFFG AARNIEEGGS LTILATALVD TGSRMDDVIY EEFKGTGNME LHLDRSLAER RIFPAIDIRR SGTRKEDLLI PKEHLDKLWG IRKTMRDTPD FVESFLRKLR QTKTNEEFLQ NIVADSKRYV TTK
|
| |