Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3229 |
Symbol | rho |
ID | 4072564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3821333 |
End bp | 3822580 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985250 |
Product | transcription termination factor Rho |
Protein accession | YP_592304 |
Protein GI | 94970256 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000548676 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0477745 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCG CAGAACTGAA AGAAAAGAAC ATCACCGAGC TTACCCGCAT AGCTCGTTCG CTCGACCTTC CCGGCGCCAG CGGCCTCCGC AAGCAGGACC TTATCTTCAA GATCCTCCAG GCGCAGAGCG AAAAAGAGGG CCACATCTTC GCAGAAGGTG TCCTCGAAAT CCTGCCCGAC GGCTACGGTT TCCTCCGCTC CCCGGATTAC AACTACCTCC CCGGTCCAGA CGACATCTAC GTCTCGCCTT CACAGATTCG CAAATTCGAC CTCAAGACCG GCGACACCAT CAGCGGACAA GTCCGCCCGC CGCATGAAGG CGAAAAGTAC TTTGCGCTCG TCAAGATTGA AGCCGTTAAC TTCGAATCGC CCGACGAAGC TCGCAACAAG ATTCTCTTCG ACAACCTGAC TCCGCTTTAT CCGCAGGAGC GGATCAAACT GGAGACCGTG CGCGACAATA TCTCCGCGCG CGTGATGGAC CTGCTCACGC CGGTGGGTAA AGGCCAGCGC GGCCTGATCG TCGCGCCGCC CCGCACCGGT AAGACGATGC TGTTGCAGAA CCTGGCGAAC TCGATCACCA CGAACCATCC CGAGATCGTG CTCATCGTTC TGCTGATCGA CGAGCGTCCG GAAGAAGTTA CCGACATGCA GCGCTCGGTG AAGGGCGAGG TCATCTCCTC GACGTTTGAC GAGCCCGCTG CCCGCCACGT GCAGGTTGCG GAAATGGTCA TCGAGAAGGC GAAGCGGCTG GTCGAGCACA AGCGCGACGT CGTCATCCTA CTCGATTCGA TCACGCGACT GGCGCGTGCT TACAACACCA TCGTTCCGCC CTCGGGCAAA GTGCTCTCCG GCGGTGTGGA TTCCAACGCG TTGCAGCGTC CGAAGCGTTT CTTCGGCGCA GCCCGCAACA TCGAAGAAGG CGGCTCGTTG ACGATCATTG CCACGGCATT GATCGAAACC GGATCGCGCA TGGACGACGT GATCTTCGAA GAGTTCAAGG GCACCGGCAA CATGGAAATC ATTCTCGACC GGAAACTGGC GGACAAGCGC ACGTTCCCGG CGATCGATAT CCAGCGCTCC GGCACCCGTA AGGAAGAGCT GCTGCTCGCG AAGGAAGACC TGCAACGGAT TTGGATTCTT CGCCGCGTGC TGAACCCGCT CTCACCTGTG GAAGCGATGG AATTGCTCAT CGACAAGCTG GGCAAGAGCC GGAACAATGG CGAGTTCCTG AGCAACATGA ACTCCTAG
|
Protein sequence | MTIAELKEKN ITELTRIARS LDLPGASGLR KQDLIFKILQ AQSEKEGHIF AEGVLEILPD GYGFLRSPDY NYLPGPDDIY VSPSQIRKFD LKTGDTISGQ VRPPHEGEKY FALVKIEAVN FESPDEARNK ILFDNLTPLY PQERIKLETV RDNISARVMD LLTPVGKGQR GLIVAPPRTG KTMLLQNLAN SITTNHPEIV LIVLLIDERP EEVTDMQRSV KGEVISSTFD EPAARHVQVA EMVIEKAKRL VEHKRDVVIL LDSITRLARA YNTIVPPSGK VLSGGVDSNA LQRPKRFFGA ARNIEEGGSL TIIATALIET GSRMDDVIFE EFKGTGNMEI ILDRKLADKR TFPAIDIQRS GTRKEELLLA KEDLQRIWIL RRVLNPLSPV EAMELLIDKL GKSRNNGEFL SNMNS
|
| |