Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3808 |
Symbol | rho |
ID | 8139182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4378892 |
End bp | 4380139 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644871427 |
Product | transcription termination factor Rho |
Protein accession | YP_003023585 |
Protein GI | 253702396 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 110 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTAC AGGAACTTAA AGAAAAGAAG ATCAACGATC TCACTGCGAT TGCCAAGGGG CTCAACATCG AGGGGGCTTC CAGTCTCAGG AAGCAGGACC TGATCTTCGC CATTCTCAAT GCCCAGACTG AGAAGAACGG GATGATCTTC GGCGAGGGCG TTTTGGAGAC GCTCCCTGAC GGTTTCGGCT TCCTGAGGGC GCCGGATTAC AACTACCTGC CGGGGCCGGA CGACATCTAC GTCTCGCCGT CGCAGATCCG CCGCTTCAAC CTGCACACCG GCGACACGGT GGCGGGCCAG ATCAGGCCTC CCAAGGAAGG GGAGCGCTAC TTCGCCCTTT TGAAGGTCGA GACGGTGAAC CACGAGTCGC CGGAGGTTGC GCGCGACAAG ATCCTTTTCG ACAACCTGAC TCCGCTCTAC CCGCAGGAAA AGCTGAAGCT AGAGACCACG CACGACAACA TGTCGACCCG GGTCATGGAG CTGATCGCGC CGATCGGGAA GGGGCAGAGG GGGCTCATCG TGGCGCCGCC GAGAACCGGC AAGACCATGC TGATCCAGAA CATCGCCAAC TCCATCGCCA TGAACCACCC CGAGGTGTTC CTGATCGTCC TTTTGATCGA CGAAAGGCCC GAGGAGGTGA CCGACATGCA GCGCTCGGTG AAGGGCGAGG TGATCTCCTC CACCTTCGAC GAGCCGGCCT CGCGCCACAT CCAGGTGGCC GAGATGGTCA TCGAGAAGGC CAAGCGGCTG GTCGAGCACA AGCGCGACGT CGTCATCCTG CTCGATTCCA TCACCCGACT GGCCCGCGCC TACAACACTG TGATCCCGCC CTCCGGCAAG ATCCTCTCCG GCGGCGTCGA TTCCAACGCC CTGCATAAGC CCAAGCGCTT CTTCGGCGCG GCCCGCAACA TCGAGGAAGG GGGCTCGCTC ACCATCATCG CCACCGCGCT GGTCGACACC GGCTCCAAGA TGGACGAGGT CATCTTCGAG GAGTTCAAAG GTACCGGCAA CATGGAGCTC CACCTGGACC GGAAGCTGGT CGAGAAGAGG ACCTTCCCCG CCATCGACAT CAACAAGTCC GGCACCAGGA AGGAGGAACT CCTGATCGAC AAGGCGTCCC TGAACCGGAT CTGGATACTC AGGAAGGTGC TGCACCCCAT GAACGTGGTC GACTCCATGG AGTTCCTGAT CTCCAAACTG CAGGGGACCA AGAGCAACCA GAACTTCCTT GATTCCATGA GCAAGTAA
|
Protein sequence | MNLQELKEKK INDLTAIAKG LNIEGASSLR KQDLIFAILN AQTEKNGMIF GEGVLETLPD GFGFLRAPDY NYLPGPDDIY VSPSQIRRFN LHTGDTVAGQ IRPPKEGERY FALLKVETVN HESPEVARDK ILFDNLTPLY PQEKLKLETT HDNMSTRVME LIAPIGKGQR GLIVAPPRTG KTMLIQNIAN SIAMNHPEVF LIVLLIDERP EEVTDMQRSV KGEVISSTFD EPASRHIQVA EMVIEKAKRL VEHKRDVVIL LDSITRLARA YNTVIPPSGK ILSGGVDSNA LHKPKRFFGA ARNIEEGGSL TIIATALVDT GSKMDEVIFE EFKGTGNMEL HLDRKLVEKR TFPAIDINKS GTRKEELLID KASLNRIWIL RKVLHPMNVV DSMEFLISKL QGTKSNQNFL DSMSK
|
| |