Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2402 |
Symbol | rho |
ID | 3830769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2521783 |
End bp | 2523072 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637830321 |
Product | transcription termination factor Rho |
Protein accession | YP_431227 |
Protein GI | 83591218 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATACGG CCGAACTAGA AAGCAAGACC ATGGTAGAGC TCTACCGGAT AGCCCGCGAG TTAAACTTGA GCGGCTATTC CCGGATGCGG AAAAAAGAGC TGGTATTTGA GATTGCCAAG GCCCTGGCCC AGAAAGAGCA GCAAGAACAC CAGGAGGAGG CCAGGGAAGA ACAACTCCAG GCCCAGGGCA TCCTGGAGAT ACTCCCGGAT GGTTACGGAT TTTTACGGCC CTTCGGCTAC CTGGCCAGCG GCGACGATAT CTACATCTCC GCCTCTCAAA TCCGTCGCTT CGACCTGCGG ACCGGGGATA AGGTGGCCGG CCTGGTCCGG CCACCCAAGG ACAATGAACG CTTTTTTGCC CTCCTGCGGG TGGAGAAGGT CAACGGGGAA AACCCGGAAA CAGCAGCAGA ACGCCTTCAC TTTGATGCCC TGACCCCCAT CTATCCCTCA GAGCGCTATA CCCTGGAGAC GGCCAACGGC GACCCATCGG CCCGGATTAT CGACCTGATA GCCCCTATAG GCAAAGGCCA GCGGGCCCTC ATTGTCTCCC CGCCCAAAGC CGGCAAGACG GTTCTGCTCA AGAAAATAGC CAACGCCATC AAGACCAACT ACCCTGAAGT CGAATTGATG ATCCTGCTAA TCGACGAACG CCCGGAAGAG GTCACCGACA TCGAGCGTTC GGTCCGGGGC GAGGTCATCA GTTCCACCTT TGACGAATTC CCTGAAAACC ACGTCAAAGT AGCTGACATG GTCCTGGAAC GGGCTAAACG CTTGGTGGAG CATAAAAAGG ATGTCGTAGT TCTCCTGGAC AGCATCACTC GCCTGGCCCG GGCCCACAAC CTGGTTGTTC CCCCCAGCGG CCGCACCCTC TCCGGTGGCG TCGATCCCAC GGCTCTTTAT AAACCCAAGC GCTTCTTCGG TGCCGCCCGG AACATCGAAG AGGGTGGCAG CCTGACCATT GTGGCCACCG CCCTGATTGA GACCGGCAGC CGCATGGACG AGGTTATTTT CGAAGAATTT AAAGGCACCG GTAATATGGA ACTCATCCTG GACCGGAGGT TGGCCGAGCG GCGGATTTTC CCGGCCATCG ATGTTAAACG TTCCGGCACG CGGCGGGAGG AATTGCTCCT CAGCCGGGAG GAGCTGGAAC TGGTCTGGAA CTTCCGGCGG GTGAGCAGCG GAATGGGCCC GGTAGAGGTC ACCGAGACCC TCATCGACGC CATGAAGAAA ACCAAGAGTA ACCAGGACCT CCTGCGCGCC CTCCCCGCCC TTTTCCCCCG GGAACATTAA
|
Protein sequence | MNTAELESKT MVELYRIARE LNLSGYSRMR KKELVFEIAK ALAQKEQQEH QEEAREEQLQ AQGILEILPD GYGFLRPFGY LASGDDIYIS ASQIRRFDLR TGDKVAGLVR PPKDNERFFA LLRVEKVNGE NPETAAERLH FDALTPIYPS ERYTLETANG DPSARIIDLI APIGKGQRAL IVSPPKAGKT VLLKKIANAI KTNYPEVELM ILLIDERPEE VTDIERSVRG EVISSTFDEF PENHVKVADM VLERAKRLVE HKKDVVVLLD SITRLARAHN LVVPPSGRTL SGGVDPTALY KPKRFFGAAR NIEEGGSLTI VATALIETGS RMDEVIFEEF KGTGNMELIL DRRLAERRIF PAIDVKRSGT RREELLLSRE ELELVWNFRR VSSGMGPVEV TETLIDAMKK TKSNQDLLRA LPALFPREH
|
| |