Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4146 |
Symbol | rho |
ID | 6144642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4246064 |
End bp | 4247323 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618969 |
Product | transcription termination factor Rho |
Protein accession | YP_001746101 |
Protein GI | 170683673 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT ATGGGGCTGG AAAACCTGGC TCGTATGCGT AAGCAGGACA TTATTTTTGC CATCCTGAAG CAGCACGCAA AGAGTGGCGA AGATATCTTT GGTGATGGCG TACTGGAGAT ATTGCAGGAT GGATTTGGTT TCCTCCGTTC CGCAGACAGC TCCTACCTCG CCGGTCCTGA TGACATCTAC GTTTCCCCTA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT CTCTGGTAAG ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGC TGAAAGTTAA CGAAGTTAAC TTCGACAAAC CTGAAAACGC CCGCAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC GCAAACTCTC GTCTGCGTAT GGAACGTGGT AACGGTTCTA CGGAAGATTT AACCGCTCGC GTACTGGATC TGGCATCACC TATCGGTCGT GGTCAGCGTG GTCTGATTGT GGCACCGCCG AAAGCCGGTA AAACCATGCT GCTGCAGAAC ATTGCTCAGA GCATTGCTTA CAACCACCCG GATTGCGTAC TGATGGTTCT GCTGATCGAC GAACGTCCGG AAGAAGTCAC CGAGATGCAG CGTCTGGTTA AAGGTGAAGT TGTTGCTTCT ACCTTTGACG AACCCGCATC TCGCCACGTT CAGGTTGCGG AAATGGTGAT CGAGAAGGCC AAACGCCTGG TTGAGCACAA GAAAGACGTT ATCATTCTGC TCGACTCCAT CACTCGTCTG GCGCGCGCTT ACAACACCGT TGTTCCGGCG TCAGGTAAAG TGTTGACCGG TGGTGTGGAT GCCAATGCTC TGCATCGTCC GAAACGCTTC TTTGGTGCGG CGCGTAACGT GGAAGAGGGC GGCAGCCTGA CCATCATCGC TACCGCGCTT ATCGATACCG GTTCCAAGAT GGACGAAGTT ATCTACGAAG AGTTTAAAGG TACAGGCAAC ATGGAACTGC ACCTCTCTCG TAAGATCGCT GAAAAACGCG TCTTCCCGGC TATCGACTAC AACCGTTCCG GTACCCGTAA AGAAGAGCTG CTCACGACTC AGGAAGAACT GCAGAAAATG TGGATCCTGC GCAAAATCAT TCACCCGATG GGCGAAATCG ATGCAATGGA ATTCCTCATT AATAAACTGG CAATGACCAA GACCAATGAC GATTTCTTCG AAATGATGAA ACGCTCATAA
|
Protein sequence | MNLTELKNTP VSELITLGEN MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN FDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS
|
| |