Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4197 |
Symbol | rho |
ID | 6484905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4091894 |
End bp | 4093153 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642739451 |
Product | transcription termination factor Rho |
Protein accession | YP_002043154 |
Protein GI | 194444899 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 95 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAGT ATGGGGCTGG AAAACCTGGC CCGTATGCGC AAGCAGGACA TTATTTTTGC CATCCTGAAG CAGCACGCAA AGAGTGGCGA AGATATCTTT GGCGACGGTG TGCTGGAGAT ATTGCAGGAT GGATTTGGTT TCCTCCGTTC TGCAGACAGC TCCTACCTCG CCGGTCCTGA TGATATCTAC GTTTCCCCCA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT TTCTGGTAAG ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGT TGAAAGTTAA CGAAGTTAAC TACGACAAAC CGGAAAACGC CCGTAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC GCAAACTCTC GTCTGCGTAT GGAGCGTGGT AACGGTTCTA CCGAAGACTT AACGGCGCGC GTTTTGGATC TGGCTTCGCC GATCGGTCGC GGCCAGCGCG GTCTGATTGT CGCGCCGCCG AAAGCGGGTA AAACCATGCT GCTGCAGAAC ATCGCGCAGA GCATCGCGTA TAACCACCCA GACTGCGTGC TGATGGTGCT GCTGATTGAC GAACGTCCGG AAGAAGTAAC CGAGATGCAG CGTCTGGTGA AAGGCGAAGT GGTTGCGTCT ACCTTTGACG AACCGGCATC CCGCCACGTT CAGGTTGCCG AAATGGTTAT CGAGAAGGCG AAACGTCTGG TTGAACACAA GAAAGACGTT ATCATCCTGC TCGACTCCAT CACCCGTCTG GCGCGTGCCT ACAACACCGT GGTGCCGGCT TCCGGTAAAG TACTGACCGG TGGTGTGGAC GCTAACGCCC TGCATCGTCC GAAGCGTTTC TTCGGTGCGG CGCGTAACGT GGAAGAGGGC GGTAGCCTGA CTATCATCGC GACGGCGCTG ATCGATACCG GTTCCAAGAT GGACGAAGTT ATCTACGAAG AGTTTAAAGG CACCGGTAAC ATGGAGCTGC ATCTCTCGCG TAAGATTGCT GAAAAACGTG TCTTCCCGGC TATCGACTAC AACCGTTCCG GTACCCGTAA AGAAGAGCTG CTCACCACTC AGGAAGAGCT GCAGAAAATG TGGATCCTGC GTAAAATCAT CCATCCGATG GGTGAGATTG ACGCGATGGA ATTCCTCATT AACAAACTGG CGATGACCAA AACTAATGAC GACTTTTTCG AGATGATGAA GCGCTCATAA
|
Protein sequence | MNLTELKNTP VSELITLGES MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN YDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS
|
| |