Gene Information |
Locus tag | TM1040_2863 |
Symbol | rho |
ID | 4076397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3031417 |
End bp | 3032688 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 638008192 |
Product | transcription termination factor Rho |
Protein accession | YP_614857 |
Protein GI | 99082703 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
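The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 3031417
end_bp = 3032688
gene_length_bp = 1272
protein_length_aa = 423


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the derived values with the ones reported in the table.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, the reported Gene Length and Protein Length both agree with the coordinates.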
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.799607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.377015 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCGAGC AATCTCTGAA CCTTTCTGAT CTCAAGGCGC AAAGCCCGAA GGCCCTGTTG GCTATGGCTG AGGAGCTGGA GATCGAAAAC GCCTCCACTA TGCGGAAAGG CGAGATGATG TTCCAAATCC TGCGTGAACG CGCGGATGAA GGCTGGAACA TCGGAGGGGA CGGGGTTCTC GAGGTCCTGC AGGACGGATT TGGCTTCTTG CGCTCGCCCG AAGCAAACTA TCTGCCGGGT CCAGACGATA TCTATGTCTC TCCTGACATG ATCCGCCAGT TTTCGCTACG CACTGGGGAT ACGGTAGAGG GGCAGATCAA GGCACCACTG GAAAATGAGC GCTACTTTGC GCTGACCACG GTGACAAAGA TCAACTTTGA AGAGCCCGAA AAGGCCCGTC ACAAGATTGC ATTCGACAAT CTGACACCGC TCTATCCGGA CGAGCGTCTG CAGATGGAAA TCGAGGATCC GACGGTTAAG GATCGCTCAG CACGCGTGAT TGACCTTGTC TCGCCGATCG GCAAGGGCCA GCGTTCGCTT ATTGTAGCGC CGCCGCGAAC CGGTAAGACA GTTCTACTGC AAAACATCGC CCATTCGATC GAGCAGAACC ACCCTGAGTG CTATCTGATC GTGCTTCTGA TCGACGAGCG CCCTGAAGAA GTCACCGACA TGCAGCGCTC CGTCAAAGGC GAGGTCGTGT CCTCCACGTT TGACGAACCC GCAGCGCGCC ACGTGGCGGT TTCCGAGATG GTCATTGAAA AAGCCAAGCG TCTGGTCGAA CATAAACGTG ACGTTGTGAT CTTGCTCGAC TCGATCACGC GTCTCGGCCG TGCTTTCAAC ACGGTTGTTC CATCGTCAGG CAAGGTTCTG ACTGGTGGTG TGGATGCAAA CGCTCTGCAA CGTCCCAAGC GCTTCTTTGG GGCTGCGCGG AATATCGAAG AAGGTGGCTC TTTGACGATC ATCGCGACCG CGCTTATCGA CACGGGCAGC CGTATGGACG AGGTTATCTT TGAAGAATTC AAGGGTACGG GTAACTCTGA GATCGTTCTG GATCGCAAGA TTGCAGACAA GAGGGTCTTC CCTGCGATCG ACATTCTCAA GTCGGGAACG CGGAAGGAAG ACCTCCTGGT CGACAAAGGC GATCTGGCAA AAACCTTTGT GTTGCGTCGC ATCCTGAATC CGATGGGGAC CACCGACGCA ATTGAGTTCC TGCTGTCTAA GTTGAAGCAA ACAAAGACAA ACTCAGAGTT TTTTGACTCG ATGAACACCT AA
Protein sequence | MTEQSLNLSD LKAQSPKALL AMAEELEIEN ASTMRKGEMM FQILRERADE GWNIGGDGVL EVLQDGFGFL RSPEANYLPG PDDIYVSPDM IRQFSLRTGD TVEGQIKAPL ENERYFALTT VTKINFEEPE KARHKIAFDN LTPLYPDERL QMEIEDPTVK DRSARVIDLV SPIGKGQRSL IVAPPRTGKT VLLQNIAHSI EQNHPECYLI VLLIDERPEE VTDMQRSVKG EVVSSTFDEP AARHVAVSEM VIEKAKRLVE HKRDVVILLD SITRLGRAFN TVVPSSGKVL TGGVDANALQ RPKRFFGAAR NIEEGGSLTI IATALIDTGS RMDEVIFEEF KGTGNSEIVL DRKIADKRVF PAIDILKSGT RKEDLLVDKG DLAKTFVLRR ILNPMGTTDA IEFLLSKLKQ TKTNSEFFDS MNT
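The sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be stripped before measuring length or base composition. The following is a minimal sketch of that cleanup, with hypothetical helper names. It uses only the first three blocks of the gene sequence as sample input, so its GC fraction is not expected to match the 54% reported for the whole gene.

```python
def clean_sequence(raw):
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the record above.
first_blocks = "ATGACCGAGC AATCTCTGAA CCTTTCTGAT"

seq = clean_sequence(first_blocks)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Passing the full Gene sequence field to `clean_sequence` and then `gc_fraction` would give values to compare against the Gene Length and GC content fields.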