Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3453 |
Symbol | rho |
ID | 5712511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3634311 |
End bp | 3635582 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641269382 |
Product | transcription termination factor Rho |
Protein accession | YP_001534787 |
Protein GI | 159045993 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAG AGCGTCTCGC CCTTGCCGAT CTCAAAGCTA TGAGCCCTGC TGATCTTCTG CCCCTGGCCG AGGAGCTGGA GGTTGAAAAC GCATCCACCA TGCGGAAAGG CGACATGATG TTCGCCATCC TCAAGGAGCG TGCAGAGGAT GGATGGGAAA TTTCTGGCGA TGGCGTTCTG GAAGTCATGC AGGACGGGTT CGGGTTTCTC CGCTCCCCCG AGGCGAACTA TCTGCCGGGC CCGGACGACA TCTATGTTTC TCCCGAGAAA ATTCGGCAGC ACTCCCTGCG CACTGGCGAT ACCGTCGAAG GGGTGATCCA GGCCCCGCGG GAGAACGAAC GCTATTTCGC GATCACCTCC GTCAGCCTGG TAAACTTCGA AGACCCGGAG ATGGCACGTC ACAAGGTGAG CTTTGATAAC TTGACGCCTC TTTATCCGGA CGAGCGGCTG AAGATGGAGA TCGAGGATCC GACGATCAAG GATCGGTCTG CCCGGATCAT CGACCTGGTG GCACCGATCG GGAAGGGGCA GCGGTCGCTG ATCGTGGCTC CGCCGCGGAC GGGCAAGACG GTCCTGTTGC AGAATATCGC CAAGAGCATC GCAGCCAATC ATCCTGAATG CTATCTGATG GTGCTGCTGA TCGATGAACG TCCAGAAGAA GTCACGGACA TGCAGCGTTC GGTGAAAGGG GAGGTGATTT CCTCCACTTT CGATGAGCCG GCCTCGCGCC ACGTGGCAGT GTCCGAGATG GTGATCGAGA AGGCCAAGCG CCTGGTGGAG CACAAGCGGG ACGTGGTTAT CTTGCTGGAT TCGATCACTC GTCTTGGTCG GGCTTTCAAC ACGGTCGTTC CGTCTTCGGG CAAGGTGCTG ACCGGGGGTG TGGATGCCAA TGCGTTGCAG CGGCCGAAGC GGTTCTTCGG TGCGGCCCGA AACATCGAAG AAGGCGGGTC GCTGACAATC ATCGCAACCG CATTGATCGA TACAGGCTCC CGTATGGACG AAGTGATCTT TGAAGAGTTC AAGGGTACGG GCAACTCCGA GATCGTGCTT GACCGCAAGG TGGCCGACAA GCGGGTCTTC CCGGCGATGG ACATCCTCAA ATCCGGCACG CGCAAAGAGG ATCTCCTTGT CGATGCCAAG GATCTTCAGA AGACCTTTGT GCTGCGTCGC ATTCTGAATC CCATGGGGAC CACGGATGCC ATCGAGTTCC TGATCTCGAA GCTGAAGCAG ACGAAGACCA ATTCCGAGTT CTTCGATTCG ATGAACACCT GA
|
Protein sequence | MTQERLALAD LKAMSPADLL PLAEELEVEN ASTMRKGDMM FAILKERAED GWEISGDGVL EVMQDGFGFL RSPEANYLPG PDDIYVSPEK IRQHSLRTGD TVEGVIQAPR ENERYFAITS VSLVNFEDPE MARHKVSFDN LTPLYPDERL KMEIEDPTIK DRSARIIDLV APIGKGQRSL IVAPPRTGKT VLLQNIAKSI AANHPECYLM VLLIDERPEE VTDMQRSVKG EVISSTFDEP ASRHVAVSEM VIEKAKRLVE HKRDVVILLD SITRLGRAFN TVVPSSGKVL TGGVDANALQ RPKRFFGAAR NIEEGGSLTI IATALIDTGS RMDEVIFEEF KGTGNSEIVL DRKVADKRVF PAMDILKSGT RKEDLLVDAK DLQKTFVLRR ILNPMGTTDA IEFLISKLKQ TKTNSEFFDS MNT
|
| |