Gene Information |
Locus tag | Haur_3377 |
Symbol | rho |
ID | 5735238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4256731 |
End bp | 4257981 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280524 |
Product | transcription termination factor Rho |
Protein accession | YP_001546141 |
Protein GI | 159899894 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
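The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption here)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4256731, 4257981)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1251 bp and 416 aa, which matches the Gene Length and Protein Length fields.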
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000542218 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGTCAACC TTGCTGATTT AGAAAACAAA ACGTTGAGCG ACCTCCAAGA AATGGCTCGG GAACTCGATA TCTCTGGCTA TAGTCGCCTC AAGAAACAAG ACCTCATTTA CAAATTAATT CAGGCTCAAA CTGAACAAGC AGGCAATATT TTCAATACGG GGATTCTCGA TATCGTTTCT GACGGTTTCG GGTTTCTGCG CAGCGACCGT ATGTTGCCTG GCCCCGATGA TGTGTATGTC TCGCAAACTC AAATTCGCCG CTTTGGCCTC CGTACTGGCG ACCGCATCTC CGGCCAGATT CGTCCTCCCA AAGAAAGTGA ACGCTATTAT AGTTTGCTGC GGGTTGAATT AGTGAATGGT ATGGACCCTG AGCAAGCTCG TAAGCGCCCC CATTTTGAAA AATTGACCCC GATTTTTCCC AATGAACGCT TTATTTTGGA GACTGAGCCG CAAATTCTTT CAACTCGTTT GGTTGATTTG ATTGCGCCAA TCGGTCGCGG TCAGCGTGGG CTGCTGGTTT CGCCCCCCAA AGCTGGTAAA ACGATGCTGA TGAAGGCGAT TGCTAATGGC ATCACCACCA ATTATCAAGA TGCCCATTTG ATGGTCTTGT TGATTGGCGA GCGTCCCGAA GAAGTCACCG ATATGCGCCG CTCAGTCCGC GGTGAGGTGA TTGCTTCGAC CTTCGATGAG CCAGTTGAAG ATCATACCAA AGTTTCTGAA ATGACCCTCG AACGAGCTAA GCGCTTGGTC GAAGGCGGCC AAGATGTGGT GATCTTGATG GACTCTATCA CCCGTTTGGC ACGGGCTTAC AACTTGGATA TGCCACCATC AGGCCGAACC TTGACTGGTG GGATCGACCC AGTGGCCTTG TACCCTCCAA AACGCTTCTT TGGTGCTGCC CGTAACATCG AAGGCGGCGG CTCGTTGACA ATCATTGCTA CCTGTTTGGT CGATACTGGT AGCCGTATGG ACGACGTGAT TTACGAAGAA TTCAAGGGTA CTGGCAACAT GGAATTGCAC TTGGATCGGC GCTTGGCTGA ACGCCGCACC TATCCGGCAG TCGATATTGC CCGCTCTTCA ACCCGTCGCG ATGAGTTGTT GCTCTTGCCT GAGCAATTGC GCCAAGTTTG GACGTTGCGC CGTATGGTCA GTATGTTGGG CGAAAACGAA GGCACTGAAT TGGTGCTGAC GCGCATGTCC AAAACCCGTA CCAACGACGA GTTCTTGCTC ACTCTCAACA AGAGCCTCTA G
Protein sequence | MVNLADLENK TLSDLQEMAR ELDISGYSRL KKQDLIYKLI QAQTEQAGNI FNTGILDIVS DGFGFLRSDR MLPGPDDVYV SQTQIRRFGL RTGDRISGQI RPPKESERYY SLLRVELVNG MDPEQARKRP HFEKLTPIFP NERFILETEP QILSTRLVDL IAPIGRGQRG LLVSPPKAGK TMLMKAIANG ITTNYQDAHL MVLLIGERPE EVTDMRRSVR GEVIASTFDE PVEDHTKVSE MTLERAKRLV EGGQDVVILM DSITRLARAY NLDMPPSGRT LTGGIDPVAL YPPKRFFGAA RNIEGGGSLT IIATCLVDTG SRMDDVIYEE FKGTGNMELH LDRRLAERRT YPAVDIARSS TRRDELLLLP EQLRQVWTLR RMVSMLGENE GTELVLTRMS KTRTNDEFLL TLNKSL
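The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below strips the whitespace, checks that a sequence has the shape of a complete CDS under translation table 11, and computes a GC fraction. The helper names are hypothetical, the codon sets are my reading of NCBI translation table 11, and only the first 30 bases of the gene sequence are used as sample input.

```python
# Start and stop codons assumed for NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(displayed: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a table 11 start and stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in TABLE11_STARTS
        and seq[-3:] in TABLE11_STOPS
    )


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("GTGGTCAACC TTGCTGATTT AGAAAACAAA")
print(len(prefix), prefix[:3] in TABLE11_STARTS, gc_fraction(prefix))
```

Applied to the full gene sequence, the same helpers could be compared against the GC content field. The gene begins with GTG while the protein begins with M, which is consistent with GTG serving as an initiation codon under table 11.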