Gene Information |
Locus tag | Rcas_3757 |
Symbol | |
ID | 5541259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4927506 |
End bp | 4928507 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640895867 |
Product | MerR family transcriptional regulator |
Protein accession | YP_001433814 |
Protein GI | 156743685 |
COG category | [K] Transcription |
COG ID | [COG0789] Predicted transcriptional regulators |
TIGRFAM ID | |
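
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C over A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(4927506, 4928507)
residues = protein_length(length_bp)
```

With the values in this record, the sketch gives 1002 bp and 333 residues, matching the Gene Length and Protein Length fields. gc_percent can be applied to the gene sequence in the Sequence block, since it skips the spaces between the 10-base groups.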
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00211459 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00419816 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGGAGC AGGTCCATCG CCTGCTGGCG CTGTCCGACG TTCCACGGTA CAACATTAAG GCGGTCGTCC AGCAGACTCA GGTCAATGTT TCGACGCTAC GCGCATGGGA GCAGCGCTAT GGCGTTCCAC GCCCGACTCG CTCCGATCAC GGTCATCGGC TCTACTCGCA GCGCGACATC GAAATCATCA AGTGGCTCAA GCAGTGTACG GAGGAAGGAC TAGCGATCAG CCAGGCAGTC GCACTGCTAC GCGACATCAG TGATACCGGC GATATCGCCC CGCGCGCCCC GCAGCCTCCC CCGCCAACGC TCGCCGACGC TGGCTGGCCC GACCTGCGCA CCCAACTGAC CGAGGCGTTA CTCAGCGCCA ACCTGCGGCA GGCGCACTTG CTGGTCAATA CGGCGGTTGC GCTCTTCCCC ATCGAGACGC TGGTGCTCGA TCTCTTTCAG CCGATGCTGA TCGAGATCGG CGACCGCTGG GCGCAGGGCG ACGTCTGTGT CGCAGAAGAG CGCGTGGTCA CGAACTTCGT GCGCCAACGA CTCCTTGGCT TGTTGCAAAT CCACGCGCCG TTCGCCACCG GTCCGCGCCT GATCGCCGGA TGCGCGCCGG AAGAACAGCA CGAGATCGGG TTGATTATGT TCTCGCTCCT GATGGAGCAG CGCGGTTGGG AACTCATCTA TCTGGGACAA ACGGTATCGG CGGAAGGGCT GGATGGCTTT CTGGTGCGAA TGGCGCCGGC GCTCATCTGT ATGTCGGTCT CGATGGCGGA ACATGTGCCC GGACTGCTGG AAATTGCGCG GATCGTCGAA AATCGCCGCC GTCATCGGTT GCTGTTCGCC TACAGCGGTC AGGTGTTCGA CCGCCATCCT GAACTTCGCG GGCGCATTCC CGGCATCTTT CTGGGCAACG ATTTGCGCGA GGCGGTGATC CGGGCGGACG ATCTCGGCGA GGAGATCGAC CCGGAACGAT GGGCGCGACA GGCGCACTTT TTTCGCCATT GA
|
Protein sequence | MLEQVHRLLA LSDVPRYNIK AVVQQTQVNV STLRAWEQRY GVPRPTRSDH GHRLYSQRDI EIIKWLKQCT EEGLAISQAV ALLRDISDTG DIAPRAPQPP PPTLADAGWP DLRTQLTEAL LSANLRQAHL LVNTAVALFP IETLVLDLFQ PMLIEIGDRW AQGDVCVAEE RVVTNFVRQR LLGLLQIHAP FATGPRLIAG CAPEEQHEIG LIMFSLLMEQ RGWELIYLGQ TVSAEGLDGF LVRMAPALIC MSVSMAEHVP GLLEIARIVE NRRRHRLLFA YSGQVFDRHP ELRGRIPGIF LGNDLREAVI RADDLGEEID PERWARQAHF FRH