Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0699 |
Symbol | |
ID | 5538164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 918439 |
End bp | 921723 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640892855 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001430839 |
Protein GI | 156740710 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00060152 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTAGTTT CTCGAGTTAC GGGCGAGTTC GGCGCTGCTG TGTCTCTTGT ATCGACGGTA CTGCTTCCGC GCCTGGAACC GCCGCCCCAG CCAGCGCGCA TCATCGAGCG CCCGCGTATT GACGGGCTAC TCGCTGCGAT CGCCGATTAT CCGGTGACGC TTGTTCTTGC GCCGGCCGGC AGCGGCAAAA CGGTCGCGCT TACCAGTTTT GCGCGCCACG GTGGATGGCC TGCCGCCTGG TGCCGCCTGG ACCCCGCAGA TACGCCATTG TCGCTGGCGT TGCACCTGGC GACCGCCTTT CGCCCGATCA CGGGCTTCGA TCATGCTCGC TTCGCTGCTG CGCATCCGGT GGATGTGCTC GATGGACTGA TCAATGCGCT GACAGCGCTA GGCGATGAGA CCTTACTGAT CCTGGACGAT CTCCACCATG CAGATCGACG CCCGGAATTG CGCGTTCTGA TCGAACATCT GATTGACCGT CTGCCGCCGC ATCTGCATCT GGTGCTGGTG AGCCGGGAAA TGCCTGCGCT CGCCTCCCTG CCGACGATTG CGGCGCGTGG CGAACTCTAT CGCCTGAGTC GCGCGCAACT GGCGTTCACC AATGCTGAAG CGCGTGATTT CTTTGCCGCA TACGGCTTGC CGCCGCATCC AATCGATGCC GAGCTGAACA CCATAGCGCG CGGGTGGCCC CTGGCACTCC GCTTCTTTGC CGCAGCGCGC ATCGACTCCG CAACACCCTC CGATCAACCG CCGACTCTCG AGCGCTTGCA GGAGAGTATC GCGCCGCACC TCGATGCATA TCTGGCGCGT GAGGTGCTGG GCGATCTGCC ATTCGCTGTG CGCACCTGGG TGCTTGGCAC GGCGTTGATG CGCTGGATCG ATGAAGCGGC ATGCGCTGCC GTCACCGAAC TGGCACATCT CCATATGACC GTCGATCTAT TGGAACGTTG GGAGTTGTTC ATTGAAACCC TCCCTGACGG GAGGCGTGTC TATCAACCTT TGCAGGCAGC CAGTTTCGCA CGCCTGGCGG AGCGCGATTT GCCCGATTGG CGTGCCTTCC ATGCGCAATT AGGGCACTAT TATGCTGCGC ACAACGACGA CCACAGCGCC GCGCACCATT TTCTCGCCGC CGAACGATGG GAAGACGCCG CTGCTGCGTT GAGCCGGATG GCGCTCTCCG GCGTGTCTGG CTCACAGGCC GCGGCGCTCC TGGACTGGAT CGACCAGATC CCGCCAGCGC ATCGCAACAG CGCCGCGCTC CTCGAGGCGC GCGCTGTCGC CGAACGCCGT CTCGGTCGCT ATACGCACGC GGTTGAACTG TACCGCAAAG CGGAAGAACA GTACCACGCA CAGGGCGACA TAGAGGGACA GGTGCGCGCG CTCCGCGGGC AGGCGGAGGT GTATATCGAT ACGGTGCAAC CTGCGCCGGC TGCGATCCTG TTGAAACGCG CCATGAAACT CTTGCCGCGC GATCGCCGCG CCGAACGCGC AACCATCCTC AGTCTCCAGG CAGAAAACTG GATCAATCGC GGTCGCGCCG ATGTGTCGGT CCTGATCATT GCAGCGGCGC ATCGCGAGGC ATACGGCAAA ACAGCGCACA CCGACGCAGT TGGAGGGTAT CGCCGGTCCG CCATCCTGTC GCCCCGCCTG TTGCTGCGCA GCGGCAGGCT GATCGATGCT CGTCGTCTTC TCGAAGAAGA ACTCGGTCTG GAAGCCGGCA GAGCGCGTGC GGAACATTCA TTGCACCGTG ATCCGCTCCT GCTGCTGGCA TTGATCGAGT GTATGCTGGG CAACGGCGTG CGCGCACTGG CGCTCGCACA ACGCGGGTTG CTCGAAGCGC AACGCGGCGA CTCGCCGCTG ACCGAAGCAA TTGCCCATAT GCGCCTCGGA CATGCCTGTC TTGTGACGGC ATCAAGCGAT GAGATGGCGC GATCCCACTA CCGCGCTGCG CTCGACATCA TCGAGGCAGT CGGCATTCCG CGCGCACGCG CTGAGGTGAT GCTGGGGCTG ACCCTGCTCG AAGGGCATGC CGGCAATCTC ACGGCTGCCG AAGCCTATGC CCGCGATGGT CTCGACCGCG CCCTGGAGGC GGGTGATGAG TGGACGGCAG CGCTCATCTG GTTGGCGCTC GGCAGCGTCG CTGCGGCTGC CGGCGATCCG CGTGCGCTGG AGTGGATTGG CGAGGCGCAT CAGCGGTTTG TGCGCGGCGA TGATCAGTAC GGACAAACCG TCGCGCTCCT CTGGGAAGCG CACGTTCATG TGCAGTCCGG CAATGAAATC GAAGCCGATA AGAAACTGGC GCGCCTCCTC GAACTGGTAA GCGCCCATGG ATTCGATGGC GTGTTGACCA CACGCACGCT GTTCGGTCCC CACGATCTGG CGATCCTGGT TCCGCTGCTC CTGCGGGGAC GGGTATTGCG CGGCGCAGCG CAGCGTCAGG CAGCGACCGC GTACCGGCTC TTGCGGCAGG GCTTCCCGTC GATCGCGGCT GATGATGCCG TCGATATCTA CCATCCCGGC TATACACTGC GGGTCTATAT GCTGGGGCGT TTCCGCATCT TCCGCGGCGC GCACGAGATT CAGGCGCGCG AGTGGCAACG AGAGAAAGCG CGGCAGTTGT TGCAACTGTT GCTGACCTAT CGTGGCATGT GGTTGCAACG TGAGCAGATC TGCGCCTGGC TCTGGCCCGA CAGCGAACCG GCAGCCGCCG AGCGGCAGTT CAAAGTGACA CTCAACGCGC TCAATAATGT GCTGGAACCG CGCCGACCGC CGCGTGTCGC GCCGTTCTTT ATTCGGCGGC AGGGGCTGGC GTATAGTTTT GCCCCATCTT ATGGATGCTG GATCGATGTG GACGAGTTCG AACTGCGCAC CGCCGGTGCG CCGGGACGCG ATCCAGAGGT CGAGATCCGC AGCCGCCGCA CAGCATTCCA TCTGTATCGC GGCGACTATC TCGCCGAGGC GCTGTACGAC CCCTGGACGC TCGAAGAACG TGAGCGCTTG CTGGCGCGGC ATCTGGCATC GACCGCGACC CTTGCCAGTT TGCTGGTTGA CCGCGGCGAT TTCGATGAAG CCATCGATCT GTGCGAACAC ATCATCCGCC GCGACCGTGG TTATGAGGAG GCGTACCAAA CCCTCATGCG CGCCTATGCC CGCGCAGGGA GCCGTTCCCA GGCGTTGCGC GCCTACGCGC GTTGCGTTCA GGCATTGCAG GACGAACTGG GAATAGAACC GCTCCCGGAG ACAACCGACC TCTGTGAGCG GATCAAGCGG AACGAGGCGG TGTAG
|
Protein sequence | MVVSRVTGEF GAAVSLVSTV LLPRLEPPPQ PARIIERPRI DGLLAAIADY PVTLVLAPAG SGKTVALTSF ARHGGWPAAW CRLDPADTPL SLALHLATAF RPITGFDHAR FAAAHPVDVL DGLINALTAL GDETLLILDD LHHADRRPEL RVLIEHLIDR LPPHLHLVLV SREMPALASL PTIAARGELY RLSRAQLAFT NAEARDFFAA YGLPPHPIDA ELNTIARGWP LALRFFAAAR IDSATPSDQP PTLERLQESI APHLDAYLAR EVLGDLPFAV RTWVLGTALM RWIDEAACAA VTELAHLHMT VDLLERWELF IETLPDGRRV YQPLQAASFA RLAERDLPDW RAFHAQLGHY YAAHNDDHSA AHHFLAAERW EDAAAALSRM ALSGVSGSQA AALLDWIDQI PPAHRNSAAL LEARAVAERR LGRYTHAVEL YRKAEEQYHA QGDIEGQVRA LRGQAEVYID TVQPAPAAIL LKRAMKLLPR DRRAERATIL SLQAENWINR GRADVSVLII AAAHREAYGK TAHTDAVGGY RRSAILSPRL LLRSGRLIDA RRLLEEELGL EAGRARAEHS LHRDPLLLLA LIECMLGNGV RALALAQRGL LEAQRGDSPL TEAIAHMRLG HACLVTASSD EMARSHYRAA LDIIEAVGIP RARAEVMLGL TLLEGHAGNL TAAEAYARDG LDRALEAGDE WTAALIWLAL GSVAAAAGDP RALEWIGEAH QRFVRGDDQY GQTVALLWEA HVHVQSGNEI EADKKLARLL ELVSAHGFDG VLTTRTLFGP HDLAILVPLL LRGRVLRGAA QRQAATAYRL LRQGFPSIAA DDAVDIYHPG YTLRVYMLGR FRIFRGAHEI QAREWQREKA RQLLQLLLTY RGMWLQREQI CAWLWPDSEP AAAERQFKVT LNALNNVLEP RRPPRVAPFF IRRQGLAYSF APSYGCWIDV DEFELRTAGA PGRDPEVEIR SRRTAFHLYR GDYLAEALYD PWTLEERERL LARHLASTAT LASLLVDRGD FDEAIDLCEH IIRRDRGYEE AYQTLMRAYA RAGSRSQALR AYARCVQALQ DELGIEPLPE TTDLCERIKR NEAV
|
| |