Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2354 |
Symbol | |
ID | 5209323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 2911204 |
End bp | 2914416 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640595960 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001276682 |
Protein GI | 148656477 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.607566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTGC GATTTCAGCA GAAGCTCGTT GTTCCAACGT CGGCGCGACC GCTGATCGAA CGACCGAACG TTCTCGCGCA GCTTGACCGT GCCATCCGGA GCAAGCGCGT CGTTGCGCTG GCTGCGCCCG CCGGCTGGGG GAAAACGACT GCGCTGGCGC AGTGGGTTGC GCAGCGCACG ATGCCGGTTG CCTGGTATAC CCTCGATAGC GCCGATCGCG ATCCGCACGT CTTCCTCGAC TACCTGCTCC ACAGCGTTGC CGATTTTGTC CCAGGAGCGC CGGGTATCGC AGCGCGGCTC GCCGAAACGA CCCCGCAAGG GCTGGCGGAG ATGATCCACC AGGTGGCGCT TGCATTTGCG GATGCGCCTG AACATTTTGC GCTGATCCTC GATGATGTCC ATGTACTTGA CGATGACCAG GCGCAATCCA TTCCGGGGGT ATCGCTCGTC TTTACGCTAC TCGCTTCGAT CGCGGAATAC GCCTCCCACT GCCATCTGGT GCTCGCTTCG CGCACCCTGC CGGTGTTGCA TGGCATGGTA CGCATGGTCG CACAGCAGCG CGCTGCGGTC TTCGATTATA GCGTGTTACA GTTCCATCGC GACGACACGC AGCGTCTTGC TGGCATAACA AGCGGGTTGA TCCTCTCCGA CGAAGCAGCG GAACAGTTGA CCGCATCAGT TGGCGGCTGG GTTACTGGCA TTGTCCTCTC GCTGGATCAA CCGGTTGCGC ACAAAGACGG GATGACAGGT CAGCAGGTTG TTGATCGACA CCTGATGCAC GTTCCCACAC CTGAAGAGGC GGTCATCGAA GCCAATACCA GTCAGGTATA CGCTTACTTT GCCGAGCAGA TTCTGGCACC GTTGCCAGCG GATCTGCAAC GTTTCCTCGA AGACACCAGC GTTCTGCACG ATCTATCGCC ACAGCGCTGC GATATACTGC GTTCTGCCGA TAACTCGGCA GAATATCTCG ATGAGATCAG GCGGCGCGGG TTATTCGTCT CCAGCCGCGC CGGATGGCTC TCGTATCACA GCCTGTTTCG CGATTATCTG CGCTCACGCC TGGCGCGCGA TCCGCAACGA TGCCGGCAGT TGCTGCGTTC CGCCGGTGAC CTGTATGCTG CTGAAGATGA TATCGAGCGT GCGCTTGACT GCTACCTGGC AGCCGGCGAC GATCAGCACG CTATCGATCT GATCCGTTCC GCCGTGCCGC GCCTGCGGCA GTGTTCGCGC CAGACAACCC TGCTGACCTG CTTCGAACGT CTGCACCGCG CGCGCAGGAC AAATGATCAG CGCACAGCCA GCGGAGGATT CCCGCCAGCA ACACGCGCCG CACGCAAGCA GTCGATCCCG CCCGATCTGC TGCTGGCAGA GGCGCGTGTC TACAGCGACC TGGCGCTCTG GGAACGTGCA TATCTGGCGC TTCACCTTGC CGGGGCGATC GGGGATGCTT CTATCCGTGC CGAGGCGCAG ATCCTCTCTG CCGAGTTGCA GGTGCTCCAG GGTGACTACG CCCGCGCTCA GCATGCGCTC AGATTCGTCG ATGTCAACAA CCTCGACAAT CGATTGCGCC TGGAATACCA TGTCGCGGCT GCCCGCGCGC ATATTATGGC AGGAGAAGTT GCCGCCGCCA TCACCGAACT CGAACGCGCG CATACGCTTG TCGCAACCCT GATCGACACC GTCGATAACC CTGCCGCACT GGCGGATATT TACGACAACC TTGGCTGGGC ATACGCCGCG CAGGGCGATC GTCCATCCGC CATTCGCTAT CTGAAACGCG CTGATGCCTG CTGGCAGTCT TCCGGTAACC AGGGAAGACG CGCGCTGACC CTCAACAATA TGGGGGTGAT GGCGATGGAG GAAGGGCGGT TTGCCGAAGC CCGCACAACA CTCGATCTGG GATTGGAGAT CGCCCGGCAG ACAGAGTTAC GACGTGAAGA AACGGTTCTG CTGTGTAGTC TGGCAGAACT CGATCTGCGT GAAGGCGATT TTGAACAGGC GATTCAGCGT TTTACCACAG CGCACGCCCT GGCAACCCGT CTCGATATTG CCAGCAGCGT GGAAGCAGCC GCAGCGGGAG CGCTCTGGGC TGCCATACTT TCCGACAACC TGACGCTCGC ACAGGCATGG CACGACACGG CAGCCTCCAT CGTGACACCG CTACAACCGG AGGTGCGCGG ACGACTGGCA TTGGCACGCG CAGCGCTCGC GTTGCAATGC CCGCACCCGG AATTTGCGTC TTTCGCCAGT TTATTAGCCG AAGCGACCAC ATACGAAGAA TCCCTCAGCG AGGATGAGCG CATCTATATG GCGCTATTGC GTACTGAACT CGCCTTCGCC CGGTCAGGCT GGCACGCAGC GGCATCGCTG TGGGAACAGT TCGCCGCACG CGCGGCCACG CTGCCCGAAA CCCTGTGGCG CCGCTTTGCG TCGATGCATC GTCCCCTCTT CGAAGCCGCT GCACCATACG ATCCGCGCGC CGGTCGCGCG ATTGCGCTGT CTCGCACAGC ATCTCCATCT TCTGTACGCT GGCGAATAAC GACACTCGGC GGATTCGCGT GTCTGGTCGA TGGGAAACCG GTCGATCTTT CGCAGTTGCA CCGCGCGCTG CTGGTGCGGC TCCTTGATGC CGGTCCGCAA GGTCTGGCAG TCGAGCGGCT GTGGGAAGCG GTTTGGGGCG ACGATGTTAT TTCGATGCCG GCGCTGCATC AGGCGCTGCG TCGGTTACGC CTGCAAACCG GACTCGCCGC TTCGGCGCGC GAAGGCGCTG TAGCAATCCG CAGCGGTTGG GACGCGATTG AATACGATGT GCGTGAACTG GAACGCATTC TTGAAACGCC TCCCAGCCTC GAATCCATCC AACGCGCAAT GACGCTCTAC GGCGGTGAGT TTCTGCCTGG CGCCCCGGCA AGCGCTGCGC TTTGGGTCGA GGCGCGTCGG GCGCACCTGC AGCAACGCTA CCTTGAGGCT ATTGAACAGT ACGCGCACTC CATCGAACAG AACTTGCCGC AGCAGGCGAT GTTCTACTAT CAGCACGTGC TTCAGATCGA TGGATGCCGC GAGCATACCG CCGCCCGGTT GATGCGCCTT GCGGCACGGT ACGGCAACCG CACCCTGGTC GCCGCCACCT TTGAGCATCT GAAAGGATCG TTACGCGCGC TCGGCGCCTC ACCAGAACCG GCAACCACTG CACTATACCG GCAACTGACC TGA
|
Protein sequence | MTLRFQQKLV VPTSARPLIE RPNVLAQLDR AIRSKRVVAL AAPAGWGKTT ALAQWVAQRT MPVAWYTLDS ADRDPHVFLD YLLHSVADFV PGAPGIAARL AETTPQGLAE MIHQVALAFA DAPEHFALIL DDVHVLDDDQ AQSIPGVSLV FTLLASIAEY ASHCHLVLAS RTLPVLHGMV RMVAQQRAAV FDYSVLQFHR DDTQRLAGIT SGLILSDEAA EQLTASVGGW VTGIVLSLDQ PVAHKDGMTG QQVVDRHLMH VPTPEEAVIE ANTSQVYAYF AEQILAPLPA DLQRFLEDTS VLHDLSPQRC DILRSADNSA EYLDEIRRRG LFVSSRAGWL SYHSLFRDYL RSRLARDPQR CRQLLRSAGD LYAAEDDIER ALDCYLAAGD DQHAIDLIRS AVPRLRQCSR QTTLLTCFER LHRARRTNDQ RTASGGFPPA TRAARKQSIP PDLLLAEARV YSDLALWERA YLALHLAGAI GDASIRAEAQ ILSAELQVLQ GDYARAQHAL RFVDVNNLDN RLRLEYHVAA ARAHIMAGEV AAAITELERA HTLVATLIDT VDNPAALADI YDNLGWAYAA QGDRPSAIRY LKRADACWQS SGNQGRRALT LNNMGVMAME EGRFAEARTT LDLGLEIARQ TELRREETVL LCSLAELDLR EGDFEQAIQR FTTAHALATR LDIASSVEAA AAGALWAAIL SDNLTLAQAW HDTAASIVTP LQPEVRGRLA LARAALALQC PHPEFASFAS LLAEATTYEE SLSEDERIYM ALLRTELAFA RSGWHAAASL WEQFAARAAT LPETLWRRFA SMHRPLFEAA APYDPRAGRA IALSRTASPS SVRWRITTLG GFACLVDGKP VDLSQLHRAL LVRLLDAGPQ GLAVERLWEA VWGDDVISMP ALHQALRRLR LQTGLAASAR EGAVAIRSGW DAIEYDVREL ERILETPPSL ESIQRAMTLY GGEFLPGAPA SAALWVEARR AHLQQRYLEA IEQYAHSIEQ NLPQQAMFYY QHVLQIDGCR EHTAARLMRL AARYGNRTLV AATFEHLKGS LRALGASPEP ATTALYRQLT
|
| |