Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3125 |
Symbol | |
ID | 5540621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4046548 |
End bp | 4049754 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895244 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001433197 |
Protein GI | 156743068 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGC GATTCCAGCA GAAGCTCATT GTTCCGACAT CAGCGCGACC GCTGATCGAA CGTCCTCACG TCATCGCACA ACTTGAGCGC GCCATTCGCA GCAAGCGCGT CGTCGCGCTC GCCGCTCCTG CCGGATGGGG TAAAACGACT GCTCTGGCAC AGTGGGTCGC GCACACCACG ATGCCCACTG CATGGTATAC GCTCGATAGT GCCGATCGCG ATCCCCAGGT TTTTCTCGAT TATCTGTTGC ACAGCGTTGC CGATCTGGCG CCGGGAACGG CGGACATCGC CGCGCGTCTT GCCACCGCCA CCCCCCAGAG TCTCGCGGAG ATCTCGCAGC AGACGGCGCT GGCGCTTGCT GATGCGCCGG ATCACTTCGC CCTGATTCTT GATGATGTGC ATGTTCTGGA GGATGACCAG TCGCAATCCA TTCCGGGCGT CTCGCTGGTT TTTGCGCTGC TGGCATCCAT CGCCGAGTAT GCTGCCAGGT GTCATCTGGT GCTTGCATCG CGCACCCTGC CAGCTTTGCA CGGCATGGTG CGCATGGTTG CGCAACAGCG TGCTGCTGTG TTCGATTATA GCGTGTTGCA GTTCCAGCGT GCCGATACGC AGCGTCTTGC GGGAATGACC GCAGGGCTGA CGCTCTCCGA TGACGCCGCG GAGCAATTGA CCGCCGCAGT TGGCGGTTGG GTCACCGGTA TCGTGCTTTC ACTGGATCAA CCATCTGTCA ACGGCAGCAG CGTCCCCAAA CAGCATATTG TTGACTATCG CCTGGCGGAG ATCGCCACGC AACGCGATGC CATCATCGAA GCCAACACGA GCCAGGTGTA TGCGTATTTC GCCGAACAGA TTCTGTCGCC ACTACCAGCC GACCTGCAAC GGTTTCTTGA GGATACCAGC GTGCTTCAGG ACCTGTCGCC GCATCGCTGC GACCGATTGC GAAACACCAC CAACTCGGCG GAATATCTTG ACGATATCAA ACGCCGTGGT CTGTTCGTTT CGAGTCGTGC TGGATGGGTA TCGTACCATA GTCTGTTTCG TGAGTATCTG CGATCGCGCC TCGCGCGTGA TCCCCAACGG TATCGTTCGC TTTTGCGGAC TGCCGGCGAC CTCTATGCTA CTGAGGACGA CATCGAACGT GCGCTCGATT GCTATCTGGC AGCCAGCGAT TATCGACAGG CGCTCGAACT GCTGCGATCG GCAGTGCCAC GCCTGCGTCA GCGTTCGCGT CAGACCACCC TGCTGGCGTG CTTTGAACGC CTGCATCGCT TTCGATTGAC CGGTGATCGC CACATCCGGG ACGCTATGCC GTTCCCGGCG GTGCAAGATC CACGACTGCC CGCAACCCCG CCAGATCTCT TGTTGGCAGA GGCGCGCGTG TATAGCGATC TGGCACTCTG GGAACGCGCA TACCTGGCGC TCCAGCTCGC CGAAGCGTCC GGAAATGCTC AGATCCGTGC AGAAGCGCGG ATTCTCTCGG CAGAGGTGCA GGTGCTCCAG GGCGACTACG CTCGCGCTCA ACAAACATTG CGAACCGTCG ATGTGGAGAT TCTCGATGAT CGCCTGCGCC TGGAATATGC CATAGCCGCC GGACGCGCGC ACATTATGGC AGGCGAGGTC GCTGCCGCCA TTACGGCATT GGAACGCGCG CATACTCTTG CGACGACCCG CGCCGACGCC GTGGATCATC CCGGACCTCT CGCGGATATT TACGATAATC TCGGCTGGGC ATACGCTGCT CAGGGTGATC GTCAGTCGGC TCTCCGCCAC CTGAAGCGCG CCGACGCTTG CTGGCAGGCA TCCGGCAATC ACGGAAGGCG CGCACTGACG CTCAACAACA TGGGAGTCAT GGCGATGGAA GAAGGGCGGT ATGCCGAAGC GCGCGCGGCA TTCGACACCG GACTGGATAT TGCCCGACAT ACCGGGCTGC GACGTGAAGA AAGCGTGCTG CTGTGCAGCC TGGCAGAACT CATGCTGCGT CAGGGCGATG TTGAGCAATC GATCCATTGC GCTGCTGAGG CACACGCACT GGCCACAGCG TTCGACATCG CCAGCAGTGC GGAAGCAGCG GCGGCGACCG CGCTCTGGTC CGCTCTCCTG GTTGGCGATA GGGCAGCCAC ATCTGCGTGG TCCGACAGGA CAGCCGCAAT CGTAGCGCCT TTCCAACCGG AGGTGCGCGG GCGTCTGGCA TTGGCACGGG CAATGCTGGC GATGCAACAG AGCAACCCGG ACCCGGAACG CCTCGCCAAT TTTCTGGCAG AAGCGACGGT ATGCGAAGCC GCCCTCAGCA ACGAAGAGCG CGCCTATGTC GCACTGTTGC GCGCCGACAT CGCATACTCC CGCGCTGGCT GGCAATGCGC TGCTGCGGAA TGGGCGGATT TCGTGGCGCG CGCTAGCACG CTCTCCGAAC CCTTGCTGCA TCGTTTCGCA GCGGTGCATC GCAAACTCTT TGAAATCGCA GCACCTCACA CGCCGCTTGC CGCTCGCGCG ATTGCGGGCT TCCGTCAACC CGCGTCCATA CGCTGGCGGA TCACCGCGCT TGGCGGATTC GAGTGCCTTG TCGATGGCAT GCCGGTCGAA CTGTCGCAAC TGCACCGCGC CCTCCTGATC CGACTCCTCG ATGCCGGTCC TCCGGGACTG GCAGTGGAGC GCCTCTGGGA GGCGGTGTGG GGTGATGATC ACGTTTCGAT GCCCGCGTTA CACCAGGCGC TACGCCGGTT GCGCCTGCAA ACCGGTCTCT CCGTTTCGGC GCGCGAGGGG GCGGTTGCTA TTCGTAGCGG ATGGGACATG ATCGAGTACG ATGTGCAAGA ACTCGAACGT CTCCTCGAGA TGCCGACCCA TCCCAACATC ATTCAGCGCG TCATGGCGCT CTACCGTGGC GAGTTCCTTC CGGGAGCGCC CCTCAGCGCC AGTCTGTGGG TCGAGTCGCG TCGGGCGCAT CTCCAGCAGC GTTATCTGGA TGCTATCGAA CAATTCGCAC AATCCATCGA GCGGGACTCG CCGCAACAGG CGATGTTCTA CTACCAGCAT GTGTTGCAGA TCGATGGATG CCGCGAGCAT ACTGCCGCAC AACTAATGCG CCTGGCTGCG CGGTTCGGCA ACCGCACCCT GGTCACGGTC ACCTTTGAGC ATCTCAAGGG TGCGTTGCGC GCCCTTGGCG CATCGCCAGA GCCAGCAACT GCGGCATTGT ATCAGCAACT GACATAA
|
Protein sequence | MALRFQQKLI VPTSARPLIE RPHVIAQLER AIRSKRVVAL AAPAGWGKTT ALAQWVAHTT MPTAWYTLDS ADRDPQVFLD YLLHSVADLA PGTADIAARL ATATPQSLAE ISQQTALALA DAPDHFALIL DDVHVLEDDQ SQSIPGVSLV FALLASIAEY AARCHLVLAS RTLPALHGMV RMVAQQRAAV FDYSVLQFQR ADTQRLAGMT AGLTLSDDAA EQLTAAVGGW VTGIVLSLDQ PSVNGSSVPK QHIVDYRLAE IATQRDAIIE ANTSQVYAYF AEQILSPLPA DLQRFLEDTS VLQDLSPHRC DRLRNTTNSA EYLDDIKRRG LFVSSRAGWV SYHSLFREYL RSRLARDPQR YRSLLRTAGD LYATEDDIER ALDCYLAASD YRQALELLRS AVPRLRQRSR QTTLLACFER LHRFRLTGDR HIRDAMPFPA VQDPRLPATP PDLLLAEARV YSDLALWERA YLALQLAEAS GNAQIRAEAR ILSAEVQVLQ GDYARAQQTL RTVDVEILDD RLRLEYAIAA GRAHIMAGEV AAAITALERA HTLATTRADA VDHPGPLADI YDNLGWAYAA QGDRQSALRH LKRADACWQA SGNHGRRALT LNNMGVMAME EGRYAEARAA FDTGLDIARH TGLRREESVL LCSLAELMLR QGDVEQSIHC AAEAHALATA FDIASSAEAA AATALWSALL VGDRAATSAW SDRTAAIVAP FQPEVRGRLA LARAMLAMQQ SNPDPERLAN FLAEATVCEA ALSNEERAYV ALLRADIAYS RAGWQCAAAE WADFVARAST LSEPLLHRFA AVHRKLFEIA APHTPLAARA IAGFRQPASI RWRITALGGF ECLVDGMPVE LSQLHRALLI RLLDAGPPGL AVERLWEAVW GDDHVSMPAL HQALRRLRLQ TGLSVSAREG AVAIRSGWDM IEYDVQELER LLEMPTHPNI IQRVMALYRG EFLPGAPLSA SLWVESRRAH LQQRYLDAIE QFAQSIERDS PQQAMFYYQH VLQIDGCREH TAAQLMRLAA RFGNRTLVTV TFEHLKGALR ALGASPEPAT AALYQQLT
|
| |