Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3701 |
Symbol | |
ID | 5541203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4837476 |
End bp | 4840670 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640895812 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001433759 |
Protein GI | 156743630 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000037794 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCTACTAT GCTACGAGAA TTTAGAATTC GTGTCGCAGC TCTCGCTCAC TCTATTGGGG ACACTGGCGA TCACCCTGGA TGGCCAGCCC GTCGCCGGCA TCGAGTCGGA CAAGGCGCGC GCGCTGCTGG TCCGGCTGGC GCTGGAGCCG GAGCGCGCTT TCCGACGCGA GGCGCTGAGC GCGCTCTTGT GGCCCGAAGC CGCGCCCGCA CAGGCTTCCC AGAACCTCCG TCAGGCGCTC TACAACCTGC GCCGCGCCCT CGGCGAAGCC TTTTTGCTGA CGACGCCCCA CACCGTGCAG TTCAACGCGG CTGCCGACGT GACGGTGGAC GCGCTGACCT GGCGCCGCCT GTGGAGTGAG ACGCAGACGC ATCGCCACCG CCGCCGCGAG ACCTGCCGTC CCTGCCTGGA ACGCCTGGCG CAGGCCATCG CCCTCTATCG CGGCGACCTG CTGGCCGGGT TCGCGCTCCA AGATAGCGCA GAGTTCGACG ACTGGCTGGC CGTCGAGCGC GAACGGCTGC ACGTGCAGGC CCTGGAGGCG CTGACGTTGC TGGCGAACGC CGCCGAGCGG CGCGGCGATT ACCCTGCCGC GCAGGAGTAC GTGCGGCGAT TGTTGGCCCT GGAACCCTGG CAGGAGGCCG CGCATCGGCA TCTGATGCGC CTGCTGGCCC TGGACGGCCG GCGCGCGGCG GCGCTGGCGC AGTTCGAGGT CTGTCGCCGC GCGCTGGCCG ACGAGCTGGG CCTGGCGCCG GATGAGGAAA CCCGCGCCCT CCACGCGCGC ATCCGCGCCG GCGAGCCGCT CTCCGCCGCG ATGCCGCTTC CCCCCGCGCC GCCCACCGAC CTGCCGCTGC AACTCACCTC GTTCATCGGC CGTGAGCGCG AGTTGACGCT GTTGAGCGAG CGCCTGAGCA ACCCCGCCTA CCGCCTCATC ACCCTCACCG GGCCGGGCGG GGTGGGAAAG ACGCGCCTGG CGCTGCAACT CGCGGCGACC CTAGCGGAAC AGTTTGCCGA TGGCGTCTCT TGGATCTCGC TGAGCGACGC CGTCACCGAA AGCAACCTGA TTCTGGCGAT TGCCGACGCG CTGCACCTGC GCCTTTCCGG CGCACAAGAC CTGCGCGCGC AACTCTTGCA GGCGCTGCGT CACGACCGGC GCGACCTGCT GCTGGTGCTG GACAATTTCG AGCAACTGTT GCCTGTCGGC GGCGCGACGC TGGTACTGGA GGTGATGCGC GCCGCCCCGC GCCTCACGCT GCTGGTCACC TCGCGCGAAC GGCTGAATCT GCAAGCCGAG TCGGTGCTCC CGCTAGAGGG CCTGGGCTAC GATCTGCCCG CTCCTGACGC GCCGCCCTCC GAAGCCGCGC AGTTGTTCGT CGAGCGCGCC GGACGTGCCC GAATGGACCT GAGCGTAGGC GCGGCAGACC AGACGATGAT CGCGGAGATT TGTCATCTGC TGGAAGGCTT GCCGCTGGGC ATTGAGCTGG CCGCCGCTTG GGCCGGTGAA ATGTCGCTGG AAGGCATCGC TGAAGCCATC ACCGTCACGC GCGATTTCCT CGCCTCCAGC AGTCCCGACA TGCCCGACCG TCACCGCAGC CTGCGCGTGG TCTTTGAAGG CTCCTGGCAA TTGCTTTCTC CGGAAGAGCA GTTCGCGCTG ATGCGGGTTT CCATCTTTCG CGGCGGCTTT CAATCTGAGG CCGCGCAGCA CGTCGCCGGG GTGAGCGCGG CAATGCTCAG CCGTCTGGTG CGCAAATCGT TGCTCTTCCT GGACGGGCCG CGCGTCCGCT ACGGGCTGCA CGGCGACATC CGCTACTATG CGGCGGAGAA ACTGGCGGCG CAGCCGTCCA CTGCCCAGGA GATGGCCGCG CGTCACGCCG CGTATTTTGC CGACCTGGTG CAGCAGCGAG AACAGGCCCT GCGCGGACGC GCGCAGCAGG CAGTGCAGGC CGAGCTGGAA CCCGAATGGC AGAACGTGCT CGCCGCGTGG CAATGGGCCA TCGCCCACGG CGACGAGGCG CTGCTCACCC GCCTGACCCA CGGGCTGTTT GCCTTTTGCG AAGCCAAATC CTGGTTCCGC GAAGGTGCGG CCCTCTTCCA GCCCGCTTTA GAGCGGATGC GGGAAGCGGC CCGCGCCGAC CTGGCGGCCG CCGCGCTGCT CCGTCGCCTG CTGGGGCGGC AGGCCGTCTT TTGCCGACAA CTCTCGCAGT ACGCGCAAGC GCATCAGTTA ATTGAAGAGG GCCTAGCCTT GCCGGGCCTG CCCGACGACG AGGAGCGCGC GTTCCTGCTG TATCAAAAGT CCTGGGTGGA TTTTTTGCAG GCGCGGTACG CGCAGGCGCG CCAGTGGGCC GAGGCAAGCC TGGAGCGTTA CCGCGCACTG GGGCAGCCGG TGGGCATCGG CGACAGCCTC TATATGCTCG GCTGGACGGC CTACGAGTTG GGGGATTTTG CCGCCGCCGA GGCGCTCTGC CTGGAGGCGC GGGCGGTGTG TGCGCAGGCC GATTATGCCT GGGGAGTGCA GTACGCCATC TATGGGCTGG GGCTGGTGCG ACGCGCGCAG GGGGACTATG CCGCCGCCCG CCGCTGTTTC GAGGAGAACA TGACGTTTTG CGACGCCATC GGCTACCTGT GGGGCGTGGC ACAGGCGCGC ATCAACCTGG GGCTGGTGGC ACTGGCCGAG GGCAACGTGG AAGAGTCCGA AGCGCACTTT CAAAAGAGCC TGCTCATCGG CGAGCAGATC GGCAATGCAT GGGTCAACGC GCAAAGCCAG AAGGGCCTGA GCGATGCGGC TTTGGTGCGC CGCGACCTAT CCACTGCGCT GACCCTGGCG GAGCGCAGCC TGGCGCTCTA TCAAGCGATG CAGGATCGGG ATGGGATGGC GGATAGCTTG CTGCTGCTGA GCCAGGTCGC GCTGGCAAGC GGTGACCTCC CCGCCGCCCA CCGCGCCCTG ACGGAAGCCG AGGGGTTGAT CCAGGCTACG GAAAACGGCT TCCGCGCCGC CAGGGCGCTG GTTCAGCGGG TGGGCATCCT GCTGCGGGAA GGGGAAACTG CGCAGGCGCG GGCGCTGCTG GAGGAAACGC TGCGCCATCC GGCCTGCGAG GCGTCCATCC ACAAGCGGGC TATGGAGGCG CTGCGTGGAG CCGGCTACCT TGCCCCGCCT GCCGAAAGCG ACATTCGTCA CAGAACACCT GAGTGGAAGC CATAG
|
Protein sequence | MLLCYENLEF VSQLSLTLLG TLAITLDGQP VAGIESDKAR ALLVRLALEP ERAFRREALS ALLWPEAAPA QASQNLRQAL YNLRRALGEA FLLTTPHTVQ FNAAADVTVD ALTWRRLWSE TQTHRHRRRE TCRPCLERLA QAIALYRGDL LAGFALQDSA EFDDWLAVER ERLHVQALEA LTLLANAAER RGDYPAAQEY VRRLLALEPW QEAAHRHLMR LLALDGRRAA ALAQFEVCRR ALADELGLAP DEETRALHAR IRAGEPLSAA MPLPPAPPTD LPLQLTSFIG RERELTLLSE RLSNPAYRLI TLTGPGGVGK TRLALQLAAT LAEQFADGVS WISLSDAVTE SNLILAIADA LHLRLSGAQD LRAQLLQALR HDRRDLLLVL DNFEQLLPVG GATLVLEVMR AAPRLTLLVT SRERLNLQAE SVLPLEGLGY DLPAPDAPPS EAAQLFVERA GRARMDLSVG AADQTMIAEI CHLLEGLPLG IELAAAWAGE MSLEGIAEAI TVTRDFLASS SPDMPDRHRS LRVVFEGSWQ LLSPEEQFAL MRVSIFRGGF QSEAAQHVAG VSAAMLSRLV RKSLLFLDGP RVRYGLHGDI RYYAAEKLAA QPSTAQEMAA RHAAYFADLV QQREQALRGR AQQAVQAELE PEWQNVLAAW QWAIAHGDEA LLTRLTHGLF AFCEAKSWFR EGAALFQPAL ERMREAARAD LAAAALLRRL LGRQAVFCRQ LSQYAQAHQL IEEGLALPGL PDDEERAFLL YQKSWVDFLQ ARYAQARQWA EASLERYRAL GQPVGIGDSL YMLGWTAYEL GDFAAAEALC LEARAVCAQA DYAWGVQYAI YGLGLVRRAQ GDYAAARRCF EENMTFCDAI GYLWGVAQAR INLGLVALAE GNVEESEAHF QKSLLIGEQI GNAWVNAQSQ KGLSDAALVR RDLSTALTLA ERSLALYQAM QDRDGMADSL LLLSQVALAS GDLPAAHRAL TEAEGLIQAT ENGFRAARAL VQRVGILLRE GETAQARALL EETLRHPACE ASIHKRAMEA LRGAGYLAPP AESDIRHRTP EWKP
|
| |