Gene Rcas_0699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0699 
Symbol 
ID5538164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp918439 
End bp921723 
Gene Length3285 bp 
Protein Length1094 aa 
Translation table11 
GC content63% 
IMG OID640892855 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001430839 
Protein GI156740710 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00060152 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTAGTTT CTCGAGTTAC GGGCGAGTTC GGCGCTGCTG TGTCTCTTGT ATCGACGGTA 
CTGCTTCCGC GCCTGGAACC GCCGCCCCAG CCAGCGCGCA TCATCGAGCG CCCGCGTATT
GACGGGCTAC TCGCTGCGAT CGCCGATTAT CCGGTGACGC TTGTTCTTGC GCCGGCCGGC
AGCGGCAAAA CGGTCGCGCT TACCAGTTTT GCGCGCCACG GTGGATGGCC TGCCGCCTGG
TGCCGCCTGG ACCCCGCAGA TACGCCATTG TCGCTGGCGT TGCACCTGGC GACCGCCTTT
CGCCCGATCA CGGGCTTCGA TCATGCTCGC TTCGCTGCTG CGCATCCGGT GGATGTGCTC
GATGGACTGA TCAATGCGCT GACAGCGCTA GGCGATGAGA CCTTACTGAT CCTGGACGAT
CTCCACCATG CAGATCGACG CCCGGAATTG CGCGTTCTGA TCGAACATCT GATTGACCGT
CTGCCGCCGC ATCTGCATCT GGTGCTGGTG AGCCGGGAAA TGCCTGCGCT CGCCTCCCTG
CCGACGATTG CGGCGCGTGG CGAACTCTAT CGCCTGAGTC GCGCGCAACT GGCGTTCACC
AATGCTGAAG CGCGTGATTT CTTTGCCGCA TACGGCTTGC CGCCGCATCC AATCGATGCC
GAGCTGAACA CCATAGCGCG CGGGTGGCCC CTGGCACTCC GCTTCTTTGC CGCAGCGCGC
ATCGACTCCG CAACACCCTC CGATCAACCG CCGACTCTCG AGCGCTTGCA GGAGAGTATC
GCGCCGCACC TCGATGCATA TCTGGCGCGT GAGGTGCTGG GCGATCTGCC ATTCGCTGTG
CGCACCTGGG TGCTTGGCAC GGCGTTGATG CGCTGGATCG ATGAAGCGGC ATGCGCTGCC
GTCACCGAAC TGGCACATCT CCATATGACC GTCGATCTAT TGGAACGTTG GGAGTTGTTC
ATTGAAACCC TCCCTGACGG GAGGCGTGTC TATCAACCTT TGCAGGCAGC CAGTTTCGCA
CGCCTGGCGG AGCGCGATTT GCCCGATTGG CGTGCCTTCC ATGCGCAATT AGGGCACTAT
TATGCTGCGC ACAACGACGA CCACAGCGCC GCGCACCATT TTCTCGCCGC CGAACGATGG
GAAGACGCCG CTGCTGCGTT GAGCCGGATG GCGCTCTCCG GCGTGTCTGG CTCACAGGCC
GCGGCGCTCC TGGACTGGAT CGACCAGATC CCGCCAGCGC ATCGCAACAG CGCCGCGCTC
CTCGAGGCGC GCGCTGTCGC CGAACGCCGT CTCGGTCGCT ATACGCACGC GGTTGAACTG
TACCGCAAAG CGGAAGAACA GTACCACGCA CAGGGCGACA TAGAGGGACA GGTGCGCGCG
CTCCGCGGGC AGGCGGAGGT GTATATCGAT ACGGTGCAAC CTGCGCCGGC TGCGATCCTG
TTGAAACGCG CCATGAAACT CTTGCCGCGC GATCGCCGCG CCGAACGCGC AACCATCCTC
AGTCTCCAGG CAGAAAACTG GATCAATCGC GGTCGCGCCG ATGTGTCGGT CCTGATCATT
GCAGCGGCGC ATCGCGAGGC ATACGGCAAA ACAGCGCACA CCGACGCAGT TGGAGGGTAT
CGCCGGTCCG CCATCCTGTC GCCCCGCCTG TTGCTGCGCA GCGGCAGGCT GATCGATGCT
CGTCGTCTTC TCGAAGAAGA ACTCGGTCTG GAAGCCGGCA GAGCGCGTGC GGAACATTCA
TTGCACCGTG ATCCGCTCCT GCTGCTGGCA TTGATCGAGT GTATGCTGGG CAACGGCGTG
CGCGCACTGG CGCTCGCACA ACGCGGGTTG CTCGAAGCGC AACGCGGCGA CTCGCCGCTG
ACCGAAGCAA TTGCCCATAT GCGCCTCGGA CATGCCTGTC TTGTGACGGC ATCAAGCGAT
GAGATGGCGC GATCCCACTA CCGCGCTGCG CTCGACATCA TCGAGGCAGT CGGCATTCCG
CGCGCACGCG CTGAGGTGAT GCTGGGGCTG ACCCTGCTCG AAGGGCATGC CGGCAATCTC
ACGGCTGCCG AAGCCTATGC CCGCGATGGT CTCGACCGCG CCCTGGAGGC GGGTGATGAG
TGGACGGCAG CGCTCATCTG GTTGGCGCTC GGCAGCGTCG CTGCGGCTGC CGGCGATCCG
CGTGCGCTGG AGTGGATTGG CGAGGCGCAT CAGCGGTTTG TGCGCGGCGA TGATCAGTAC
GGACAAACCG TCGCGCTCCT CTGGGAAGCG CACGTTCATG TGCAGTCCGG CAATGAAATC
GAAGCCGATA AGAAACTGGC GCGCCTCCTC GAACTGGTAA GCGCCCATGG ATTCGATGGC
GTGTTGACCA CACGCACGCT GTTCGGTCCC CACGATCTGG CGATCCTGGT TCCGCTGCTC
CTGCGGGGAC GGGTATTGCG CGGCGCAGCG CAGCGTCAGG CAGCGACCGC GTACCGGCTC
TTGCGGCAGG GCTTCCCGTC GATCGCGGCT GATGATGCCG TCGATATCTA CCATCCCGGC
TATACACTGC GGGTCTATAT GCTGGGGCGT TTCCGCATCT TCCGCGGCGC GCACGAGATT
CAGGCGCGCG AGTGGCAACG AGAGAAAGCG CGGCAGTTGT TGCAACTGTT GCTGACCTAT
CGTGGCATGT GGTTGCAACG TGAGCAGATC TGCGCCTGGC TCTGGCCCGA CAGCGAACCG
GCAGCCGCCG AGCGGCAGTT CAAAGTGACA CTCAACGCGC TCAATAATGT GCTGGAACCG
CGCCGACCGC CGCGTGTCGC GCCGTTCTTT ATTCGGCGGC AGGGGCTGGC GTATAGTTTT
GCCCCATCTT ATGGATGCTG GATCGATGTG GACGAGTTCG AACTGCGCAC CGCCGGTGCG
CCGGGACGCG ATCCAGAGGT CGAGATCCGC AGCCGCCGCA CAGCATTCCA TCTGTATCGC
GGCGACTATC TCGCCGAGGC GCTGTACGAC CCCTGGACGC TCGAAGAACG TGAGCGCTTG
CTGGCGCGGC ATCTGGCATC GACCGCGACC CTTGCCAGTT TGCTGGTTGA CCGCGGCGAT
TTCGATGAAG CCATCGATCT GTGCGAACAC ATCATCCGCC GCGACCGTGG TTATGAGGAG
GCGTACCAAA CCCTCATGCG CGCCTATGCC CGCGCAGGGA GCCGTTCCCA GGCGTTGCGC
GCCTACGCGC GTTGCGTTCA GGCATTGCAG GACGAACTGG GAATAGAACC GCTCCCGGAG
ACAACCGACC TCTGTGAGCG GATCAAGCGG AACGAGGCGG TGTAG
 
Protein sequence
MVVSRVTGEF GAAVSLVSTV LLPRLEPPPQ PARIIERPRI DGLLAAIADY PVTLVLAPAG 
SGKTVALTSF ARHGGWPAAW CRLDPADTPL SLALHLATAF RPITGFDHAR FAAAHPVDVL
DGLINALTAL GDETLLILDD LHHADRRPEL RVLIEHLIDR LPPHLHLVLV SREMPALASL
PTIAARGELY RLSRAQLAFT NAEARDFFAA YGLPPHPIDA ELNTIARGWP LALRFFAAAR
IDSATPSDQP PTLERLQESI APHLDAYLAR EVLGDLPFAV RTWVLGTALM RWIDEAACAA
VTELAHLHMT VDLLERWELF IETLPDGRRV YQPLQAASFA RLAERDLPDW RAFHAQLGHY
YAAHNDDHSA AHHFLAAERW EDAAAALSRM ALSGVSGSQA AALLDWIDQI PPAHRNSAAL
LEARAVAERR LGRYTHAVEL YRKAEEQYHA QGDIEGQVRA LRGQAEVYID TVQPAPAAIL
LKRAMKLLPR DRRAERATIL SLQAENWINR GRADVSVLII AAAHREAYGK TAHTDAVGGY
RRSAILSPRL LLRSGRLIDA RRLLEEELGL EAGRARAEHS LHRDPLLLLA LIECMLGNGV
RALALAQRGL LEAQRGDSPL TEAIAHMRLG HACLVTASSD EMARSHYRAA LDIIEAVGIP
RARAEVMLGL TLLEGHAGNL TAAEAYARDG LDRALEAGDE WTAALIWLAL GSVAAAAGDP
RALEWIGEAH QRFVRGDDQY GQTVALLWEA HVHVQSGNEI EADKKLARLL ELVSAHGFDG
VLTTRTLFGP HDLAILVPLL LRGRVLRGAA QRQAATAYRL LRQGFPSIAA DDAVDIYHPG
YTLRVYMLGR FRIFRGAHEI QAREWQREKA RQLLQLLLTY RGMWLQREQI CAWLWPDSEP
AAAERQFKVT LNALNNVLEP RRPPRVAPFF IRRQGLAYSF APSYGCWIDV DEFELRTAGA
PGRDPEVEIR SRRTAFHLYR GDYLAEALYD PWTLEERERL LARHLASTAT LASLLVDRGD
FDEAIDLCEH IIRRDRGYEE AYQTLMRAYA RAGSRSQALR AYARCVQALQ DELGIEPLPE
TTDLCERIKR NEAV