Gene Rcas_3604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3604 
Symbol 
ID5541105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4702858 
End bp4704717 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content58% 
IMG OID640895723 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001433671 
Protein GI156743542 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.562878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCGA CCAGCACCAT CTTGATCGCC GATGACGATC CGGGCGCGCG CCTTGTTCTA 
CAACGACTGT TGACGCGCGA GGGTCATCGG GTGCTGATGG CTCAGAACGG CGCCGAGGCA
CTGCGTCAGT CGATCGAATA TACGCCAGAT TTGCTGCTTC TAGATGTATT GATGCCGGAG
ATCGACGGTT TTGAGGTTTG TCGGCAATTA CGCAATCATA CACGGCTCCG CGAGATCCCC
ATTCTTTTGA TCACGTCGCT TGGCGATCAG CAGTCACGTG TGGAGGGGTT GAGCGCGGGC
GCTAATGGAT TCATCACCAA GCCGTTCGAC ATGGCGGAAC TGCTGGCGCA TGTGCGCACC
ATCACCCGTC TCAACCGCTA TCGCCGTTTA TTGTCAGAAC AGGAGCGGTT CCAGCGTCTG
ATCGAACTTT CACCGGAAGG CATCGCTATC ATCAATGAGG AAAGCCGATT GCTGCTCGTG
AACCCTGCGC TGGTCGCGCT GCTCGAAGCT CCGTCTGCCG GGTATCTCCT GGGTGACACA
TTGCTCAACT ATCTGCATAT TGCTGGGCTT GATCGCTATC GGGCGGGCAT CAGAGCGCTC
ATGCAAGAGA CGCAGCAGGT CTGGCGCATT GAACTCGATC TGATCAGCGT CAACGGTCAC
GCTACTCCCG CCGAAATCAG TCTGGGACGT TTTCGCGACC AGAATGGCGC ATTTGCTCAG
GTGATCATCC GCGATATTTC GGAACGTAAA CGCGCCGAGG CACAGATCTA TCGGCAGGTC
AGCCGACTGA CCAGTCTGCA TACCATCGGC GTCGCTATTA CAGCAAGCCT GGAGCTTCCG
GCAACGCTGG CGATCCTGCT GGATCGCCTG ATTGAGGAGT TGCACGTCGA TGCAGCGAGT
GTGCTATTGT TCAACCCACG CACCACAATG CTCGAAACAG CCTGCAACCG GGGGCTGCCG
CGCGATCTTG CCGAGGTGGC GATCTGCGCT GAGGAAGGAC TGGCTGGCAT GGCGTTTCGT
TCGCGTCAAA CGGTCTCGCT CGCAACCTTT CCTTCCGAGA TGCTGATCAG TCCACGTGAT
CAAATGCTGG CTGCGGCATT TGCAACATAC TTTGCCGTAC CCTTGCAGGC GCGCGACGAA
ATCAAAGGCG TGCTCGAAAT CTTGCAGCGC GCCTCGTTTA CGCCCGACGA GCATTGGTGG
ACATTTCTCG AAGCGCTGGC GATGCAGGCG GCGATTGCTA TCGATACATC TTCGCTCTTC
GAAGATTTGC GACGCACACA CGCAGAATTG AAGCAATCCT ACGATGCAAC GATTGCCGGC
TGGTCGCGCG CGCTGGATTT GCGCGACCGT GAAACGGAAG GGCATAGCGA ACGTGTAACA
GAGCTAACCC TGCGGCTGGC GCGTTGGATG GGCATTCCTG AGGATCAGAT GGAACATATC
CGGCGCGGCG CGCTCCTTCA CGATATTGGC AAGATGGGCG TTCCCGATCA TATTTTGCTT
AAGCCAGGCC CGCTGAGTGT CGATGAATGG GCGATCATGC GCCAGCATCC GGTCTATGCG
TTTCGCTGGC TCTCGGCGAT TCCCTTTCTG CAACCTGCGC TCGATATTCC CTATGCCCAT
CACGAACGAT GGGATGGCAG CGGTTACCCG CGCGGATTGC GTGGTGAACA GATCCCACTT
GCCGCGCGTA TCTTTGCGGT CGTCGATGTG TGGGACGCGC TGCGCTCTGA TCGACCGTAT
CGTTCAGCCT GGACAGAAGA ACAGACGATC GCATACCTCC GCTCCCTGGC CAGTACGCAT
TTCGATCCGG CCGTGGTGGC TGCATTTCTT GACATGATCG GGCAGGGGCG CAGGGAGTAG
 
Protein sequence
MDATSTILIA DDDPGARLVL QRLLTREGHR VLMAQNGAEA LRQSIEYTPD LLLLDVLMPE 
IDGFEVCRQL RNHTRLREIP ILLITSLGDQ QSRVEGLSAG ANGFITKPFD MAELLAHVRT
ITRLNRYRRL LSEQERFQRL IELSPEGIAI INEESRLLLV NPALVALLEA PSAGYLLGDT
LLNYLHIAGL DRYRAGIRAL MQETQQVWRI ELDLISVNGH ATPAEISLGR FRDQNGAFAQ
VIIRDISERK RAEAQIYRQV SRLTSLHTIG VAITASLELP ATLAILLDRL IEELHVDAAS
VLLFNPRTTM LETACNRGLP RDLAEVAICA EEGLAGMAFR SRQTVSLATF PSEMLISPRD
QMLAAAFATY FAVPLQARDE IKGVLEILQR ASFTPDEHWW TFLEALAMQA AIAIDTSSLF
EDLRRTHAEL KQSYDATIAG WSRALDLRDR ETEGHSERVT ELTLRLARWM GIPEDQMEHI
RRGALLHDIG KMGVPDHILL KPGPLSVDEW AIMRQHPVYA FRWLSAIPFL QPALDIPYAH
HERWDGSGYP RGLRGEQIPL AARIFAVVDV WDALRSDRPY RSAWTEEQTI AYLRSLASTH
FDPAVVAAFL DMIGQGRRE