Gene Rcas_3757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3757 
Symbol 
ID5541259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4927506 
End bp4928507 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content62% 
IMG OID640895867 
ProductMerR family transcriptional regulator 
Protein accessionYP_001433814 
Protein GI156743685 
COG category[K] Transcription 
COG ID[COG0789] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00211459 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00419816 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGGAGC AGGTCCATCG CCTGCTGGCG CTGTCCGACG TTCCACGGTA CAACATTAAG 
GCGGTCGTCC AGCAGACTCA GGTCAATGTT TCGACGCTAC GCGCATGGGA GCAGCGCTAT
GGCGTTCCAC GCCCGACTCG CTCCGATCAC GGTCATCGGC TCTACTCGCA GCGCGACATC
GAAATCATCA AGTGGCTCAA GCAGTGTACG GAGGAAGGAC TAGCGATCAG CCAGGCAGTC
GCACTGCTAC GCGACATCAG TGATACCGGC GATATCGCCC CGCGCGCCCC GCAGCCTCCC
CCGCCAACGC TCGCCGACGC TGGCTGGCCC GACCTGCGCA CCCAACTGAC CGAGGCGTTA
CTCAGCGCCA ACCTGCGGCA GGCGCACTTG CTGGTCAATA CGGCGGTTGC GCTCTTCCCC
ATCGAGACGC TGGTGCTCGA TCTCTTTCAG CCGATGCTGA TCGAGATCGG CGACCGCTGG
GCGCAGGGCG ACGTCTGTGT CGCAGAAGAG CGCGTGGTCA CGAACTTCGT GCGCCAACGA
CTCCTTGGCT TGTTGCAAAT CCACGCGCCG TTCGCCACCG GTCCGCGCCT GATCGCCGGA
TGCGCGCCGG AAGAACAGCA CGAGATCGGG TTGATTATGT TCTCGCTCCT GATGGAGCAG
CGCGGTTGGG AACTCATCTA TCTGGGACAA ACGGTATCGG CGGAAGGGCT GGATGGCTTT
CTGGTGCGAA TGGCGCCGGC GCTCATCTGT ATGTCGGTCT CGATGGCGGA ACATGTGCCC
GGACTGCTGG AAATTGCGCG GATCGTCGAA AATCGCCGCC GTCATCGGTT GCTGTTCGCC
TACAGCGGTC AGGTGTTCGA CCGCCATCCT GAACTTCGCG GGCGCATTCC CGGCATCTTT
CTGGGCAACG ATTTGCGCGA GGCGGTGATC CGGGCGGACG ATCTCGGCGA GGAGATCGAC
CCGGAACGAT GGGCGCGACA GGCGCACTTT TTTCGCCATT GA
 
Protein sequence
MLEQVHRLLA LSDVPRYNIK AVVQQTQVNV STLRAWEQRY GVPRPTRSDH GHRLYSQRDI 
EIIKWLKQCT EEGLAISQAV ALLRDISDTG DIAPRAPQPP PPTLADAGWP DLRTQLTEAL
LSANLRQAHL LVNTAVALFP IETLVLDLFQ PMLIEIGDRW AQGDVCVAEE RVVTNFVRQR
LLGLLQIHAP FATGPRLIAG CAPEEQHEIG LIMFSLLMEQ RGWELIYLGQ TVSAEGLDGF
LVRMAPALIC MSVSMAEHVP GLLEIARIVE NRRRHRLLFA YSGQVFDRHP ELRGRIPGIF
LGNDLREAVI RADDLGEEID PERWARQAHF FRH