Gene Rcas_2004 details

Gene Information

Locus tag: Rcas_2004
Symbol:
ID: 5539482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2571914
End bp: 2572879
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 63%
IMG OID: 640894139
Product: helix-turn-helix type 11 domain-containing protein
Protein accession: YP_001432110
Protein GI: 156741981
COG category: [K] Transcription
COG ID: [COG2378] Predicted transcriptional regulator
TIGRFAM ID:
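
The coordinates and lengths listed above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2571914, 2572879)
print(length)                     # 966
print(protein_length_aa(length))  # 321
```

Under those assumptions, 2572879 - 2571914 + 1 gives the listed 966 bp, and 966 / 3 - 1 gives the listed 321 aa.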


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.532666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.656208
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACATCTG AGCGGGTTCA GAGTAAGGCA GCACGATTAC GCCGAATCGA ACACCTGCTC 
TACAACGCCC CCGGCGGTCT GCGTGTCGTC GATCTTGCGG AGCGCTGCGG CGTTGATCGC
CGCACGATCT ACCGTGACCT TGCGGCCCTC GAAGAGATGG GCGCGCCGGT CTGGCAGCAG
GGCGGGCGCT ATGGCATTGA ACGCGACTCG TACCTTTCGA CGGTGCGCCT GAACCTCAAC
GAGGCTGTGG CCCTCTTCTT CGCAGCGCGG CTGCTTGCCC ACCACAGCGA CGAACACAAC
CCGCATGTCG TTTCGGCGCT GCAAAAACTG GCGACAAGCC TGCCCGACGC GACTGTGTCA
TCACATATCG CTCGCGTTGC CGACCTTATT CGTGAGCGGG CGCATCGGAC GGCGTACATC
CGGGTGCTCG AAACGATCAC CCGCGCCTGG GCTGACCGGC GGTGCGTTAC GATTGACTAT
CGCGCGGCAA GCGGTGATGT GACCCAACGG GTGATCGAGC CATATGTGCT GGAAGTGGCG
CGGAGCGAAC CGGCATCCTA CGTGGTTGCG CACGATCGGC TACGCGGCGC GTTGCGCACC
TTTAAACTCG AACGCATCGC GCAGGCGACG ATACTGGACG AGACCTACAC CATTCCTGAC
GATTTCGACC CCTATGCGCA TTTTGGCGCT GCCTGGGGGG TGATCAACGA GACTGAGGTC
GAGGTGCGCT TGCGCTTCAC CGGCGACGCT GCGCGACGGG TGCGTGAAAG CGTCTGGCAC
CACAGCCAGC AGATCATCGA GCGCGCCGAT GGCGGGTGCG ACATGGTCCT GCGGGTCGGC
GGCGTGCGAG AGATTCGTTC ATGGGTATTG AGTTGGGGAG CGGATGTGGA GGTACTGGCG
CCGGAAGTGC TGCGGAACGA TATTCGCGAT CACGCGCAGC GTCTGGCGGC AATGTATCAG
GACTGA
 
Protein sequence
MTSERVQSKA ARLRRIEHLL YNAPGGLRVV DLAERCGVDR RTIYRDLAAL EEMGAPVWQQ 
GGRYGIERDS YLSTVRLNLN EAVALFFAAR LLAHHSDEHN PHVVSALQKL ATSLPDATVS
SHIARVADLI RERAHRTAYI RVLETITRAW ADRRCVTIDY RAASGDVTQR VIEPYVLEVA
RSEPASYVVA HDRLRGALRT FKLERIAQAT ILDETYTIPD DFDPYAHFGA AWGVINETEV
EVRLRFTGDA ARRVRESVWH HSQQIIERAD GGCDMVLRVG GVREIRSWVL SWGADVEVLA
PEVLRNDIRD HAQRLAAMYQ D
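
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced this record. It assumes the gene sequence is given in coding orientation, builds the codon-to-amino-acid mapping used by translation table 11 (the same assignments as the standard code), and treats the first codon as methionine when it belongs to an assumed set of table 11 start codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Assumption: initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "GTGACATCTG AGCGGGTTCA GAGTAAGGCA"
print(translate(first_blocks))  # MTSERVQSKA
```

The gene begins with GTG rather than ATG, which is why the start-codon rule is needed to reproduce the leading M of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.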