Gene Rcas_1606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1606 
Symbol 
ID5539082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2069909 
End bp2071069 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content56% 
IMG OID640893743 
Productputative transcriptional regulator 
Protein accessionYP_001431716 
Protein GI156741587 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.906529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTTT TAGAACTCTA TCAACGTATC CAACGTTGGG AGGATTTGCA TACCGAGTTC 
AAAGAGCAGG ATGCGCACTC CGATGATATC GCCGCCGCGC TGGTCGCTTT TGCCAACACT
GATGGCGGTC AACTGATTTT CGGCGTCAGC AAGGACCGGG TTATCATCGG CGTGGATGAT
GCGGACCGTG TGATGCAACG CATTGACCAG ATTGCTTACC ATAATTGCGA GCCTCCCGTC
ACCGTGATTC AGGAAACGAT TCCCACAGAA CAGGGCCTCG CGGTTGTTGT GAACATCCCC
AAGGGCGACC AGCGTCCCTA TCGCACGCAG CGGGGAGACT ATTTCATTCG CACCACCTCC
GGGCGCCGGC GCGCTTCCAG ACAAGAACTG TTGCGCCTCT TCCAGTCGGT CGAAAGCCTG
TACTATGACG AAACGCTGGT GTTACGCGCA AGCCTTTCGG ATCTGGACTA TCGGCGCTTC
ACGGTGTTTT TTGAACAATC CTATCAAAGG CTTTTGCAGT CTGAGCAAGA GGTAGAGAAT
CTGCTGCGAA ACATGCGCCT GGTCAGGGAA CAGGCAGGCG TCTGGCATCC GACCCTGGCA
GGTCTGCTTT GCTTTGGCCG CGAGCCGCAA AGTTTCTTCC CCTACGCACA GGTCAACGCG
GCGCGTATCC CCGGCGACTC GCTTGCCACC GCGCCATCGG ATGCCAAGCA AATCGGCGGT
ACCCTGTTTG ATATGTTGGA AGACACGGCG CGTTTTTTGC AGATTCATCT GCCCAGCCCG
CACATCATTC ACGGCTTTGC GCCCGAGCAA CGGACTGAAA TTCCCGAAGA AGCCCTGCGC
GAGTTGTTGG TCAATGCCCT GGTCCACCGC GATTACACCG TTGCGTCGCC CATCCGTCTG
CTGATTTTTG ACCGGCGGAT CGAAATCCGC ACGCCGGGCG CGTTGCCGAA TACGGTCACC
ATCGAAGCTA TTTTGTTGGG GGCGGCGCAT GTGCTTCGCA ATCCCACCAT TTATACGATG
TTCAGCCGGG CCGGCCTGGT CACGAGTTTG GGCAGCGGCG TTTTACGTGC CAAAGAACTC
CTTGAACAGC ACGCTCACAC AACCCTCGAA CTAAAAGTTG TCGCCAATGA ATTTGTCGTG
ATCATCACTC GTCCGGGGTG A
 
Protein sequence
MDVLELYQRI QRWEDLHTEF KEQDAHSDDI AAALVAFANT DGGQLIFGVS KDRVIIGVDD 
ADRVMQRIDQ IAYHNCEPPV TVIQETIPTE QGLAVVVNIP KGDQRPYRTQ RGDYFIRTTS
GRRRASRQEL LRLFQSVESL YYDETLVLRA SLSDLDYRRF TVFFEQSYQR LLQSEQEVEN
LLRNMRLVRE QAGVWHPTLA GLLCFGREPQ SFFPYAQVNA ARIPGDSLAT APSDAKQIGG
TLFDMLEDTA RFLQIHLPSP HIIHGFAPEQ RTEIPEEALR ELLVNALVHR DYTVASPIRL
LIFDRRIEIR TPGALPNTVT IEAILLGAAH VLRNPTIYTM FSRAGLVTSL GSGVLRAKEL
LEQHAHTTLE LKVVANEFVV IITRPG