Gene Rcas_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1744 
Symbol 
ID5539222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2247172 
End bp2248611 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content63% 
IMG OID640893883 
Productputative transcriptional regulator 
Protein accessionYP_001431854 
Protein GI156741725 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.853827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACG CTGTGGAGCA AACATCAAAA GTGATGGATA TTCATGAACA GGCGATCACA 
GACCTCCTGG CGCAGGCGGA AGGCGAGCGC GTGGCATGCG TGCGGGCGAA TGTTCGTCCC
ATCGATCTGG CGCAAACCAT GGCGGCAATG GCGAATGCCT CCGGCGGGAC GATCTTGATC
GGCGTGCGTG GGCGTCAGGT CGAAGGGGTG GCAGATGTGG ACGCTGCGCG AGCGATGGCG
TTCGACGCGG CGCTGGCATG TCTGCCGCCG CTCGTCCTTC CGCTGCCGGT CGTTGTCACA
CACAATGGCG CCACGCTGCT GATTGTCACC GTTCCTTCAG GATTGCCCCA TGTCTACAGC
GTCGGTGGAA CCTACCTGCG GCGGGAAGGT TCTGCGAATG TGCCGCTTGC TCCCGATGCC
TTGCGACACC TTCTGATTGA GCGGGGCGAA ACAAGTTGGG ACCGGATGGC GCCGCCGGAT
GCAACGCTCG CCGATCTGGA TGCTGAAAAG ATCGCCGCCT ACGCACGACG TGTTGGTCCT
ATCGCTGAAG CCGATCCGCT GGCATTTCTG CTCCGGCGCG GCTGTCTGAT GCAGCGCAGC
GGCACCGCTA CCGGTAAGAC CGAATTCGTG CCCACCAACG CCGGTCTCCT CCTCTTCGGC
GTCGAAATCG AGCGCTGGTT TCCACAGGCA GAAGTGACGC TGGTGCGCTA TCAGGGACGA
GAGATGAGCG ACGCATTTCT GCGTGAAGAT ATCCGCGACA CGCTGCCGGA AACAGCCCGC
CGCGCCGAAC GCTGGCTGAT AGAACATATG CGCCGTGGTA GCCGCATGGT CGGGTTGGAA
CGGGAGGACT GGACGCAATT CCCGTTGGGT GCGGTCCGCG AAGCGTTGAT CAATGCGCTG
GCGCACCGCG ATTACACGAT CCGTGGCGAC TCCATTCGCG TCCTGCTCTT CAGCGACCGT
CTGGAATGTT ATTCGCCGGG GCGTTTGCCG GGACATGTCA CGCTTCAGAA CCTGGTCGAA
GAGCGCTTTA GCCGGAACGC AACGCTGGTG CAGGCGCTGG CGGACCTGGG GTTGATCGAG
CGGCTTGGTT ATGGCATCGA CCGCATGCTG CGTCAAATGG CCGACGCTGG CTTGCCGCCG
CCGGAGTTCC GCGAGACGAC TGCCGGATTT CTGGTCACGC TCTATGGGCG CACCGGCGAC
GACCGCGCCG ATGCCGGCGG CGCCGATGTC GCCGCCTGGC GGCGCATGGG GCTGAACGAG
CGGCAGATTG CGGCGCTGCT CTTTCTCGCA GAGCATCAGC GCATCACCAA CCGCGACATG
CAGGAACTTG CGCCGGATGT CTCTGCCGAA ACACTGCGTC GTGACCTCGT CGATCTGGTC
GAACGCGGGT TGCTGTTGCG CGTCGGCGAC AAACGTGGCG CGTACTATAT TCTGAAGTGA
 
Protein sequence
MFDAVEQTSK VMDIHEQAIT DLLAQAEGER VACVRANVRP IDLAQTMAAM ANASGGTILI 
GVRGRQVEGV ADVDAARAMA FDAALACLPP LVLPLPVVVT HNGATLLIVT VPSGLPHVYS
VGGTYLRREG SANVPLAPDA LRHLLIERGE TSWDRMAPPD ATLADLDAEK IAAYARRVGP
IAEADPLAFL LRRGCLMQRS GTATGKTEFV PTNAGLLLFG VEIERWFPQA EVTLVRYQGR
EMSDAFLRED IRDTLPETAR RAERWLIEHM RRGSRMVGLE REDWTQFPLG AVREALINAL
AHRDYTIRGD SIRVLLFSDR LECYSPGRLP GHVTLQNLVE ERFSRNATLV QALADLGLIE
RLGYGIDRML RQMADAGLPP PEFRETTAGF LVTLYGRTGD DRADAGGADV AAWRRMGLNE
RQIAALLFLA EHQRITNRDM QELAPDVSAE TLRRDLVDLV ERGLLLRVGD KRGAYYILK