Gene Rcas_4389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4389 
Symbol 
ID5541902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5641245 
End bp5642426 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID640896489 
Productradical SAM domain-containing protein 
Protein accessionYP_001434425 
Protein GI156744296 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.484216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTCG ATGAAAAACT CGCCATTCTG GCGCCGGCGG CGCGCTTCGA TGCCTGCGAC 
CGGTTCCTGG GGAAACGCCG CGCGCCGCCC CCTGCAACCG GATGGAACGA CGACGCTGTG
GTTGCCGATG CCGACAGCGA TGGGCGCGCG TTGCCGGTCT TCCGGTTGTT GCTGAGCAAC
CGCTGCGAAT GGAACTGCGC CTACTGCCCG TTGCGCTCCG GCAACGACAT GCCGCGCGCC
GCGCTGAACC CCAACGAACT CGCGCGCGTC GTTCTGCCGC GCGTCGAACG AAAGACGGTG
CAGGGGTTAC TTATCTCCAC TGGGGTCGAT GGCAGCCCGT CCGTTGCGAC CGAACGCCTG
CTCGATGCTG TTGAAGCGCT GCGCGCGCGC CATGGCTATA CCGGGTACGT CCACCTGAAA
TTGCCACCCG GCGCGCCCGC TGCGGACATT GAGCGCGCCG CGCGCCTTGC CGACCGTATC
AGCCTGAATC TGGAAGCGCC GACGGCAATG CACCTGGCGC GTATTTCGCC GGAGCGCGAC
TGGCTGCGCG ACCTGATCGC GCCGCTGGCG CTGGCGCGCG ACTGGAGTCG GACCGGCGCT
ATTCGGGCGG GGCTTGCGAC GCAGTTCGTG GTCGGCGCGG CCGGCGAAAG CGACCACGAT
CTCCTGGTGA CAACCACGTG GCTCTACCGC GACCTGGGGT TGCGGCGCGT CTATTTTGGC
GCGTTTCGAC CGGTTGCCGG CACGCCGCTG GAGCAGCGCG CACCCACGCC ATTCGTGCGC
GAACAGCGCC TCCGCGAAGC CGACTGGCTG GTGCGGCGCT ACGGCTTCGA TCAGCGCGAA
TTGCCCTATG ATGCGGCAGG CAACCTGCCG TTGCACATCG ACCCAAAACT GGCCTGGGCG
TTGGCGCACC CCGAACGCTT TCCGGTTGAA CTGAACAGCG CCGACCGCGA CGAACTGTTG
CGGGTGCCGG GGTTGGGTCC GGTGAGCGTG GCGCGCATTC TTCGTCTGCG GCGTGAAGGG
CGCTTTCGCG AACCGGCGCA CCTTGCAGCG CTCGGCGGAG CGCTTGCGCG CGCCCGTGAC
TTTGTGACGC TCGATGGACG CTTTTTCGGC AGGAACGAAC GTGACCGCCT GCGCCATTAT
GCCCGGCAAT CGGAAATTGC CGAGCAGTTG ACCTTGTGGT AG
 
Protein sequence
MDLDEKLAIL APAARFDACD RFLGKRRAPP PATGWNDDAV VADADSDGRA LPVFRLLLSN 
RCEWNCAYCP LRSGNDMPRA ALNPNELARV VLPRVERKTV QGLLISTGVD GSPSVATERL
LDAVEALRAR HGYTGYVHLK LPPGAPAADI ERAARLADRI SLNLEAPTAM HLARISPERD
WLRDLIAPLA LARDWSRTGA IRAGLATQFV VGAAGESDHD LLVTTTWLYR DLGLRRVYFG
AFRPVAGTPL EQRAPTPFVR EQRLREADWL VRRYGFDQRE LPYDAAGNLP LHIDPKLAWA
LAHPERFPVE LNSADRDELL RVPGLGPVSV ARILRLRREG RFREPAHLAA LGGALARARD
FVTLDGRFFG RNERDRLRHY ARQSEIAEQL TLW