Gene Rcas_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2166 
Symbol 
ID5539646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2782816 
End bp2783976 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content62% 
IMG OID640894299 
Productradical SAM domain-containing protein 
Protein accessionYP_001432268 
Protein GI156742139 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.978798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000549931 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCCTC TGATCTTTGT CGAACCAGAC ACGCAGCAGA AACTCTCGAT CCTGAGCGCA 
GAAGCCGGAT TCGAAGCCAC AACGGTTCCA GCCGTCGAAC GGGTACAGCA GGGGATGCGC
CATCCCGCCG GTTTTCTCTA TACTGCCAGG AAAGAAGGCG GGGGGACAGC GCGCCTCTTC
AAAGTGCTCC AGACGAACGC CTGCCGCTAC GCCTGCCGCT ACTGCTTCAC CTCGTGCGCA
GTGCGACGTT CCCGCACGAC GTTTAAGCCC GACGAACTGG CAACAACCTT CATTGCGCTG
CATCGTTCCC GGCAGGTCGA TGGGCTGTTC CTCTCCTCCG GGATTGTCCC CGACGCCAAT
GCAACGATGG AGAAGATGCT GGCGACAGTC GAACGATTGC GACTGAAAGA AGGATACACG
GGCTATATCC ACCTGAAGTT GATCCCCGGC GCGGCATTCG AGTATATCGA GCGCGCCGTT
GAACTGGCAG ACCGCGTTTC GTTGAACCTG GAAGCGCCTA ATGCCGAACG CCTGGCGATG
CTTGCGCCGG AGAAAGAGTT TGCCGGCAGC ATGTGGGGGC GCATGGCATG GGCTGCCGGG
TTGATCCGCC GGGCGCGCGC CGCCGGGCGC CCCGCTGCCC GCAGCCTTAC GACGCAGTTC
GTCGTTGGTC CGGCGGGCGA AAGCGACCGT GAGTTGCTCG AAACGGCGGC GCGCACCCAC
CGCGACCTCG ACCTGCGCCG CGCATTCTTC AGCGCCTTTC ACCCCATCGA ACGCACACCG
TTCGCCGATA TGCCCGCCGA AGACCCACTG CGCGAACTGC GGCTCTATCA GGCGGATTTT
ATGCTGCGTG ATTATGGTTT CACCGTTGAC GAATTGCCAT TCGATGAGCG TGGCCTGCTG
CCACGCACCA TCACTCCGAA ACAGGCCTGG GCGGAACGCC ATCTTGTTGA GCCGATCGAC
GTGAATGTTG CGCCGCGACG CCTGCTGCTT CGCATCCCCG GCATCGGTCC ACGATCCGCC
GACCGGATCA TTGCTATGCG ACGCGAGATG CACCTGCGTG ATATGGCGCA TCTCGCGCGG
CTTGGCGTGG TGGTCAATTG GGCGGCGCCC TATGTGCTGC TCGATGGAAG ACGCCCGCCG
GAACAGGGGC GTTTGTGGTA A
 
Protein sequence
MSPLIFVEPD TQQKLSILSA EAGFEATTVP AVERVQQGMR HPAGFLYTAR KEGGGTARLF 
KVLQTNACRY ACRYCFTSCA VRRSRTTFKP DELATTFIAL HRSRQVDGLF LSSGIVPDAN
ATMEKMLATV ERLRLKEGYT GYIHLKLIPG AAFEYIERAV ELADRVSLNL EAPNAERLAM
LAPEKEFAGS MWGRMAWAAG LIRRARAAGR PAARSLTTQF VVGPAGESDR ELLETAARTH
RDLDLRRAFF SAFHPIERTP FADMPAEDPL RELRLYQADF MLRDYGFTVD ELPFDERGLL
PRTITPKQAW AERHLVEPID VNVAPRRLLL RIPGIGPRSA DRIIAMRREM HLRDMAHLAR
LGVVVNWAAP YVLLDGRRPP EQGRLW