Gene Rcas_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4090 
Symbol 
ID5541601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5301095 
End bp5302285 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID640896202 
ProductPUA domain-containing protein 
Protein accessionYP_001434140 
Protein GI156744011 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.337957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.372554 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCG TCATATTGCA TCCCGGTAAA GAACGACCTG TGGTTCAACG GCACCCATGG 
GTGTTTTCTG GAGCGATTGC GCGCATTCAG GGTCGGTTCC CCGATCGCGG CGAGGTGGTC
GATGTGCAGG CGGCCAGCGG CGAGTGGCTG GCGCGCGGTT GCTGGAGTGA CGGATCGCAG
ATCCGCGTTC GCCTGTTCAC GTGGAATCCG GATGAGCCGA TTGATGACGC ACTGATCCGG
CGCCGTATCG AGCGCGCCAT TGACGGTCGT CGCAGACTGG GCATGCTCAC CGACGATGGG
GCGTGTCGCC TGGTCTATGC CGAATCCGAC GGTATTCCCG GCCTGATCGT CGATTACTAC
GCCGGGTTTT TGGTGGTGCA ACTGCTGACT CAGGCGATGG CGCTGCGCCG TGCGGCGATC
ACGCGCGTGC TGGCGGAGAC GCTTGTGCCG CGCGGCATCT ACGAGCGGAG CGAATCTGAC
GTTCGTGAGA AGGAAGGGTT GCCGCCAGCG TCGGGCGTAC TGTGGGGCGA AACGCCGCCC
GATTGTGTGC ATGTGCGGTT GCCCGGCGAT CTCTGGCACG CGGTCGATCT CCGCACCGGT
CAAAAAACCG GCGCTTACCT CGACCAAGCG TTCAATCGGT GGCGGGTCGC CATGCATTGC
ACCGGCGCAG AGATGCTGGA CTGCTTCTGC TACGCTGGCG GCTTTACCAT TGCGGCAGCG
CGTGCTGGCG CTCGTCACGC AATTGCTCTC GATACCAGCG AGTCCGCGCT TGAGATGCTC
CGCGCTGGGC TTGCCCTCAA CGCCATTGCT ACCCCGGTCG AAACGGTTGC GGCGGATGTG
TTTCAGATGT TACGGCGTTA CCGCGATGAA CAACGCCGCT TTGACGTCGT TGTGCTCGAC
CCGCCCAAAT TTGCCCATAC GCAGGCGCAG GTCGAACGGG CAACCCGTGG GTATAAGGAC
ATCAATGTGC TGGCAATGCA GTTGCTGCGC CCCTGCGGGA TTCTGGCGAC GTTCTCCTGC
TCCGGTCTGG TGTCGAGCGA TCTGTTTCAG AAGATTGTCT TTGGTGCTGC GCTCGATGCG
CGCCGTGAAG CGCAGATCAT CGAGCGGTTA ACGCAAAGCC CCGATCATCC GGTGTTGCTG
ACATTTCCCG AAGGAGCATA TCTGAAAGGT CTGATCTGTC GTGTCTGGTA G
 
Protein sequence
MAIVILHPGK ERPVVQRHPW VFSGAIARIQ GRFPDRGEVV DVQAASGEWL ARGCWSDGSQ 
IRVRLFTWNP DEPIDDALIR RRIERAIDGR RRLGMLTDDG ACRLVYAESD GIPGLIVDYY
AGFLVVQLLT QAMALRRAAI TRVLAETLVP RGIYERSESD VREKEGLPPA SGVLWGETPP
DCVHVRLPGD LWHAVDLRTG QKTGAYLDQA FNRWRVAMHC TGAEMLDCFC YAGGFTIAAA
RAGARHAIAL DTSESALEML RAGLALNAIA TPVETVAADV FQMLRRYRDE QRRFDVVVLD
PPKFAHTQAQ VERATRGYKD INVLAMQLLR PCGILATFSC SGLVSSDLFQ KIVFGAALDA
RREAQIIERL TQSPDHPVLL TFPEGAYLKG LICRVW