Gene Rcas_4073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4073 
Symbol 
ID5541584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5286524 
End bp5287732 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content62% 
IMG OID640896185 
ProductSAM-dependent methyltransferase 
Protein accessionYP_001434123 
Protein GI156743994 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.910073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.355186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCGTC GTGTGGAAAT CGTTGTGCCG TCGTTGCTGC GCGAGCGACT GGCGCAGGGG 
CATCCATGGG TCTACCGCGA CCATGTTTCT CCCCATGTGC GTCTGCCGTC CGGCGCCTGG
GTTATCGTTC GTTGCGGCGC CTGGCGCGGG TATGCGCTGT GGGATGCGGA AGGTCCGATT
GCGTTGCGCA TCTTCTCGAC GCGCACCGTC CCCGACATCG CCTGGCTACG TGAGCGTCTG
ACTGCTGCGT GGAATCTGCG GGCGCCGCTG CGTGCGGCAG GCATCACGGC GTATCGCTGG
GTTTTTGGCG AAGGAGATGG AGTGCCCGGC ATTGTCGTGG ATCGCTATAA CGATATTGCT
GTCCTCCAGG CGTCTTCCGC CGGTACGCTG ACCCTCATCG AAGACGTGGC GACTGCCATT
CTCAAGGTCG ATCCGACGGT GCGCCGTGTG GCGCTGCGTA TGGCAACGGA GTCGCGTTCA
GCAATAGATG AAGGCGACGA AGGCGATGGT GACGCGCGGC TACGATCACT GTACGGTGAG
TCGCCGCCGC GCGAGATTGT GGTGGTCGAG CACGGGATCC GTTTTGCCGT TGCGCTTCAC
ACGGCGCAGA AAACCGGGTT GTTCCTCGAT CAGCGCGAGA ATCGGCGTTT TGTCGAAGGA
CTCGCTGCCG GGCGCACGGT GCTGAATTGC TTCGCCTATA CTGGCGGGTT TTCGCTCTAT
GCCCTGCGCG GCGGTGCGCG GCAGGTCGTT AGCGTCGATG TTGGCAAGGG TCTGGCATCG
GCGACGGCGC GCAATCTGGC GCTTAACCGT CTCGACGATG GACGCCATCG CTTCGAAACT
GCCGATTGTT TCGAGTTGCT GGAGCAGTAT GCCGCAGCCG GTCAACGCTT CGATCTGGTC
ATTCTCGACC CTCCCAGTTT TGCGCGGCGC AAAGAGAGCC GATATGCCGC ACAGCGCGCG
TATGTGCGAC TTAATGCGCT GGGCATGCGC TGCGTGAAAC CTGGAGGTCT GTTGGCGACT
GCGAGTTGCA CCACACAGGT GGGACCAGAG GCGTTCCGTG AGGCGCTGGC ATCCGCAGGC
GCTCTTGCCG AGCGGCGGCT GCGGATTATC CACGAAGCCG GTCAACCGCT CGATCATCCG
GTTCCGGCAC ATTTTCCCGA AGGGCGGTAT TTGAAGTTCG TGGTTGGGCG GGTGGAGGAA
GCAGTGTAA
 
Protein sequence
MVRRVEIVVP SLLRERLAQG HPWVYRDHVS PHVRLPSGAW VIVRCGAWRG YALWDAEGPI 
ALRIFSTRTV PDIAWLRERL TAAWNLRAPL RAAGITAYRW VFGEGDGVPG IVVDRYNDIA
VLQASSAGTL TLIEDVATAI LKVDPTVRRV ALRMATESRS AIDEGDEGDG DARLRSLYGE
SPPREIVVVE HGIRFAVALH TAQKTGLFLD QRENRRFVEG LAAGRTVLNC FAYTGGFSLY
ALRGGARQVV SVDVGKGLAS ATARNLALNR LDDGRHRFET ADCFELLEQY AAAGQRFDLV
ILDPPSFARR KESRYAAQRA YVRLNALGMR CVKPGGLLAT ASCTTQVGPE AFREALASAG
ALAERRLRII HEAGQPLDHP VPAHFPEGRY LKFVVGRVEE AV