Gene Rcas_4038 details


Gene Information

Locus tag: Rcas_4038
Symbol:
ID: 5541549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5238059
End bp: 5239510
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 44%
IMG OID: 640896151
Product: putative RNA methylase
Protein accession: YP_001434089
Protein GI: 156743960
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
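
The coordinate and length fields above can be cross-checked with simple arithmetic: the gene length is the inclusive span between the start and end positions, and the protein length is the codon count minus the terminating stop codon. A minimal Python sketch follows; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming it ends in one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(5238059, 5239510)
print(length_bp)                  # 1452
print(protein_length(length_bp))  # 483
```

Both values agree with the Gene Length and Protein Length fields of this record.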


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00846703
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCATAACC TTATATGGTT CTACGCCGCT ATGACCCCCT CTCTTGATCT TGAAAACCAG 
GTCTATATTG ACAGGCTGTG TCGTTTACTG GAAGAGGATC TTGATTTTCA CGATCAGGAC
AGCGGATACG CATCACACAA CATCCACTCT TTTCCAGCCA AGTTTCCGCC GCAGTTACCC
CGGAAGTTCA TTCAAGCGCT GACCCTTCCT GGTGAAACGG TTCTAGACCC AATGATGGGT
TCGGGCACCA CCGTGCTCGA GGCATTTCTG CTAGGTAGAC GAGGCATTGG CTTCGATATC
GACCCTCTGG CTGTTATGTT GGCAAAAGCG AAGGTTTCGC CCATCAGTCA TCATGATGCC
GTAATATGGA GCAGAGAAAT CATTAGCAAT GCAAGAGAGT CGTTTTTCTC CCAAAAAAGA
GTACTATATA ATGAAATAGA TCGGATGTGG GACGAAGAAA CGAGAGAATT TGTTGATTAC
TGGTTTTCTC GAGAAGTTCA ACTCGCTCTA ACAGCTCTTG TCGTCGAAAT AAATAGAATC
AACGATGAAA ATCTAAGGAA TTTCTTTAAC GTCATCCTCT CGTCAATTAT CATCACAAAA
TCTGGCGGTG TTTCACTTGC GCTCGATCTT GCTCATACTC GACCACATAG GGTTGACAGA
GCTATTGATT GGAATGGTTG TCCCTTAGAG ATGGACACCT CAAAAAGAAG TGAAAGAAGA
AAGTTATCTA AGAAAATCCG CTCCCCTTTC GAAGAGTTTG AAAAGAAGTG CATGCAGAGC
CTGAAGAACA TGTCCGAGAA TGGTTTGAAT AGTGACCAGT TGAGCATGAG ATGCCTTCCA
AACTGGGAAA ACGCGAGAAT GCAGCCTGAT ATATCAATGT GTAATGCAAA AAGTTTACTG
CTCAATGACG AGTGTGTTGA CATTATCATC ACATCTCCTC CTTATGCTTC TCACGCGATT
GATTATATGC GCGCTCATAA GTTTTCACTC GTGTGGCTTG GCTATGCTAT ACGAGAACTC
AGCGAAAGAA GGAAAAGATA CATAGGTGGA GATGCCTTAG AAGGGCATGG CTTTGAAAGT
CTTCCAGGTT ACACCTCAAG CATTATTGAT AGCCTAGCCC GAAGAGATCC TCGCAAGAGC
CTGGTGCTTC GCAGGTACTA CTCTGAAATG AAAGCCATCT TGCGAGAGAT GTTTCGTGTC
TTAAAAAAAG GACGGGTTGC TATCGTCGTC GTGGGTGAAT CGAAACTGAG AGGTCAAGAT
GTGGAAATCG ATGTTTGCCT GAGCGAAATC GGAGAGTCGC TCGGATTTTT TGTTCCTAGG
ATTGGTGTAC GCCGTCTGGA TAGGAATCGC CGAATGATGC CTGTCGGAAA TCAAGTTGAC
ATGAAGTCTC AAATACAACG ACGAATGCAC AAAGAGTTTG TTATTGGTTT CTATAAACCA
CCTATCGACT AA
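
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence block laid out like the one above. The sketch below is a hedged illustration: the helper names are hypothetical, and it assumes the only non-base characters in the block are whitespace (the 10-base groups and line breaks).

```python
def clean_sequence(block: str) -> str:
    """Strip the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted sequence block."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base groups of the gene sequence: 6 of 20 bases are G or C.
print(gc_percent("TTGCATAACC TTATATGGTT"))  # 30.0
```

Passing the full gene sequence block to `gc_percent` gives the value to compare against the reported GC content.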
 
Protein sequence
MHNLIWFYAA MTPSLDLENQ VYIDRLCRLL EEDLDFHDQD SGYASHNIHS FPAKFPPQLP 
RKFIQALTLP GETVLDPMMG SGTTVLEAFL LGRRGIGFDI DPLAVMLAKA KVSPISHHDA
VIWSREIISN ARESFFSQKR VLYNEIDRMW DEETREFVDY WFSREVQLAL TALVVEINRI
NDENLRNFFN VILSSIIITK SGGVSLALDL AHTRPHRVDR AIDWNGCPLE MDTSKRSERR
KLSKKIRSPF EEFEKKCMQS LKNMSENGLN SDQLSMRCLP NWENARMQPD ISMCNAKSLL
LNDECVDIII TSPPYASHAI DYMRAHKFSL VWLGYAIREL SERRKRYIGG DALEGHGFES
LPGYTSSIID SLARRDPRKS LVLRRYYSEM KAILREMFRV LKKGRVAIVV VGESKLRGQD
VEIDVCLSEI GESLGFFVPR IGVRRLDRNR RMMPVGNQVD MKSQIQRRMH KEFVIGFYKP
PID
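
The protein sequence relates to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). That table assigns the same amino acids as the standard code but allows alternative start codons such as TTG, which is read as methionine at the first position; this is why the gene begins with TTG while the protein begins with M. The following is a minimal Python sketch of that translation, not the database's own pipeline, and the names `translate_cds`, `CODON_TABLE` and `START_CODONS` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(block: str) -> str:
    """Translate a complete CDS, reading a valid start codon as M."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any permitted start codon initiates with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":       # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("TTGCATAACC TTATATGGTT CTACGCCGCT"))  # MHNLIWFYAA
```

The sketch does not handle ambiguous bases such as N; a codon containing one raises a `KeyError`.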