Gene Rcas_1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1389 
Symbol 
ID5538862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1775718 
End bp1776881 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content60% 
IMG OID640893527 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001431503 
Protein GI156741374 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.376971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCC TGACCTTCTA CGAATTCTTC GCCGGCGGCG GCCTGGCACG GATTGGACTG 
GGACCACAGT GGACCTGCCT GTTCGCCAAT GACATCGACC CCAAAAAAGC GGACGTGTAC
CGGCGTAACT TCTCCGGCGC GCCAGAGCTG GTTGTTGCGG ATATCCATCG TGTGACGACG
GATATGCTTC CGGGTCGCGC GCTATTGGCA TGGGCGTCGT TCCCCTGTCA GGATCTGTCG
CTCGCGGGGA AAGGCGGCGG GTTGCGCGCC GAACGCAGCG GCACATTCTG GTCATTCTGG
AATCTCGTGA CCACGCTCGA CCGGGAAGGG AGACCGGCGC CGATCATTGC CATTGAAAAT
GTTGTCGGTT TGCTTACCTC AAATCGAGGA CGCGATTTTC AGGAACTGGT CTCGGTTATT
GTTGCACAAG GATACCGCCT GGGCGCCATG GTCATTGACG CCGTGCATTT CGTTCCTCAA
TCGCGACCAC GGCTCTTCAT CGTTGCAGTC AAGGACGACG TGACGATACC AGAGATGGTG
ATCACGCCCA CGCCGCACGC AACGTGGCAT CCGGCAGCGG TCGTTCGCGC CTTCCGTCAT
CTCGCGCCAT TGGCGCAAGA TGCATGGGTC TGGTGGAGTC TCCCCTTGCC ACAACGAACG
CCACGCCGTA TTCACGATGT GATCGATCCT GAGCCAACCG GCGTTTCCTG GCATCGCCCC
GAAGAAACAC AACGCCTCCT GTCACTGATG TCTCCGCTCA ATCTCGCCAA GGTGCGTCAC
GCGCAGTTGA CCGGTCGTCT CCACATCGGC GCCATCTACA AGCGCACGCG CCTTCAGAAC
GGAGCCAAGC GCCAGCGCGC AGAAGTGCGG TTCGATGGCA TCAGCGGTTG TCTGCGGACA
CCGGCTGGAG GTTCGAGCCG ACAGACGATT CTGGTCGTCG AAGGGGATGT CATCCGCTCA
CGGCTGCTTT CGGTACGCGA AGCGGCGCGC TTAATGGGCT TGCCCGACCG ATACTGGTTG
CCTGGACGCT ACAATGACGG GTATTATGTC ATGGGCGACG CGGTCGTCGT GCCCGTCGTT
TCGTGGCTGG AGGAGCATAT CCTTCGTCCG ATTGCAACAT CTATCGTGCA GAATGAGGAG
TCCCTGGCGT ATGTCAGCCC GTGA
 
Protein sequence
MTGLTFYEFF AGGGLARIGL GPQWTCLFAN DIDPKKADVY RRNFSGAPEL VVADIHRVTT 
DMLPGRALLA WASFPCQDLS LAGKGGGLRA ERSGTFWSFW NLVTTLDREG RPAPIIAIEN
VVGLLTSNRG RDFQELVSVI VAQGYRLGAM VIDAVHFVPQ SRPRLFIVAV KDDVTIPEMV
ITPTPHATWH PAAVVRAFRH LAPLAQDAWV WWSLPLPQRT PRRIHDVIDP EPTGVSWHRP
EETQRLLSLM SPLNLAKVRH AQLTGRLHIG AIYKRTRLQN GAKRQRAEVR FDGISGCLRT
PAGGSSRQTI LVVEGDVIRS RLLSVREAAR LMGLPDRYWL PGRYNDGYYV MGDAVVVPVV
SWLEEHILRP IATSIVQNEE SLAYVSP