Gene Rcas_2228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2228 
Symbol 
ID5539709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2876810 
End bp2878390 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content54% 
IMG OID640894361 
Productsite-specific DNA-methyltransferase (adenine-specific) 
Protein accessionYP_001432329 
Protein GI156742200 
COG category[L] Replication, recombination and repair 
COG ID[COG0827] Adenine-specific DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0314397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000846728 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTACGA CGAACTACAA CCCTGATGTG CTCTCCTGCC TGGCGAACCT CTCCAGCGAC 
GAGGTGTTCA CGCCGCCGCA ACTGGCTAAC CAGATGCTGG ACCTGCTGCC GGCTGAGCTG
TGGAGCAATC CCGAGGCGCG CTTTCTCGAC CCCTGCTGCA AATCGGGCGT GTTCCTGCGC
GAGATCGCCA AGCGGCTGGA CAAGGGATTG GAAGCCAAAA TCCCTGACCG GCAGGCACGG
ATTAACCATA TTATGAAAAA CCAGCTCTTC GGCATCGCCA TCACCGAGCT AACTGCGCTC
ATCTCGCGTC GTTCGCTCTA CTGCTCCAAG ACTGCCAACG GCAAGTATTC GGTCTGCACC
GCCTTTGACA CACCCGAAGG CAACATCCGC TACCGGCGCA TTGAGCACAC CTGGAAAGAC
GGCCGCTGCG TGTTCTGCGG CGCCAACGAA GCCAACTACG CCCGCGGCGC GGAACTGGAA
ACCCACGCCT ACGAGTTCAT CCACACCGAC AACCCCCTTT GCCTTTTCGA CGAGCGAAGT
GAGGGTTGTC ATTTCGACGA GCGAAGTGAG GAGAAATCCA TGAAATTCGA TGTGATCATC
GGCAATCCGC CGTATCAGTT GAGCGACGGC GGCTTTGGCC GAAGCGCCAC TCCGATTTAC
AACAAGTTTG TCCATCAGGC GAAGAAGCTC AATCCACGCT ATCTCGTGAT GATTATTCCA
GCACGTTGGT ATTCCGGCGG AAAGGGCTTG GACGACTTCC GCGAGGGGAT GCTAAAAGAC
AACCGCATTC TCGAGATACA CGACTTCCCC GATGCGACAG ATGTTTTCCC CGGTGTACAA
ATCAAAGGTG GTGTCTGCTA TTTTCTTTGG GACCGCGATC ATCCCGGCTT GTGCAAGGTC
TCCAACTATT TGCACGGCAA GGTGGCAACG ATGGAGCGCC CGTTGATGGA AGCGGGGGCA
GATACCTTTA TCCGCTACAA CGAGGCGATT TCTATTCTCC GTAAAGTTCA GCAATTCAAC
GAACCCTCGT TCAAGACGCT GGTAAGCGCA CAAAAGCCGT TTGGGCTTAG AACGTATGTT
CTTGGCAAAC CAGAACCATT TCCGGGAGCA GTGAAGCTTT ATCAGAACGG CGGCGTTGGT
TGGGTCAGCC GAAAGGAAAT CGCACAAAAT CGGGAATGGA TTGATACTTA TAAGGTTTTT
ATTCCCAGAT TAGGTAGTGG AAGTGACAGT TTCCCACATC CGATACTAGG ACATCCATTT
GTAGGCGAAC GCAATTCTGC CTGCACCGAG ACGTATCTTG TCATCGGCCC TTGCAACACA
GAGACCAAAG CAGAAAATAT CATTTCGTAT ATCCGAACTC GCTTCTTTCG ATTTCTTGCG
CTATTAAACA AGCCGACGCA AGATGCGCCC AAGCGAGTCT ATCAATTCGT CCCCATGCAA
GACTTCTCCA AGCCTTGGAC TGATGAAGAA CTCTACCAAA AATACGGCCT AACCCAAGAC
GAGATCGCCT TCATCGAATC CATGGTCCGG CCGATGCCAG CGGAGGACGA ACCGGAACCG
ACGGAGGAAA CCGATGAGTA A
 
Protein sequence
MPTTNYNPDV LSCLANLSSD EVFTPPQLAN QMLDLLPAEL WSNPEARFLD PCCKSGVFLR 
EIAKRLDKGL EAKIPDRQAR INHIMKNQLF GIAITELTAL ISRRSLYCSK TANGKYSVCT
AFDTPEGNIR YRRIEHTWKD GRCVFCGANE ANYARGAELE THAYEFIHTD NPLCLFDERS
EGCHFDERSE EKSMKFDVII GNPPYQLSDG GFGRSATPIY NKFVHQAKKL NPRYLVMIIP
ARWYSGGKGL DDFREGMLKD NRILEIHDFP DATDVFPGVQ IKGGVCYFLW DRDHPGLCKV
SNYLHGKVAT MERPLMEAGA DTFIRYNEAI SILRKVQQFN EPSFKTLVSA QKPFGLRTYV
LGKPEPFPGA VKLYQNGGVG WVSRKEIAQN REWIDTYKVF IPRLGSGSDS FPHPILGHPF
VGERNSACTE TYLVIGPCNT ETKAENIISY IRTRFFRFLA LLNKPTQDAP KRVYQFVPMQ
DFSKPWTDEE LYQKYGLTQD EIAFIESMVR PMPAEDEPEP TEETDE