Gene Rcas_0397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0397 
Symbol 
ID5537859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp501587 
End bp504673 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content56% 
IMG OID640892560 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001430547 
Protein GI156740418 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00293449 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAAACAA TAACCTTAGA AAAATTCCAA TCCCTCCTGC GTGAGCTGTT CCAGTTCGAC 
TGCGCCGACC TCGATTTCGG CATCTATCGC ATCATGAACC TCAAACGGGC GGTCATTGAG
CGTTTCATCG CTGAAGACCT GCCCAAGGAG ATTGCCGAAG AACTCCAGAG AGGCGCATTT
GCTGAACAGG AACGGGCGCA ACAGACCCTG AAAGAAGTCC GGCAGAAGCT GCTCGATGTC
CTGGGCGAGG ACGCGCTGGA CGCAAACGGC AATCTGGCCG AGAAGTATCG CGAGACGAGA
GCGGGCCGGG AATACCTGGA GGCGCAAGCC AAAGCCGCGG GCGGCCGCTC CGCCGAGGCG
CTGGAAGCGG ACGTATACAA CCGCCTCTAC GCCTTTTTCA GCCGCTACTG GCAGGAAGGC
GACTTTATCT CCAAGCGCCG CTACTCGAAA AAGGAACGCT ACGCCATCCC CTACAACGGC
GAGGAAGTCT ATCTCTACTG GGCCAACCAT GACCAGTACT ATGTCAAAAC CGGCGAGTAT
TTCACTGACT ATACCTACCA GGCGCCCAAC GGCGTCACCG TGCAGTTCAA ACTCAAGCAA
GCCGATGTGG AGCAGAACAA CGTCAAGGGC GAAAAGCGCT TCTTCCTGCC GCGCCTGGAC
GAAATCAGTT GGGACGAGCC CGCCCGTCTG CTCACGATTC CCTTTGAGTT CCGTCCGCTC
ACCGAGCAGG AAAACATCGC CTTTGGACAG AAGAACCAGC AGGAGTCCAT CATCGCCAAG
GCGGTCGCCG AGATTCCGAA GCGCGTCCAG GCTGCCGACG CCCTGGCCGC GCTGCTCGCT
GAACGCCGCA AGACCGAAAA AGGCGAGACC GTCACCTGCC TGGAGCACCA CCTGCGCCAG
TACACTCGCC GCAACACGTC CGACTTTTTT ATCCACAAGG ACCTGCGCGG CTTTCTTTCG
CGCGAACTGG ACTTTTACCT CAAGAATGAA GTGCTCAACC TGGACGAGAT GGAAGCCGCT
GGCGAGGGGC TGGCCGAAGG CTGGTTTCAA CTCATGCGCC TCATCAAGCG CATTGGCCTG
AAGATCGTCG AGTTTCTCGC GCAGATTGAA GACTTCCAGA AGGCGCTGTG GGAGAAGAAA
AAGTTCGTCA CCGAGACCTT TTACGTGGTC GCCGTGGGCA ACATTCCCGA AGCCTTCTAC
CCGGAAGTCG CCGCCAATGA CCCGCAGTGG GAAGAGTGGG AACAATTAGG AATGGTGAAT
GCTGAATGCA GAAAGATGAC TGCGGAAGAG CGCTCATCAT TCCTCATGCA TCATTCCTCA
TTGCCTTTGG ACACGCGCCA CTTCCCGCCG GAGTTCACCG ACCGCCTGCT GGCGTCTTTT
CAGAACCTGG ACGAGATGAC CGACGGCCTC CTGGTGCACT CCGAAAACTG GCAGGCGCTG
AACCTCTTGC AGGAGAAGTA CCGCGAGCGG GTGAAGTGCA TTTACATTGA TCCGCCGTAT
AACACCGGGA ACGATGAGTT TCTCTACAAG GATAGTTACC AGCACTCCTG TTGGCTCAGT
ATGATGTCTC AACGCCTTTC TTTATCATCA AGTTATTTGG AAAGCCAAGG ATGCCTATTC
ATCAGCATAG ACGATATAGA GTTTCCCGCG CTGAGATATA TGCTGCAACA TCTATTCAGG
GAGGACACTA TCATTTCGGA ACTGGTTTGG AAAAAACGTA GCGGCGGTGA CATGACAGCC
GGGCGAGGGG CACGGCTTTC CGTCGATCAC GATTACGTAG TTGCAGCGTT AACCGAGGGC
GCTTCTGGCT TTTCTGGTTT GCCTATTAGT GAGGAAGATT ACACTAATCC CGATAATGAC
CCAGATGGCC CTTGGACCAC TGGCGATTTG ACGTGCAATA AAACTGCCGA AGAAAGACCC
AATCTCTTTT ACGATCTTAT TGACCCTACA ACAGGAAACG TGTTCAAGTG CAATCCTAAG
CGGGTTTGGG CATACGAACC CGAGAAGATG AAGCAGTTCA TCGAAAGAAC CGTTAATGGG
AGATGGCTGC CGAAGGTGCT ATTCCCAAGC GATCCAACAA AACGTCCAAA GCTGAAGGTT
TTTCTTAAAG AGCGCGAGAG AGCAACAAGA CTGTTCTCTT CTTGGATGGA AGAGGTGCCG
CTGAATGCTA AGGCAACGAG ATTACTTGAT AATATCCTCG GTCAGAGGGT GCTCCTTTAT
CCAAAGCCAG TTGAATTAAT CGAGAGTCTT GCCAGACAAA CCTTGATGAA TGAGGCAGAT
GTGGTTCTTG ATTACTTCGC CGGCTCCGGC ACGACCGGAC ACGCGGTTAT CAACCTGAAC
CGCAAGGACG GCGGCAGGCG CAAGTTCATC CTGGTGGAGA TGGCGCAGTA CTTCGACACT
GTGCTCCTGC CGCGCATCAA GAAGGTCACC TTCACGCCGG AGTGGAAGGA CGGCAAGCCC
AGGCGCATGG CCACCGCCGA GGAAGCCGCG CGCTCCCCGC GCATCGTCAA GGTCATCCGG
CTGGAGTCCT ACGAGGACGC GCTTAACAAC CTGACCTTTG ATGAGGAAAG CGGCCAGCAG
GCGCTCGATC TGTTCGGCCA AGAGTACCTC CTTTCCTACA TGCTCAAGTG GGAGACGCGC
CGCAGCGAGA CCCTGCTCAA CGTGGCGCAG TTGCAGTCGC CGTTCTCTTA CAAACTCCAC
ATCCACCGCG ACGGCGAAAC CCGCGAGCAG CCGGTGGACC TGCCCGAAAC CTTCGCCTAC
CTGCTGGGGC TGGACGTGCA GACGCGCAAG GTGTATCGAA ATGATGAACG CAGAATGATG
AATGATGAAT CTCATCATTC ATCATTCATC ACTCATCATT CCTACCTGGT TTATCGCGGG
GCGCTGCGCG ATGGCCGCAG CGTGGCCGTC ATCTGGCGCG AGACCAAAGG CTGGACAACG
GAAGACTACC GGCGCGACGC CGCCTTCGTC TCCGAACAGA AACTGGCTGA GGGCGCGGAT
GAGGTCTGGG TCAACGGCGA TGCGCTTATC CCCGGCGCGC GCTCGCTGGA TCCCATTTTC
AAGGAAAGAA TGATGAATGC AGAATGA
 
Protein sequence
MKTITLEKFQ SLLRELFQFD CADLDFGIYR IMNLKRAVIE RFIAEDLPKE IAEELQRGAF 
AEQERAQQTL KEVRQKLLDV LGEDALDANG NLAEKYRETR AGREYLEAQA KAAGGRSAEA
LEADVYNRLY AFFSRYWQEG DFISKRRYSK KERYAIPYNG EEVYLYWANH DQYYVKTGEY
FTDYTYQAPN GVTVQFKLKQ ADVEQNNVKG EKRFFLPRLD EISWDEPARL LTIPFEFRPL
TEQENIAFGQ KNQQESIIAK AVAEIPKRVQ AADALAALLA ERRKTEKGET VTCLEHHLRQ
YTRRNTSDFF IHKDLRGFLS RELDFYLKNE VLNLDEMEAA GEGLAEGWFQ LMRLIKRIGL
KIVEFLAQIE DFQKALWEKK KFVTETFYVV AVGNIPEAFY PEVAANDPQW EEWEQLGMVN
AECRKMTAEE RSSFLMHHSS LPLDTRHFPP EFTDRLLASF QNLDEMTDGL LVHSENWQAL
NLLQEKYRER VKCIYIDPPY NTGNDEFLYK DSYQHSCWLS MMSQRLSLSS SYLESQGCLF
ISIDDIEFPA LRYMLQHLFR EDTIISELVW KKRSGGDMTA GRGARLSVDH DYVVAALTEG
ASGFSGLPIS EEDYTNPDND PDGPWTTGDL TCNKTAEERP NLFYDLIDPT TGNVFKCNPK
RVWAYEPEKM KQFIERTVNG RWLPKVLFPS DPTKRPKLKV FLKERERATR LFSSWMEEVP
LNAKATRLLD NILGQRVLLY PKPVELIESL ARQTLMNEAD VVLDYFAGSG TTGHAVINLN
RKDGGRRKFI LVEMAQYFDT VLLPRIKKVT FTPEWKDGKP RRMATAEEAA RSPRIVKVIR
LESYEDALNN LTFDEESGQQ ALDLFGQEYL LSYMLKWETR RSETLLNVAQ LQSPFSYKLH
IHRDGETREQ PVDLPETFAY LLGLDVQTRK VYRNDERRMM NDESHHSSFI THHSYLVYRG
ALRDGRSVAV IWRETKGWTT EDYRRDAAFV SEQKLAEGAD EVWVNGDALI PGARSLDPIF
KERMMNAE