Gene Information |
Locus tag | Rcas_0397 |
Symbol | |
ID | 5537859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 501587 |
End bp | 504673 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640892560 |
Product | DNA methylase N-4/N-6 domain-containing protein |
Protein accession | YP_001430547 |
Protein GI | 156740418 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2189] Adenine specific DNA methylase Mod |
TIGRFAM ID | |
| |
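The coordinates and lengths listed under Gene Information agree with each other, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Rcas_0397.
length = gene_length_bp(501587, 504673)
protein = expected_protein_length_aa(length)
print(length, protein)  # 3087 1028
```

Both results match the reported Gene Length (3087 bp) and Protein Length (1028 aa).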
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00293449 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAAACAA TAACCTTAGA AAAATTCCAA TCCCTCCTGC GTGAGCTGTT CCAGTTCGAC TGCGCCGACC TCGATTTCGG CATCTATCGC ATCATGAACC TCAAACGGGC GGTCATTGAG CGTTTCATCG CTGAAGACCT GCCCAAGGAG ATTGCCGAAG AACTCCAGAG AGGCGCATTT GCTGAACAGG AACGGGCGCA ACAGACCCTG AAAGAAGTCC GGCAGAAGCT GCTCGATGTC CTGGGCGAGG ACGCGCTGGA CGCAAACGGC AATCTGGCCG AGAAGTATCG CGAGACGAGA GCGGGCCGGG AATACCTGGA GGCGCAAGCC AAAGCCGCGG GCGGCCGCTC CGCCGAGGCG CTGGAAGCGG ACGTATACAA CCGCCTCTAC GCCTTTTTCA GCCGCTACTG GCAGGAAGGC GACTTTATCT CCAAGCGCCG CTACTCGAAA AAGGAACGCT ACGCCATCCC CTACAACGGC GAGGAAGTCT ATCTCTACTG GGCCAACCAT GACCAGTACT ATGTCAAAAC CGGCGAGTAT TTCACTGACT ATACCTACCA GGCGCCCAAC GGCGTCACCG TGCAGTTCAA ACTCAAGCAA GCCGATGTGG AGCAGAACAA CGTCAAGGGC GAAAAGCGCT TCTTCCTGCC GCGCCTGGAC GAAATCAGTT GGGACGAGCC CGCCCGTCTG CTCACGATTC CCTTTGAGTT CCGTCCGCTC ACCGAGCAGG AAAACATCGC CTTTGGACAG AAGAACCAGC AGGAGTCCAT CATCGCCAAG GCGGTCGCCG AGATTCCGAA GCGCGTCCAG GCTGCCGACG CCCTGGCCGC GCTGCTCGCT GAACGCCGCA AGACCGAAAA AGGCGAGACC GTCACCTGCC TGGAGCACCA CCTGCGCCAG TACACTCGCC GCAACACGTC CGACTTTTTT ATCCACAAGG ACCTGCGCGG CTTTCTTTCG CGCGAACTGG ACTTTTACCT CAAGAATGAA GTGCTCAACC TGGACGAGAT GGAAGCCGCT GGCGAGGGGC TGGCCGAAGG CTGGTTTCAA CTCATGCGCC TCATCAAGCG CATTGGCCTG AAGATCGTCG AGTTTCTCGC GCAGATTGAA GACTTCCAGA AGGCGCTGTG GGAGAAGAAA AAGTTCGTCA CCGAGACCTT TTACGTGGTC GCCGTGGGCA ACATTCCCGA AGCCTTCTAC CCGGAAGTCG CCGCCAATGA CCCGCAGTGG GAAGAGTGGG AACAATTAGG AATGGTGAAT GCTGAATGCA GAAAGATGAC TGCGGAAGAG CGCTCATCAT TCCTCATGCA TCATTCCTCA TTGCCTTTGG ACACGCGCCA CTTCCCGCCG GAGTTCACCG ACCGCCTGCT GGCGTCTTTT CAGAACCTGG ACGAGATGAC CGACGGCCTC CTGGTGCACT CCGAAAACTG GCAGGCGCTG AACCTCTTGC AGGAGAAGTA CCGCGAGCGG GTGAAGTGCA TTTACATTGA TCCGCCGTAT AACACCGGGA ACGATGAGTT TCTCTACAAG GATAGTTACC AGCACTCCTG TTGGCTCAGT ATGATGTCTC AACGCCTTTC TTTATCATCA AGTTATTTGG AAAGCCAAGG ATGCCTATTC ATCAGCATAG ACGATATAGA GTTTCCCGCG CTGAGATATA TGCTGCAACA TCTATTCAGG GAGGACACTA TCATTTCGGA ACTGGTTTGG AAAAAACGTA GCGGCGGTGA CATGACAGCC GGGCGAGGGG CACGGCTTTC CGTCGATCAC GATTACGTAG TTGCAGCGTT AACCGAGGGC 
GCTTCTGGCT TTTCTGGTTT GCCTATTAGT GAGGAAGATT ACACTAATCC CGATAATGAC CCAGATGGCC CTTGGACCAC TGGCGATTTG ACGTGCAATA AAACTGCCGA AGAAAGACCC AATCTCTTTT ACGATCTTAT TGACCCTACA ACAGGAAACG TGTTCAAGTG CAATCCTAAG CGGGTTTGGG CATACGAACC CGAGAAGATG AAGCAGTTCA TCGAAAGAAC CGTTAATGGG AGATGGCTGC CGAAGGTGCT ATTCCCAAGC GATCCAACAA AACGTCCAAA GCTGAAGGTT TTTCTTAAAG AGCGCGAGAG AGCAACAAGA CTGTTCTCTT CTTGGATGGA AGAGGTGCCG CTGAATGCTA AGGCAACGAG ATTACTTGAT AATATCCTCG GTCAGAGGGT GCTCCTTTAT CCAAAGCCAG TTGAATTAAT CGAGAGTCTT GCCAGACAAA CCTTGATGAA TGAGGCAGAT GTGGTTCTTG ATTACTTCGC CGGCTCCGGC ACGACCGGAC ACGCGGTTAT CAACCTGAAC CGCAAGGACG GCGGCAGGCG CAAGTTCATC CTGGTGGAGA TGGCGCAGTA CTTCGACACT GTGCTCCTGC CGCGCATCAA GAAGGTCACC TTCACGCCGG AGTGGAAGGA CGGCAAGCCC AGGCGCATGG CCACCGCCGA GGAAGCCGCG CGCTCCCCGC GCATCGTCAA GGTCATCCGG CTGGAGTCCT ACGAGGACGC GCTTAACAAC CTGACCTTTG ATGAGGAAAG CGGCCAGCAG GCGCTCGATC TGTTCGGCCA AGAGTACCTC CTTTCCTACA TGCTCAAGTG GGAGACGCGC CGCAGCGAGA CCCTGCTCAA CGTGGCGCAG TTGCAGTCGC CGTTCTCTTA CAAACTCCAC ATCCACCGCG ACGGCGAAAC CCGCGAGCAG CCGGTGGACC TGCCCGAAAC CTTCGCCTAC CTGCTGGGGC TGGACGTGCA GACGCGCAAG GTGTATCGAA ATGATGAACG CAGAATGATG AATGATGAAT CTCATCATTC ATCATTCATC ACTCATCATT CCTACCTGGT TTATCGCGGG GCGCTGCGCG ATGGCCGCAG CGTGGCCGTC ATCTGGCGCG AGACCAAAGG CTGGACAACG GAAGACTACC GGCGCGACGC CGCCTTCGTC TCCGAACAGA AACTGGCTGA GGGCGCGGAT GAGGTCTGGG TCAACGGCGA TGCGCTTATC CCCGGCGCGC GCTCGCTGGA TCCCATTTTC AAGGAAAGAA TGATGAATGC AGAATGA
|
Protein sequence | MKTITLEKFQ SLLRELFQFD CADLDFGIYR IMNLKRAVIE RFIAEDLPKE IAEELQRGAF AEQERAQQTL KEVRQKLLDV LGEDALDANG NLAEKYRETR AGREYLEAQA KAAGGRSAEA LEADVYNRLY AFFSRYWQEG DFISKRRYSK KERYAIPYNG EEVYLYWANH DQYYVKTGEY FTDYTYQAPN GVTVQFKLKQ ADVEQNNVKG EKRFFLPRLD EISWDEPARL LTIPFEFRPL TEQENIAFGQ KNQQESIIAK AVAEIPKRVQ AADALAALLA ERRKTEKGET VTCLEHHLRQ YTRRNTSDFF IHKDLRGFLS RELDFYLKNE VLNLDEMEAA GEGLAEGWFQ LMRLIKRIGL KIVEFLAQIE DFQKALWEKK KFVTETFYVV AVGNIPEAFY PEVAANDPQW EEWEQLGMVN AECRKMTAEE RSSFLMHHSS LPLDTRHFPP EFTDRLLASF QNLDEMTDGL LVHSENWQAL NLLQEKYRER VKCIYIDPPY NTGNDEFLYK DSYQHSCWLS MMSQRLSLSS SYLESQGCLF ISIDDIEFPA LRYMLQHLFR EDTIISELVW KKRSGGDMTA GRGARLSVDH DYVVAALTEG ASGFSGLPIS EEDYTNPDND PDGPWTTGDL TCNKTAEERP NLFYDLIDPT TGNVFKCNPK RVWAYEPEKM KQFIERTVNG RWLPKVLFPS DPTKRPKLKV FLKERERATR LFSSWMEEVP LNAKATRLLD NILGQRVLLY PKPVELIESL ARQTLMNEAD VVLDYFAGSG TTGHAVINLN RKDGGRRKFI LVEMAQYFDT VLLPRIKKVT FTPEWKDGKP RRMATAEEAA RSPRIVKVIR LESYEDALNN LTFDEESGQQ ALDLFGQEYL LSYMLKWETR RSETLLNVAQ LQSPFSYKLH IHRDGETREQ PVDLPETFAY LLGLDVQTRK VYRNDERRMM NDESHHSSFI THHSYLVYRG ALRDGRSVAV IWRETKGWTT EDYRRDAAFV SEQKLAEGAD EVWVNGDALI PGARSLDPIF KERMMNAE
|
| |
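The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. It assumes GC content means the share of G and C bases over all bases (the page does not state how the 56% figure was derived), and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). All names are hypothetical, and alternative start codons are not handled.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_30 = clean_sequence("ATGAAAACAA TAACCTTAGA AAAATTCCAA")
print(translate(first_30))  # MKTITLEKFQ
print(f"{gc_fraction(first_30):.1%}")
```

The translated fragment matches the first block of the listed protein sequence. Passing the full cleaned gene sequence to the same functions would give the whole-gene translation and GC fraction.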