Gene Information |
Locus tag | Rsph17029_0119 |
Symbol | |
ID | 4897384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 134469 |
End bp | 135587 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640110702 |
Product | DNA methylase N-4/N-6 domain-containing protein |
Protein accession | YP_001042011 |
Protein GI | 126460897 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
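The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon. Neither assumption is stated in the record, though both are consistent with the reported lengths. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(134469, 135587)
print(length_bp)                  # 1119
print(protein_length(length_bp))  # 372
```

Both results match the Gene Length (1119 bp) and Protein Length (372 aa) fields above.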
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.222875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.070794 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCA AGACCAAGAC GACGGAGGCC CCCGTGCTTC CGCTGAACCA GATCCTTGCG GGCGATTGCA TCGAGACGAT GCGGTCCCTG CCCGAATGTT CGGTCGACCT GATCTTCGCC GATCCGCCCT ACAACCTGCA GCTCCGCGGC GACCTGCACC GGCCCGACAA CAGCCGGGTG GATGCGGTGG ACGACCACTG GGACCAGTTC TCGAGCTTCT CGGTCTATGA CCAGTTCACC CGCGAATGGC TCGCGGCCGC CCGGCGGCTG CTGAAGCCCA ACGGCGCGAT CTGGGTCATC GGCAGCTATC ACAACATCTT CCGCGTGGGG GCAGCCCTTC AGGACCAGGG CTTCTGGATC CTGAACGATG TGGTCTGGCG CAAGTCGAAC CCGATGCCGA ACTTCAAGGG CAAGCGGCTG ACCAACGCGC ACGAGACGCT GATCTGGGCC TCGAAGCAGG AAGCCAGCAA ATATACCTTC AATTACGAGG CACTGAAGGC CCTGAACGAG GGCGTGCAGA TGCGCTCGGA CTGGGTGATC CCGATCTGCA CCGGCCATGA GCGGCTGAAG GACGAGCAGG GCGACAAGGC CCACCCGACC CAGAAGCCCG AGGCGCTGCT GCACCGGGTG ATGGTCGCCA CGACCAATCC GGGCGACGTG GTGCTCGACC CGTTCTTCGG CACCGGCACG ACCGGCGCGG TGGCCAAGAT GCTCGGCCGC GACTTCATCG GCATCGAGCG CGAAGAGAGC TACCGCAGGA TCGCGGCCGA GCGGCTGTCG CGCGTGCGCC GCTACGACGC CTCGGCGCTC GAGGTCTCGG GCTCGAAGCG GGCCGAGCCG CGGGTGCCCT TCGGCCAGCT GGTCGAGCGC GGGATGCTGC GCCCGGGCGA AGAGCTCTAT TCGATGAACA ACCGCCACAA GGCGAAGGTG CGCGCCGACG GCACGCTGAT CGGCAACGAT GTGAAGGGCT CGATCCACCA GGTCGGCGCC GCGCTGGAAG GCGCGCCCTC CTGCAACGGC TGGACCTACT GGTGCTACAA GCGCGAGGGG AAGATGATCC CCATCGACAT CCTGCGCCAG CAGATCCGGG CGGAGATGGA AGACCCGCGC CCCAACTGA
|
Protein sequence | MATKTKTTEA PVLPLNQILA GDCIETMRSL PECSVDLIFA DPPYNLQLRG DLHRPDNSRV DAVDDHWDQF SSFSVYDQFT REWLAAARRL LKPNGAIWVI GSYHNIFRVG AALQDQGFWI LNDVVWRKSN PMPNFKGKRL TNAHETLIWA SKQEASKYTF NYEALKALNE GVQMRSDWVI PICTGHERLK DEQGDKAHPT QKPEALLHRV MVATTNPGDV VLDPFFGTGT TGAVAKMLGR DFIGIEREES YRRIAAERLS RVRRYDASAL EVSGSKRAEP RVPFGQLVER GMLRPGEELY SMNNRHKAKV RADGTLIGND VKGSIHQVGA ALEGAPSCNG WTYWCYKREG KMIPIDILRQ QIRAEMEDPR PN
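The sequences above are displayed in space-separated groups of 10 characters, so the spaces have to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction and a rough CDS sanity check. The helper names are hypothetical. The start-codon test accepts only ATG, which is a simplification, because translation table 11 also permits alternative start codons.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon.

    Simplification: alternative start codons are not accepted here.
    """
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# The first two display groups of the gene sequence, used as a small example.
prefix = clean_sequence("ATGGCGACCA AGACCAAGAC")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.55
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field. The result is not quoted here because it has not been computed for this record.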