Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3189 |
Symbol | |
ID | 5210160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4013187 |
End bp | 4014143 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640596781 |
Product | HhH-GPD family protein |
Protein accession | YP_001277500 |
Protein GI | 148657295 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATAC GTAAGAGTAT CATCCAAAAG AACGAAAGTT TACCGGCGTT TACCAGGTTC CATCAGGCGC TGATGAACTG GTTCAGTGAG GCGGCACGCG ACCTCCCCTG GCGCCGCACC CGCGATCCAT ACCGCATTAT GGTTGCAGAG GTGATGCTCC AGCAAACACA GGTTGATCGC GTGTTGCCGA AGTACGAAGC GTTCCTTACA TGCTTCCCGA CGCTTCAGGC GCTGGCAGAC GCACCGACCG CAGAGGTCAT CCGTCTGTGG TCGGGGCTTG GCTACAATCG CCGGGCGGTC AATCTGCAAC GCGCAGCACG TGAAATCGTC GAACGCTTCG ACGGCGTTTT TCCGCGCGAT GTCGCTGTGC TGCTGACGCT TCCGGGCATC GGACCCTACA CCGCTGGTGC TATCGCCTGT TTTGCCTTCG AGCAGGATGT GGCATTCATG GACACCAACA TCCGGCGCGT TATTCGCCGC GCATTGACCG ATCCTGCGGC AACGGTCAAC GAACGAGATT TGCTGGCGCT GGCGCAGGCA GCGCTCCCAA CCGGGCGCAG CTGGATGTGG AACCAGGCGT TGATGGAACT GGGGTCGCTG ATCTGCACTG CCGACTCGCC AGCATGCTGG CGCTGTCCAC TGCGCGATCT GTGCTGCGAC TATGCCGCGC GCCGCACGTC GGACGGGCAT CTTGAAGCGA CGCCGGTGCG CAAACGCATT GCTGAACATC GTGAACGCCC GTTCGTCGGA TCGAATCGCT ACTTCCGCGG ACGTGCTGTT GCCGCGCTCC GCGCATTACC CCCCGGCACA ACCCTTGACC TGGCAGAACT TGGACCACAA GTGCGCCCCG ATTATACCCC GGAAGATGAA GCCTGGCTGG TGACCCTCCT CAACGGATTG GAGCGCGATG GATTAGTCGT GTGGCATGGC AATGGGGTAC GACTTCCGGA GGAATGA
|
Protein sequence | MTIRKSIIQK NESLPAFTRF HQALMNWFSE AARDLPWRRT RDPYRIMVAE VMLQQTQVDR VLPKYEAFLT CFPTLQALAD APTAEVIRLW SGLGYNRRAV NLQRAAREIV ERFDGVFPRD VAVLLTLPGI GPYTAGAIAC FAFEQDVAFM DTNIRRVIRR ALTDPAATVN ERDLLALAQA ALPTGRSWMW NQALMELGSL ICTADSPACW RCPLRDLCCD YAARRTSDGH LEATPVRKRI AEHRERPFVG SNRYFRGRAV AALRALPPGT TLDLAELGPQ VRPDYTPEDE AWLVTLLNGL ERDGLVVWHG NGVRLPEE
|
| |