Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2917 |
Symbol | |
ID | 5540407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3780227 |
End bp | 3781180 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640895038 |
Product | HhH-GPD family protein |
Protein accession | YP_001432997 |
Protein GI | 156742868 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATAC ATAGGAGTAT CGATCAGAAC CACGAACATT CATTAAGTCT TAACGATCTC CATCAGGCGC TTTTGAAGTG GTTCAGCGAG GCGGCGCGCG ATCTTCCCTG GCGCCGCACC CGTGATCCAT ACCGCATTCT GGTGGCAGAA GTAATGCTCC AGCAAACACA GGTTGATCGG GTGCTGCCGA AGTACGCAGC GTTCCTCGAG CGTTTTCCGA CATTGCACAC ACTGGCGGAA GCGCCAACTG CCGAGGTCAT CCGTATGTGG GCTGGTCTGG GTTACAATCG GCGGGCCGTC AATCTGCAAC GCGCGGCGCG CGCGATCTGC GCGCGCTACG GTGGCGTTTT CCCACGGGAT GTCGCTACCC TGGTCACATT GCCGGGCATC GGGTCCTACA CCGCCGGCGC GGTTGCCTGC TTCGCCTTCG AGCAGGATGT AGCGTTCATG GACACGAACA TTCGGCGCGT GATCCGGCGC GTGTTCACCG ATCCGACGGA AACGGTCAAT GAACGCGCGC TGCTGGCGCT GGCGCGCGCG GCGCTTCCCG TCGGTCGCAG CTGGATGTGG AACCAGGCGC TGATGGAACT GGGGTCGCTC GTTTGCACCG CCGATGCGCC GGCATGCTGG CGCTGCCCGT TGCGCGATCA GTGCCGCGAC TATGCTGCGC GACGCGAATC GGACGAGCGT TTTGCGTCCG CGCCGGTGCG CAAGCGCCTC GCCGAACGTC GTGAACGCCC GTTTATCGGC TCGAATCGCT ACTTCCGTGG GCGCATCATC GAGGCGCTCC GCATGCTTCC GTCTGGCGCG ACCTTCGCGC TGAACGATCT GGGACCGCAG GTGCGCCCGG AGTACACACC CGACGACGAA GTATGGCTCA CAACGCTGAT TCGTGGGTTG GAACGCGATG GACTGGTAGT GTGGACCGAG ACGGGGGTAC GGTTGCCGGA ATAG
|
Protein sequence | MTIHRSIDQN HEHSLSLNDL HQALLKWFSE AARDLPWRRT RDPYRILVAE VMLQQTQVDR VLPKYAAFLE RFPTLHTLAE APTAEVIRMW AGLGYNRRAV NLQRAARAIC ARYGGVFPRD VATLVTLPGI GSYTAGAVAC FAFEQDVAFM DTNIRRVIRR VFTDPTETVN ERALLALARA ALPVGRSWMW NQALMELGSL VCTADAPACW RCPLRDQCRD YAARRESDER FASAPVRKRL AERRERPFIG SNRYFRGRII EALRMLPSGA TFALNDLGPQ VRPEYTPDDE VWLTTLIRGL ERDGLVVWTE TGVRLPE
|
| |