Gene RoseRS_3189 details


Gene Information

Locus tag: RoseRS_3189
Symbol:
ID: 5210160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4013187
End bp: 4014143
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 60%
IMG OID: 640596781
Product: HhH-GPD family protein
Protein accession: YP_001277500
Protein GI: 148657295
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
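
The coordinate and length fields above can be cross-checked against each other with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (which is consistent with the reported 957 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4013187, 4014143)
print(length)                     # 957
print(protein_length_aa(length))  # 318
```

Both values agree with the Gene Length and Protein Length fields of this record.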


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATAC GTAAGAGTAT CATCCAAAAG AACGAAAGTT TACCGGCGTT TACCAGGTTC 
CATCAGGCGC TGATGAACTG GTTCAGTGAG GCGGCACGCG ACCTCCCCTG GCGCCGCACC
CGCGATCCAT ACCGCATTAT GGTTGCAGAG GTGATGCTCC AGCAAACACA GGTTGATCGC
GTGTTGCCGA AGTACGAAGC GTTCCTTACA TGCTTCCCGA CGCTTCAGGC GCTGGCAGAC
GCACCGACCG CAGAGGTCAT CCGTCTGTGG TCGGGGCTTG GCTACAATCG CCGGGCGGTC
AATCTGCAAC GCGCAGCACG TGAAATCGTC GAACGCTTCG ACGGCGTTTT TCCGCGCGAT
GTCGCTGTGC TGCTGACGCT TCCGGGCATC GGACCCTACA CCGCTGGTGC TATCGCCTGT
TTTGCCTTCG AGCAGGATGT GGCATTCATG GACACCAACA TCCGGCGCGT TATTCGCCGC
GCATTGACCG ATCCTGCGGC AACGGTCAAC GAACGAGATT TGCTGGCGCT GGCGCAGGCA
GCGCTCCCAA CCGGGCGCAG CTGGATGTGG AACCAGGCGT TGATGGAACT GGGGTCGCTG
ATCTGCACTG CCGACTCGCC AGCATGCTGG CGCTGTCCAC TGCGCGATCT GTGCTGCGAC
TATGCCGCGC GCCGCACGTC GGACGGGCAT CTTGAAGCGA CGCCGGTGCG CAAACGCATT
GCTGAACATC GTGAACGCCC GTTCGTCGGA TCGAATCGCT ACTTCCGCGG ACGTGCTGTT
GCCGCGCTCC GCGCATTACC CCCCGGCACA ACCCTTGACC TGGCAGAACT TGGACCACAA
GTGCGCCCCG ATTATACCCC GGAAGATGAA GCCTGGCTGG TGACCCTCCT CAACGGATTG
GAGCGCGATG GATTAGTCGT GTGGCATGGC AATGGGGTAC GACTTCCGGA GGAATGA
 
Protein sequence
MTIRKSIIQK NESLPAFTRF HQALMNWFSE AARDLPWRRT RDPYRIMVAE VMLQQTQVDR 
VLPKYEAFLT CFPTLQALAD APTAEVIRLW SGLGYNRRAV NLQRAAREIV ERFDGVFPRD
VAVLLTLPGI GPYTAGAIAC FAFEQDVAFM DTNIRRVIRR ALTDPAATVN ERDLLALAQA
ALPTGRSWMW NQALMELGSL ICTADSPACW RCPLRDLCCD YAARRTSDGH LEATPVRKRI
AEHRERPFVG SNRYFRGRAV AALRALPPGT TLDLAELGPQ VRPDYTPEDE AWLVTLLNGL
ERDGLVVWHG NGVRLPEE
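
The protein sequence can be spot-checked against the gene sequence by translating it, and the GC content can be recomputed from the nucleotides. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base; a trailing partial codon is ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks and the last three codons of the gene sequence above.
print(translate("ATGACAATAC GTAAGAGTAT CATCCAAAAG"))  # MTIRKSIIQK
print(translate("GAG GAA TGA"))                       # EE*
```

The first call reproduces the first ten residues of the protein sequence, and the second shows the two final residues followed by the stop codon. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field of this record.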