Gene Information |
Locus tag | Hhal_1141 |
Symbol | |
ID | 4710125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1240978 |
End bp | 1242054 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639855614 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_001002719 |
Protein GI | 121997932 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
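The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the gene sequence in this record ends in TGA); the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(1240978, 1242054)
print(length, protein_length_aa(length))
```

With the values from this record, the results match the listed Gene Length (1077 bp) and Protein Length (358 aa).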
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCCCGCT GGGCCACACC GGAGCGCTGC CAGGCACTTC AGGAGCAACT GATCGCCTGG CAGCGGCAGC ACGGTCGAAA CGACCTGCCC TGGCAGCAGC CGGCCACGCC CTACCGGGTG TGGATCTCCG AGATCATGCT GCAGCAGACC CGTGTGGAGA CCGTCGTCCC CTACTTCGAG CGCTTCATGG AACGCTACCC GGACGTCGCC GCCTTGGCCG CGGCCGAGCT GGATGACGTC CTCGCGCTGT GGGCCGGCCT GGGCTACTAC GCCCGCGCCC GCAACCTGCA CGCCGCGGCG CAGCGGATCC AGACGGATTG GGGGGGGCAG CTGCCGGCCG AACTATCGGC GCTGCAGACA CTGCCCGGCA TCGGGCCCTC CACCGCCGGC GCAATCCGCT CGCTGGGCCA CGGCCAGCCG GCCCCGATCC TCGACGGCAA CGTCAAACGG GTGCTGGCGC GGCTCGCCGG CGTCGAGGGC TGGCCCGGAC GCAGCCCGGT GGCCAAGCAG CTATGGGCAC TCTCCGCCGC GCTGACCCCG GAGGCGGAGT GCCGCCGCTT CAACCAGGGC CTGATGGACC TTGGGGCGCT GGTCTGCACG CCGCGGGACC CGGCGTGCAA CGCCTGCCCA CTGGCCGCGT CGTGCACGGC CCGGGCCGCC GGCAACCCGG AGACCTACCC GGCCCCCCGT CCGGCACGTC AGCGCCCTCG GCGCGAGGTC CGCCTGCTGC TGATCGAACA CGCCGATGCC CTGCTGCTGG AGCGCCGCCC CGCCACCGGG ATCTGGGGCG GACTCTGGTC GCTGCCGGAG TGCCCGCCCA GCGAGGACCC GGTGACACGG GCCCTCCGCC TGGGCGCACG CTGCGAACCC GCCGGAGACC TACCGGCCCG CCACCACGCG CTGACCCACT TCGAGCTGAT CATGCAGCCG ACTCGGCTGC GCTGGAACGC GGCGACCCCC GATATCGGCG AACCCGATCC GCAGCGGATT TGGTTCCGGC CGGGGCAGGA TACCCTGCCC GGACTCCCTG CACCGATCCT GCGCATCCTT CGCGACGCCG GCTACCCGGT AGCCTGA
Protein sequence | MSRWATPERC QALQEQLIAW QRQHGRNDLP WQQPATPYRV WISEIMLQQT RVETVVPYFE RFMERYPDVA ALAAAELDDV LALWAGLGYY ARARNLHAAA QRIQTDWGGQ LPAELSALQT LPGIGPSTAG AIRSLGHGQP APILDGNVKR VLARLAGVEG WPGRSPVAKQ LWALSAALTP EAECRRFNQG LMDLGALVCT PRDPACNACP LAASCTARAA GNPETYPAPR PARQRPRREV RLLLIEHADA LLLERRPATG IWGGLWSLPE CPPSEDPVTR ALRLGARCEP AGDLPARHHA LTHFELIMQP TRLRWNAATP DIGEPDPQRI WFRPGQDTLP GLPAPILRIL RDAGYPVA