Gene Information |
Locus tag | Franean1_0357 |
Symbol | |
ID | 5668781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 425875 |
End bp | 426786 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641239289 |
Product | HhH-GPD family protein |
Protein accession | YP_001504729 |
Protein GI | 158312221 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
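The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(425875, 426786)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values in this record, the two functions give 912 bp and 303 aa, matching the Gene Length and Protein Length rows.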
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.347637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0136889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGGCGGTG TGCCGAGGCT TGAGACGATT CCCGCGATGA CATCCGTGCG AATGGCTGAC GAGGTGCTGG AGTGGTTCGC GGTCTGTGCC CGCGACCTGC CCTGGCGGCG GCCCGAGACG ACGCCGTGGG GCGTCCTCGT CAGCGAGATC ATGTTGCAGC AGACGCCGGT CAACCGGGTC CTGCCGGTCT GGGCGGAGTG GCTCAGCCGC TGGCCGGCAC CGGCCGACCT CGCCGCGGAG CCCTCCGGTG AGGCCGTGCG GGCGTGGGGG CGCCTCGGCT ACCCTCGCCG GGCTCTGCGA CTGCACCAGG CCGCCACCGC GATGCTGGAA CGGCACGGCG GCGCCGTCCC GGACGAGCTT GACGACCTGC TGGCACTGCC CGGTGTCGGG AGCTACACGG CCAGGGCCGT GGCGGCGTTC GCCTTCCGGC AGCGCCACGC CGTGATCGAC GTGAACGTCC GCCGCCTGGT CGCCCGGGCT GTCGAGGGGG TCGCCGAGGG GCCGACGTCG GTCTCCCGGC GCGACCTCGC GCTGGTCGCC GACCTGCTGC CAGCGGACCC GGAGACCGCG GCGCGGGCGA GCGCGGCGTT CATGGAGCTG GGCGCGCTGG TCTGCGTGGC CCGGGCGCCC CGCTGCGCCG CGTGCCCCGT CCGGGACAGG TGCGCGTGGC TGGCGGTCGG CAGCCCGCCC TCCGAGGGCC CGGCCCGGCG CCCGCAGGGC TACGCCGGGA CCGACCGCCA GGTGCGCGGC CGCCTGCTGG CCGTGCTCCG GGAGGCGACC GGGCCGGTCG AGCAGGCCGA CCTCGACGCG GTGTGGGACG AGCCCGTCCA GCGCGACCGG GCGCTCGCCG GGCTCCTCAC GGACGGCCTG GTCAGCCGGA CGGCGCCGGG CGTCTACGCC CTTCCACGCT GA
Protein sequence | MGGVPRLETI PAMTSVRMAD EVLEWFAVCA RDLPWRRPET TPWGVLVSEI MLQQTPVNRV LPVWAEWLSR WPAPADLAAE PSGEAVRAWG RLGYPRRALR LHQAATAMLE RHGGAVPDEL DDLLALPGVG SYTARAVAAF AFRQRHAVID VNVRRLVARA VEGVAEGPTS VSRRDLALVA DLLPADPETA ARASAAFMEL GALVCVARAP RCAACPVRDR CAWLAVGSPP SEGPARRPQG YAGTDRQVRG RLLAVLREAT GPVEQADLDA VWDEPVQRDR ALAGLLTDGL VSRTAPGVYA LPR
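The relationship between the gene sequence and the protein sequence can be sketched with translation table 11. The listed gene sequence begins with GTG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The code below is a minimal illustration, not the database's own pipeline: the names `translate_table11`, `AMINO_ACIDS` and `START_CODONS` are hypothetical, the codon string follows the usual NCBI TCAG ordering, and the only table 11 feature modeled is that a recognized initiator codon in the first position is read as M.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(seq: str) -> str:
    """Translate a coding-strand DNA sequence; spaces are ignored, stop ends translation."""
    dna = seq.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = 16 * BASES.index(codon[0]) + 4 * BASES.index(codon[1]) + BASES.index(codon[2])
        aa = AMINO_ACIDS[index]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_table11("GTGGGCGGTG TGCCGAGGCT TGAGACGATT"))
```

For the first 30 bases this gives MGGVPRLETI, which matches the first ten residues of the protein sequence. Ambiguity codes such as N are not handled in this sketch.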