Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1140 |
Symbol | |
ID | 5669553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1360011 |
End bp | 1360886 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240072 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_001505500 |
Protein GI | 158312992 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.182904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0110455 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGC TGCCCGAGGT CGAGGTCGTC CGCCGGGGCC TCGAGCGCGG GGTGGTCGGC CGCACTGTCG CCGAGGTGGA GGTCCATCAC CTGCGGGCGG TCCGCCGCCA TCTCGCCGGG GCCGACCACT TCGCCGCCTC GCTTGTCGGG CAGACGGTGG CCACGGCGCG CCGGCGCGGC AAGTACCTGT GGCTCGGGCT CACGCCGTCT GAACCGGGCG GCCCGGCGGT CGGTGACGCG CTGCTCGGTC ATCTCGGGAT GAGCGGCCAG CTCCTCGTGG TTCCGGCGGA CAGCCCGGAC CAGGTACACC TGCGGGTCCG TTTCCGGTTC ACCGACGAGG GCCGCGAGCT GCGCTTCGTG GACCAGCGGA CGTTCGGTGG TCTTGCTGTG GTCTCGGGCG GGGCGGAGCT GCCTGCCCCG ATCGCCCACA TCGCGCCCGA CCCGCTCTCC GTCGACTTCG ACCCGGAGCG TTTCGCCGAC GCGCTGCGCC GCCGGCGCAC CGGCCTCAAG CGCGCCCTGC TCGACCAGAC GCTGATCAGC GGCGTGGGCA ACATCTACGC GGACGAGGGC TTGTGGGCCG CGCGCCTGCA TTACGCCCGC CCGACCGAGA CGGTGACCCG CGCCGAGGCG CTGCGGCTGC TTGACGCGGT CCGCACGGTG ATGACGGCGG CGCTGGCCGC CGGCGGTACC TCCTTCGACC GGCTCTACGT CTCGACCGAG GGCGTCAGCG GGCTGTTCGA ACGCTCGCTG GAGGTCTACG GACGCGGCGG CCAGGCGTGC TCCCGCTGCG CCTCGACCAT CCGCCGGGAC GCGTTCATGA ACCGCTCGAG CTTCAGCTGC CCGGCGTGCC AGCCCCGGCC CCGCCGGGTC CGGTGA
|
Protein sequence | MPELPEVEVV RRGLERGVVG RTVAEVEVHH LRAVRRHLAG ADHFAASLVG QTVATARRRG KYLWLGLTPS EPGGPAVGDA LLGHLGMSGQ LLVVPADSPD QVHLRVRFRF TDEGRELRFV DQRTFGGLAV VSGGAELPAP IAHIAPDPLS VDFDPERFAD ALRRRRTGLK RALLDQTLIS GVGNIYADEG LWAARLHYAR PTETVTRAEA LRLLDAVRTV MTAALAAGGT SFDRLYVSTE GVSGLFERSL EVYGRGGQAC SRCASTIRRD AFMNRSSFSC PACQPRPRRV R
|
| |