Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3600 |
Symbol | |
ID | 3904154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4302169 |
End bp | 4303140 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637880921 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_482681 |
Protein GI | 86742281 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGAAC TCCCCGAAGT AGAGGTTGTT CGCCGCGGTC TGGAACGTGG TGTCGTCGGT CGCGTCATCG CATCGGTTGA CGTCCACCAT CCTCGTGCGG TGCGCCGTCA TCTGGCCGGT GCCGCCGACT TCTCGGCGCT GCTCGTCGGC CGCCGGATAA CGGCGGCACG ACGCCGCGGC AAGTATCTGT GGCTGGTACT TCAGCCGCCG GTAGACCATG CGGCGTGCGC TCCGGTGGTT CCAGAAGAAC CACCGGAGGA GGAATCAGCT GCCGTGCTGG CCGAGATGTC TCCGCCCGCC CTGCCCCCAG GTCATCCGGC TCAGGGGGAT GCACTGATCG CCCATCTCGG GATGAGCGGA CAGTTGCTCG TCGTTCCACC CGCCACCCCC GACCAGAAGC ATCTGCGGAT CCGGTTCGTC TTCACGGACG GAGGACGCGA ACTGCGATTT GTGGATCAAC GCACCTTCGG TGGCCTGGCG GTCGCAACGG GGGAAGCGGA TCTGCCTGCT CCCGTCGCGC ACATCGCCAG GGATCCTCTG GATCCCGCCT TCGACGAGCG GCTCGTCACG GAGAGGATGC GTCGACGCCG TACCGGCGTG AAGCGGGCCC TGCTCGATCA GACACTTGTC AGCGGGGTCG GGAACATCTA CGCGGATGAG GCCCTGTGGG CCGCGAAGCT GCACTACGCG CGGCCGACCG AGACCCTCAC CCGGGCGGAG GTCGGCAGGC TGCTGGGTTG CGTGCGGACG GTGATGATCG CGGCTCTTGA GGTGGGCGGT ACGTCCTTCG ACCGCCTGTA CGTGTCGGCG GACGGAGTCA GCGGGCTGTT CGAACGGTCG CTACAGGTGT ATGGCCGCGC GGGTCGGCCG TGCACACGAT GCGGAGACGC GGTGCGCCGG GACGCTTTCA TGAACCGGTC GAGCTTCACT TGCCCCACCT GCCAGCCGCA CCCAAGACGG GCCAGGTGGT AA
|
Protein sequence | MPELPEVEVV RRGLERGVVG RVIASVDVHH PRAVRRHLAG AADFSALLVG RRITAARRRG KYLWLVLQPP VDHAACAPVV PEEPPEEESA AVLAEMSPPA LPPGHPAQGD ALIAHLGMSG QLLVVPPATP DQKHLRIRFV FTDGGRELRF VDQRTFGGLA VATGEADLPA PVAHIARDPL DPAFDERLVT ERMRRRRTGV KRALLDQTLV SGVGNIYADE ALWAAKLHYA RPTETLTRAE VGRLLGCVRT VMIAALEVGG TSFDRLYVSA DGVSGLFERS LQVYGRAGRP CTRCGDAVRR DAFMNRSSFT CPTCQPHPRR ARW
|
| |