Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6055 |
Symbol | |
ID | 5674376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7367721 |
End bp | 7368614 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244903 |
Product | uracil-DNA glycosylase superfamily protein |
Protein accession | YP_001510305 |
Protein GI | 158317797 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3663] G:T/U mismatch-specific DNA glycosylase |
TIGRFAM ID | [TIGR00584] mismatch-specific thymine-DNA glycosylate (mug) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.245344 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCA CCAGACCGCC CAGCCAGGGC ACCGCGCCGG ACCAGCCGTC CCCGGCCGAC CAGACACCAG AAGACACGCT GCCGCAGACA CTCACCCTGT CGACGGGCCC CACGCTGCCA TCAGAAGCTG CGACGCCGGG AATGGAAGCC GCGCCGGCGC AAGAAGCCGC CCCCGCGCAG GAAGCCACGG ACAGGCCCGT AGCCGCCCCG GCACCGGAGG CCGCCGACAC GCCTGTAGCC GGGCCGGCGG CGCTGAGGTC ACCGAAGCCG CCGCGGCGGC CCCGACCGGA CCGCGCGGAG CTTCTCGCGG CCTACGGGAA GACCGTTCCG GACCTCGTCG GTCCGGAGAC CCGGGTGCTG CTGTGCGGTA TCAATCCGTC CCTGGAGTCC GGCGCCACCG GATTCCATTT CGGGACGCCC AGCAATCGGC TCTGGCCGGT CCTGCACTTC GCCGGGTTCA CCGGGCGCCG GCTGCATCCG TCCGAGACCG AGCACCTACG CGCCCGGGGC ATCGGAATCA CGAATCTGGT GCACCGCTCG ACCGCTCGCG CCGATGAGAT CGCTGACGAC GAAATCAGGG CCGGTGTGCC GGTACTCATC GAGCTTGTCG AACGGATCCG CCCGGAATGG GTCGCCTTTC TCGGGCTCGC CGCGTACCGC ATCGGCTTCG GGCGGCGGAC GGCGAAGGTC GGTCGACAGC CGGAGCGCAT CGGTCCCGCC GGGGTGTGGC TGCTACCGAA CCCCAGTGGG CTGAACGCGC ACTACCAGCT ACCCGACCTT GTCCGGGTCT ACGGCGAACT GCGCGAGGCC GCCTTCGGGC CTGTCACGGC CACCACGCCG GCAGCGGGCC CGACCGCGGG TCCCGGACTC AGGTCCGGGC ACGGCGGCGG CTGA
|
Protein sequence | MASTRPPSQG TAPDQPSPAD QTPEDTLPQT LTLSTGPTLP SEAATPGMEA APAQEAAPAQ EATDRPVAAP APEAADTPVA GPAALRSPKP PRRPRPDRAE LLAAYGKTVP DLVGPETRVL LCGINPSLES GATGFHFGTP SNRLWPVLHF AGFTGRRLHP SETEHLRARG IGITNLVHRS TARADEIADD EIRAGVPVLI ELVERIRPEW VAFLGLAAYR IGFGRRTAKV GRQPERIGPA GVWLLPNPSG LNAHYQLPDL VRVYGELREA AFGPVTATTP AAGPTAGPGL RSGHGGG
|
| |