Gene Information |
Locus tag | Franean1_5934 |
Symbol | deoA |
ID | 5674255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7207246 |
End bp | 7208580 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244782 |
Product | thymidine phosphorylase |
Protein accession | YP_001510184 |
Protein GI | 158317676 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
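The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(7207246, 7208580)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Under those assumptions the computed values agree with the Gene Length (1335 bp) and Protein Length (444 aa) fields.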
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0130951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCTG ATGCCCCCGG CAGCGATCAC GGCGGTCCTG CGGCGTCCCA CGACGTCGTC GACCTCATCC GGGCCAAGCG CGACGGGGCG GCGCTGCCGG CGGACGCGGT CGCCTGGCTC ATCGACGCCT ACACCCACGG CCGGGTCGCC GACGAGCAGA TGTCCGCCTA CCTGATGGCC GTGGTCTGGC GCGGCATGGC CTCCGACGAA CTGGATCACT GGACGTCGGC GATGATCGCC AGCGGCGAAC GGCTGGACCT GTCCGGCCTG ACCCGCCCGA CGGTCGACAA GCATTCCACC GGCGGGGTCG GGGACAAGGT CTCGCTGGTC CTCGCCCCGC TGGTGGCCGC GTGCGGAGCG GCCGTCCCGC AGCTGTCCGG ACGTGGGCTC GGGCACACCG GCGGGACGCT TGACAAAATG GAGGCGATCC CCGGCTGGCG CGCCGATCTC GACCCGGGCA CCATGCGTGC CGTGCTGGCG GACGTCGGCG CCGTCATCTG CGCCGCCGGC CCCGGCCTGG CTCCCGCGGA CCGCAGGCTG TACGCCCTGC GCGACGTCAC CGGCACGGTC GAGTCCATCC CGCTGATCGC CTCCTCGATC ATGAGCAAGA AGATCGCGGA GGGGACGTCC GCGCTGGTCC TGGACGTCAA GGTCGGCGCC GGCGCCTTCA TGACCTCACT CGCCGACGCC CGCGAGCTCG CGCGGACGAT GGTCGGCCTG GGCGCCCGCG CCGGGGTGCG CACCGAGGCC CTGCTCACCG CGATGGACAC CCCGCTGGGC CGCACCGCGG GCAACGGCCC CGAGGTGACC GAGGCGGTCG AGACCCTGCG CGGGGCGGGC CCGTCGGACC TCGTCGAGGT GACCGTCGCC CTCGCCCGCG TCATGCTGGA CATCGTCGGC CTCAGCGGTG GTTCCGGTGC CGCGCCGGAT CCGGCGGAGG TCCTGGCCTC CGGGGCGGCA TACGACGTGT GGCGCGCGAT GGTCGCCGCC CAGGGCGGCG ATCCGGACGC CCCGCTGCCG ACCGCCGCGT TCACCCGCAC TGTCTCCGCT CCGGCGGACG GCTACCTGAG CCGCCTCGAC GCCCGCGCCC TGGGCATCGC CGCCTGGCGC CTGGGCGCGG GGCGCGCGCG GAAGGAAGAT CCCGTCTCGC CGGCAGCTGG ACTGCGTTGG CTGGCGGCGG TGGGGGAGCA GGTCCAGGCC GGCGCCCCCC TGATCGAGCT CTACTCCGAC GACGAGGCGA CCTTCCCCCG CGCGCTCTCC GCACTCGCGG ACGCCGTCTC GGTCACCGAC GAGCCGCCCC CTCCCACCCC CCTGATCCTC GACCACATCC GCTGA
|
Protein sequence | MSADAPGSDH GGPAASHDVV DLIRAKRDGA ALPADAVAWL IDAYTHGRVA DEQMSAYLMA VVWRGMASDE LDHWTSAMIA SGERLDLSGL TRPTVDKHST GGVGDKVSLV LAPLVAACGA AVPQLSGRGL GHTGGTLDKM EAIPGWRADL DPGTMRAVLA DVGAVICAAG PGLAPADRRL YALRDVTGTV ESIPLIASSI MSKKIAEGTS ALVLDVKVGA GAFMTSLADA RELARTMVGL GARAGVRTEA LLTAMDTPLG RTAGNGPEVT EAVETLRGAG PSDLVEVTVA LARVMLDIVG LSGGSGAAPD PAEVLASGAA YDVWRAMVAA QGGDPDAPLP TAAFTRTVSA PADGYLSRLD ARALGIAAWR LGAGRARKED PVSPAAGLRW LAAVGEQVQA GAPLIELYSD DEATFPRALS ALADAVSVTD EPPPPTPLIL DHIR
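The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might do that and recompute the GC fraction; the helper names `strip_blocks` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
demo = "ATGAGCGCTG ATGCCCCCGG"
print(len(strip_blocks(demo)))      # 20
print(round(gc_fraction(demo), 2))  # 0.7
```

Passing the full Gene sequence field to `gc_fraction` would be the way to compare against the reported GC content of 75%; that comparison has not been run here.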