Gene Information |
Locus tag | Francci3_0685 |
Symbol | deoA |
ID | 3905273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 783059 |
End bp | 784363 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878018 |
Product | thymidine phosphorylase |
Protein accession | YP_479798 |
Protein GI | 86739398 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
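The length fields above follow from the coordinates. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions are consistent with the values in this record, but are not stated by it). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 783059
END_BP = 784363


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 1305 bp; 434 aa
```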
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0366488 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGAA GCTTCGACGT CGTCGATCTG ATCCGGGCCA AGCGGGACGG TCGACCCGTC GATCCCCGGG CCGTCGACTG GCTCGTCGAC GCCTACACCC GGGGTCTGGT TGCCGACGAG CAGATGTCGG CGTACCTGAT GGCGGTGGTG TGGCGCGGGA TGACTCCCGC CGAGCTGGAC CGCTGGACCG CCGCAATGAT CGACAGTGGG GAGCGCCTGG ATCTGACCGG TGTGGGACGT CCCACCGTCG ACAAACACTC CACCGGAGGG GTCGGCGACA AGGTCTCGCT CGTGCTGGTG CCGCTGGTCG CCGCCTGTGG GGCCGCGGTC CCACAGCTCG CCGGGCGGGG GCTCGGTCAC ACCGGTGGAA CGCTGGACAA GATGGAGGCG ATCCCCGGCT GGCGGGCGGA TCTGTCGGCC GCCCGGATGC GTGAGCTCCT CGCCGAGGTG GGTGCGGTGA TCGCCGCCGC GGGTGCCGGC CTCGCCCCGG CCGACCGCCG GCTCTACGCG CTGCGGGACG TCACCGGAAC CGTCGAGTCG ATCCCGCTCA TCGCATCGTC GATCATGAGC AAGAAGATCG CCGAGGGCAC GTCGGCGCTG GTCCTGGACG TCAAGGTCGG CTCCGGGGCC TTCATGACGT CGGTGGATGA GGCCCGTGAG CTCGCCCGGA CGATGGTTCG GATCGGCGTT GCCGCCGGGG TCCGCACCGA GGCCCTGCTG ACCGGGATGG ACCATCCCCT CGGCCGGACC GCCGGGCATG CGCTGGAGGT GGCCGAGGCC GTGGAGACCC TCCGTGGTGG TGGGCCGGCG GATCTGGTGG AGGTCACCGT CGCGCTGGCC AGGGTGATGA TCGACCTCGT CGCCGCCGAA CTCGGCCACC GGTCCGGTGC TCTTCATGAT CCCGCGCAGG TACTGGCTGC CGGGGACGCT TTTGCGGTGT GGCGGGCGAT GGTCGCGGCC CAGGGCGGCG ATCCGGACGC GCCGCTTCCG GCGGCGAGCC ATGTCGAGAC CGTCCCCGCG CCGGCGACCG GCCATCTCCA CCGCCTGGAC GCCCGAGCGG TCGGCCTGGC GGCCTGGCGG CTTGGCGCGG GCCGGGTCCG CAAGGAGGAC GCGGTCTCCG CGACGGCGGG CGTGCGGTGG CGGGTGGGAA TCGGTGACCC GGTCACGGCC GGCGAGCCGT TGCTGGAACT GCACACCGAT GACCCGGCCA GCGTCGAGCG GGCCCGGGAG GCCCTGGCGG GAGCGGTGGA GGTCGCCGCG ACGCCGCCCC CGAGCACACC GCTTGTCCTC GATCACATCA GCTGA
|
Protein sequence | MSGSFDVVDL IRAKRDGRPV DPRAVDWLVD AYTRGLVADE QMSAYLMAVV WRGMTPAELD RWTAAMIDSG ERLDLTGVGR PTVDKHSTGG VGDKVSLVLV PLVAACGAAV PQLAGRGLGH TGGTLDKMEA IPGWRADLSA ARMRELLAEV GAVIAAAGAG LAPADRRLYA LRDVTGTVES IPLIASSIMS KKIAEGTSAL VLDVKVGSGA FMTSVDEARE LARTMVRIGV AAGVRTEALL TGMDHPLGRT AGHALEVAEA VETLRGGGPA DLVEVTVALA RVMIDLVAAE LGHRSGALHD PAQVLAAGDA FAVWRAMVAA QGGDPDAPLP AASHVETVPA PATGHLHRLD ARAVGLAAWR LGAGRVRKED AVSATAGVRW RVGIGDPVTA GEPLLELHTD DPASVERARE ALAGAVEVAA TPPPSTPLVL DHIS
|
| |
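The GC content and protein sequence can be rechecked from the gene sequence. The sketch below is illustrative only: the helper names are hypothetical, and it assumes the sequence is pasted as shown above (blocks of ten bases separated by spaces). Translation table 11 uses the same codon assignments as the standard code but permits alternative start codons such as GTG, so the first codon is translated as M here.

```python
BASES = "TCAG"
# Standard codon assignments in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS; the initiator codon becomes M, the stop is dropped."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break  # stop codon ends translation
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
prefix = "GTGAGCGGAA GCTTCGACGT CGTCGATCTG"
print(translate_cds(prefix))  # MSGSFDVVDL
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way gives values to compare against the Protein sequence and GC content fields.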