Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4306 |
Symbol | |
ID | 3907274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5142635 |
End bp | 5144827 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881633 |
Product | thymidylate kinase |
Protein accession | YP_483381 |
Protein GI | 86742981 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0125] Thymidylate kinase |
TIGRFAM ID | [TIGR00041] thymidylate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCACCG ACCATGCAGC CGGATCTCGG GACGCCGGGC GGCGCCCACC CTGGCCCGTC CGGCGACCGC GGGTCCGCCG CTCGCCGAGC AGGCATCCGG TGACGACGGT TCCCTCGCCG CCAATGCCGT CCACCATCGC CGCCGGAGGC GACAGCGACA TCCGGGCCGT CCTGCGGATC CCCGCGTTCC GCCGGATGTG GATTCAGCTC AGCCTGTCCA GCCTGGGTGA CTGGATGGGT CTGCTGGCAA CCACCGCACT GGTGACCCAG CTCCAGGAGA GCTTCTCCGG GCAGGCGTAC GCCATCGGCT CGCTGCTGAT CGTCCGGTTG ATGCCGGCCC TGATCCTGGG CCCACTCGCC GGTGCCCTCG CCGACAGGCT GGACCGCCGG CTGACCATGG TCGTGACGGA CGTCATGCGG TTCGGGCTGT TCATGTCGAT CCCGATCATC GGGACGCTCA AGTGGCTGCT CATCGCATCG TTCCTGGTGG AGGCGGTCAC CCTGGTGTGG GCGCCGGCGA AGGACGCGAG CGTCCCGCAC CTGGTGCCCA AGGAGCGGCT CGCGGCCGCC AACACCCTCA GCCTGATCAT GACCTACGGC ACGGCCCCGA TCGCCGCGAC GATCTTCACG ATGCTTGCGA CGATCTCACG GACGCTCGGC TCCAGCGTGT CGTTCTTCCA TGACTCCTCG GTCGACCTCG CGTTGTACTT CAACGCGGCG ACGTTCCTCG GCTCGGCAAT CGTGATCTGG GGACTGAAGG GCATCGGACG GGCCGCGCGG CCGGAGACCG GCCCCGAGCC CGGGTTCTTG GCCTCCATCA CCGAGGGATG GCGCTACGTC GGCCAGGACA AACTGGTCCG CGGACTCGTC GTGGGCATCC TGGGCGGGTT CGCCGGCGCC GGCTGCGTCA TCGCCCTCGG CCGGCTCTAC GTCGAGATCC TGGGCGGCGG CGACTCCGCG TACGGGGTGC TCTTCGGCGC GGTGTTCGTC GGCCTCGCCG CCGGCATGGC CGCCGGTCCC AAGCTGCTCG GCGACTACAG TCGGACCCGC CTGTTCGGCG TCTGCGTGAC CGGGGCGGGT ATCACCCTGG TGATCGTCGC GATCATCCCG AACCTGGTCA TCGCCTGCAT CCTCGTGGTC GCGGTGGGGG CGTTCGCCGG CGTGGCCTGG GTGACCGGAT ACACCCTGCT CCAGGCCGAG GTCTCCGACG AGCTGCGGGG GCGCACCTTC GCCCTGGTGC AGTCGCTGGT CCGCATCGAC CTGCTGCTCG TGCTCGCCGC CGCGCCCGGC CTGGTTGGCC TCATCGGCTC GCACAGGATC CATCTGTGGG GCGACATCAA CGTCCGCGCG GACGGCGTCA CCGCGGTCCT GCTCGCCGGC GGTCTGCTCG CCGTCGCGGT CGGCCTGTTC TCCTATCGGC AGATGGACGA CCCGACCGGA GCCGCGGTCT GGCCCGAGCT GTGGAACGCG CTGCGCGGCC GGCGGCCGGC GGCCACACGT CCCCGCCGCA CCGGCCTGTT CATCGCGATC GAGGGCATCG ATGGCGCCGG CAAGAGCACA CAGGTGGAGC TGCTACGAAC GTGGCTCGCC TCCTCCGGCC GGGAGGTGGT GGTGACCTCC GAATCCGGCG GCACCGCACT GGGAACGGGC CTGCGAGAAC TGCTCCTCGA CCCGACGGTA CGGCTGCACC CCCGGACCGA GGCGCTGCTG ACCGCGGCGG ACCGCGCCGA ACACGTCGCG CAGGTGATCG AACCCGCGCT GGCCCGGGGA GCGATCGTCA TCACCGATCG TTACGTCGAC TCGTTCATCG CCTTCCAGAG CGTGAGCCAG GCCCTCAGCA CCGACGAGCT GTCCGTCCTC ACCCAGTGGG CGACGCACGC GCTGCTTCCC GACGTCACGG TCCTGTTGGA TCTGCCGGCC GAGATCGTCC TGCGCCGCGC GCGGATGACC GCGTATCCTG ATCCGCCGCC GGGGCCGCCG GGGCCGCCGG ACGGCCCGGC CGAGGCGGAC GACGCCGGGA AGTTGACCTT CCAGGGCCGG GTACGCGACA CGTTCCGTCG GCTGGCGCAG AAGGAGCCCG ATCGTTACCT TCCGCTAGAC GCGACGCGGT CACCGGAGGA GATCCACGCC GAGATCCGCT CGTTCATGGC CAGCCGCATC GATCGGCCAG CCGTCCTGCT CGGTGTGCTC TGA
|
Protein sequence | MATDHAAGSR DAGRRPPWPV RRPRVRRSPS RHPVTTVPSP PMPSTIAAGG DSDIRAVLRI PAFRRMWIQL SLSSLGDWMG LLATTALVTQ LQESFSGQAY AIGSLLIVRL MPALILGPLA GALADRLDRR LTMVVTDVMR FGLFMSIPII GTLKWLLIAS FLVEAVTLVW APAKDASVPH LVPKERLAAA NTLSLIMTYG TAPIAATIFT MLATISRTLG SSVSFFHDSS VDLALYFNAA TFLGSAIVIW GLKGIGRAAR PETGPEPGFL ASITEGWRYV GQDKLVRGLV VGILGGFAGA GCVIALGRLY VEILGGGDSA YGVLFGAVFV GLAAGMAAGP KLLGDYSRTR LFGVCVTGAG ITLVIVAIIP NLVIACILVV AVGAFAGVAW VTGYTLLQAE VSDELRGRTF ALVQSLVRID LLLVLAAAPG LVGLIGSHRI HLWGDINVRA DGVTAVLLAG GLLAVAVGLF SYRQMDDPTG AAVWPELWNA LRGRRPAATR PRRTGLFIAI EGIDGAGKST QVELLRTWLA SSGREVVVTS ESGGTALGTG LRELLLDPTV RLHPRTEALL TAADRAEHVA QVIEPALARG AIVITDRYVD SFIAFQSVSQ ALSTDELSVL TQWATHALLP DVTVLLDLPA EIVLRRARMT AYPDPPPGPP GPPDGPAEAD DAGKLTFQGR VRDTFRRLAQ KEPDRYLPLD ATRSPEEIHA EIRSFMASRI DRPAVLLGVL
|
| |