Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0713 |
Symbol | |
ID | 3903503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 817471 |
End bp | 818547 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637878046 |
Product | glucose-1-phosphate thymidyltransferase |
Protein accession | YP_479826 |
Protein GI | 86739426 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylylransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.974379 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCCC TCGTCCTTGC TGGTGGTTCG GGAACCCGTC TGCGGCCGAT CACCCACACC TCGGCCAAAC AGCTTGTTCC AGTGGCGAAC AAGCCGGTGC TTTTCTACGG CCTGGAGGCC ATCCGTGACG CCGGTATCAC CGATGTCGGG ATCATCGTGG GGGAGACGGC CGGCGAGATC CAGGCGGCGG TAGGTGACGG CTCGGCATTC GGGATCCAGG TCACCTACAT CCGCCAGGAC GCGCCGCTCG GACTGGCACA TGCCGTGCTC ATCGCCCGCG ACTTCCTCGT CGATGAACCG TTCGTCATGT ACCTCGGCGA CAACCTGATT ATCGGTGGGA TCTCCAGCCT GGTCGAGGAA TTCCGACGGA CCACTCCGGA CGCCCTGATC CTGCTGACCA GGGTCGACAA CCCCTCCGCC TTCGGTGTCG CCGAGCTCGG TGCGGATCGG CAGATCATTC GACTGGTCGA GAAGCCGCTC GTTCCGCCGA GTGACCTGGC GCTCGTCGGC GTCTACATGT TCGGCACCCC CATCCACGAC GCCGTGGGGG CGATCAAACC GTCGGCCCGC GGCGAGCTGG AGATCACCGA GGCGATCCAG TGGCTCGTCG ACGGCGGATA CGAGGTCGCC TCCCACCTGG TCGAGGGTTA CTGGAAGGAC ACCGGCCGAC TCGACGACAT GCTCGAGACG AACCGCCATC TCCTCGAGTC GATCGAGCCG GCGATCCGTG GCAGCGTTGA CGAGCACAGC ACGATCGTTG GCCGTGTGGT GATCGAGGAG GGCGCCTCGC TGGTCCGGTC CACCGTGCGC GGGCCGGCCA TCATCGGCCG GGACACCACC CTCGTCGATA CCTATGTCGG CCCGTTCACC TCGATCTTTC ACTCCTGCGT CATCGAACGG ACCGAGATCG AGTACTCGAT TGTGCTGGAG CGGGCGACGA TCCGTGGGAT CGGGCGGATC GAGGACTCGC TGATCGGCCG GGATGTCGAG GTCGTCCCCT CCGCGGCCCT GCCCAGGGCG CATCGGCTCA TGCTGGGCGA CCACTCCCGC GTCTCGGTGG CTACGACGGG GACCTGA
|
Protein sequence | MKALVLAGGS GTRLRPITHT SAKQLVPVAN KPVLFYGLEA IRDAGITDVG IIVGETAGEI QAAVGDGSAF GIQVTYIRQD APLGLAHAVL IARDFLVDEP FVMYLGDNLI IGGISSLVEE FRRTTPDALI LLTRVDNPSA FGVAELGADR QIIRLVEKPL VPPSDLALVG VYMFGTPIHD AVGAIKPSAR GELEITEAIQ WLVDGGYEVA SHLVEGYWKD TGRLDDMLET NRHLLESIEP AIRGSVDEHS TIVGRVVIEE GASLVRSTVR GPAIIGRDTT LVDTYVGPFT SIFHSCVIER TEIEYSIVLE RATIRGIGRI EDSLIGRDVE VVPSAALPRA HRLMLGDHSR VSVATTGT
|
| |