Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0747 |
Symbol | |
ID | 3905815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 864921 |
End bp | 870062 |
Gene Length | 5142 bp |
Protein Length | 1713 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637878080 |
Product | glycosyltransferases-like |
Protein accession | YP_479860 |
Protein GI | 86739460 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCG ATCCGGGCAG GTCACAGGCG GCCACGGTCG CCGCGGTGAC CGTGGCCGAT GACGCGGGGC CGCCGGTCGT CGCGGTCTGC GTGCTCGGTC GAGGGGGTGG TGGACCCGAC GTTGCCGCCC AGGTCCGGGC CGCCCTGCGT GCCCAGACCA GACCGCCGGA CCGGGTCGTC GAGGTGCGGC TCGGTGGCGG GGGATGCGTT CCCCGGCAAC GCGCGTCTCC CCTGGACGGG GAGGGTCGCC TCCCGTCCGC CGACCATCCC GTCGACCCGG GCACCGACGG CGCGGATGCT CAGGCCGGTG ACCCGGTCGC CGAGCCCCTC GTCGTAGAGG TGCCGTCCGC CACACCGTTC GGCGCCGCCG TAGCCGTGGG GCTGGCTCGG CTGGGGAACC CACCGGAGCA CGGATTCGTC TGGCTCCTCG ACGACACCGC GATCCCCACC CCGGGCGCGT TGCACACCCT GCTGGCCTAC GCCCGGCTGG ACCGGGGCGC CGCCGTCCTC GGCCCCAAAC TGCTCGCCCG CGCTCCGGAA TCCGCCCCAC CCGCCGAATT CGTCCCACCC GCCGGAGCGG GCGGTTCCGC TGGACCGGCA GGTGCGCGGT CCCGGCCGGT GACGAGACCC CGGGTGGTTG AAGCCGGGGT GAGCGTCGAC CGGGCCGGGC GACGGCATCC CGGAATTCGC CCGGACTATG ATGATCATGG ACAGTGTGAC GGGGTACGTG ACGTCCTCGC CGTCGCGTGT TCGGGCATGC TCGTGCGGGC GGTGGTCTGG CGACTGCTGG GTGGACTCGA CCCCGAGCTC GATGGTGGTC ATGACATCGA TCTCGGCTGG CGGGTGGCCC GTGCCGGCCT ACGGGTCGTC GTCGTCCCGT CGGCCACGGT GACCGTGACC TCGGCCTCAG CCTTTTCCGG GGGGCCGGCC TTCCCCAGGG CATCGGCCGA CCGCGCGGGG GTTCTACGGG TCCGGCTGGC GAACACGGCG ACGGTGCTGT TTCCGGCGAT GGTGGTCGCC CTGCTCGTCG GAACGGTCGT GCGCACCCTC GGCCTGCTCC TGCGCGGCCG ACGCCGGGCG GCGCTCGCCG AGCTCACGCT CGCCGGTACC GTGCTGGGGC GGCCATGGCA GCTGCGCCGG ATGCGTCGGC GCGCCGCCCG CACGTGGGCG GTGCCGCATC GGGCCGTCCG CCCGTTGTTC CCGGCCCGCC TGCCGTGGCG GCGCGGAGCC GGCGCCGATG ATCTCCTTGA TCGGGCCGGA GCCGCCGAAG ACGTCGCCGA CACCACCTTG ATGGTCACCG GGCCCAGACC GGATGTGCAC CCGCGCCGGC CGGCCTGGCC GCCGACGACC CTGGCGGCCC TGCTGGGTGC CGTCGGGCTC CTGGCCATGC GGGGGATGCC GGCGGACATG CTCGGCGGGG GCCTGCCGCT CCCGGCAGGC AGCCGGAACC TGTGGTCGGC CGTGTGGTCT GGATGGGCGG GGGCGCCCGA TGGACCTCTG AGTGGTCCTA GGCCGGCACC CCCTTTCACC GCGCTGCTCG CCGCGGCAGC CACTCTGCTG GGGGGGAGGC CGGCGCCGGC CGCTTGGATC CTGCTGGCGG CCGGCCCGGC GCTCGCGGGC CTGAGCGTCT ACCGGGCCCT CGCCCGGCTG GGACCACATC CGATCGCCCG GTTCGGGCTC GCCGCCGGCT ACGGGTTGAA CCCGGTGATG ACCGGGTCGG TCCTCGCCGG CCGCACCGAC ACCGTGGGCG CGGTGATCGG CCTTCCCCTG GTGCTCGCCG CCGCCGACGC GGTCCTGCGG GACCGGCGGG ACGGCTCCGG CTTTGGCGTC GGTGGGGCGC CGGGCCGGCC GGTGTGGACG CTCGCGATCT GTCTGGTCGG GTTCATCGCC TGCGCCCCGT CGACGCTGGC CATCGCCTGG GCGGGCCTGG TCGTCGTCGC GTTCCTGACC GCGCGGTCCA GGCTGCCCGG TATGATCGTC GCTCTGCTCG CGACGATCAT ACCGGTGCTT CCGGACCTGC GGGTGGATCC CGTCGCTGTC CTCACCGAAT CGGCGATCAC CCTGGCCCGG TTGCCCCAGT ACTCCCTGGC CCTGCTGGCC GGTGCGGACA GCGGACCCAG GGCCGATGTG GCCGGCTTCG TCGGTGGATG TCTGGCCTGC TGGTTGTTCG GCCGCCTCGC GGGCCGGGAG CGGGGCCGCC GGCGGCGCAC GGCCGTGCTG GTGGTCCTTC CCGTGACGCT GCTCCTCCTC GTCGGGCCGC TGATCCTTGC CGCCCGCTTC CTGGCCGTGG ACGTGGCCCG GCCCGGCGAC ACCACAGGGG CGAACCCGGC CGGCGTCGAG GGGTCCGCCC TCGCCCACGC GGTCGCGGCC GTGACCCGCG CCGCCGGGCC GGGTGCCCGC GTCCTCGTCC TGCACCAACC GGCCTCGGCC AGAACGATCC GGTACACCCT GGCCTCCGCG TCCGGGCCGA CGTTTCCCGA GGCCGGCGCC CGCACCCCGC GGCGGTCCGG CCGGTTCCTC GCCGACATGG TCGCGGACCT CGCCACCAAC GGTGGCCGGG CAGCCGGCTG GCTCCCGCTT CTGGGCGTGA CATCGGTCGC TGTACCTACG GTCGACTCGG CCTCTCAGCT CGTCGCCCGC CTGGACGCGA CGCCGTCCTT GATCCGGGAT CGTCCCCGCC CGGGGGCGAT CCTCTGGCGC CCCGCCGTTG CGGCCCGTGG TGCGACGGCC TTGGGAGAGC CCGGTGGGGC CGGCGCGCCG CGACCGGACG CATCCGGGTG GGCGGGCGGA CCCCTGGCCT GGGTGATGGA GTCCGATCGG GCCGATCGTC GTGAGCCGGT GGCGCGGCAC TCCGCTCCCG CGGCGACGAG GCCGACAACC CGCCCGATCG GTACGGCGCC GGGTGTTCCT CCCCTCCTCT CGGCGCCGCT GCCGCCAGCG CCCTCCTCGG TACGTCGGTT GGCCTCCGGC GAGCGGTTGC CCGCCCTCGC AGCCCGGTCC GGCCCGGCAC GCTGCCTGGT GCTCGCCGTG CCGGCGGATG CCGGATGGCA CGCGTGGCTG GAAGGCCGGC CGCTGGCTCC GTTCCGGGCG TGGGGCTGGG CAGCCGGATT CCAACTGCCG CCCGACGGCG GTCGGCTCCG CATCGACCAC GACCGGACTT CACCCCACCG GGCAGTGGTG GGCCAGGCTG CGGCACTCGC CGCGCTCGTC CTGCTGGGAA TCCCGGTCAT GCTGGGAATC CCGGTCATGC GGTCGCGGCG GATGGCTCCC CCGGCTGCGA CGCAGCCGGG GCGGGTCGGG AAGGTCGAAC GGCCGTCATA CCAGCGGCCG TCGTGTCACG TCGTGGTCGT TTCGCGGACG GTCGTTTCGC GGACGGTGCT GCCGCTGGCG GTCCTCGGAC TGCTCGGGCT GACCGGGGCG TTCCTCGCCG GCACCGGCAC CGGCACCAGC ACCAGCACCA GCACCAGCAC CGGTGCCGGG AAGGGACGGA TCTGGCCGCT GTCAGGCGCC ATGCTGGCCT GCCCGGACCT GACGCTGGTG GCACCCGATC CGGACGGGCC GGGCGGGCGG GCCCGTTCGC TCGGCCTGGA CGTGGCCGCC GGCGGTGGAC CGACGGGCCG GGTGCGGGTG TCCACGGTGG GGTCCGAGGT TCCAACCAAA GGGCTTGGAC TACGGATGGA CGCCCGTGGT CGACGTCACC TGGACCTTCC GCGGCCCTCG GAGAGCAGCG CCGTCACCGG CGTGGCGGGC AGCCGCGGAT CGATCCTGGT GACCGCCCAG GGTGCGTCCG CCGGGGGACT GTCGGCCACC GTGACGGCCC GGGACAAAGA CGGCGTCGGG TCAATGGACG GCGTCGGGCC GATTCAGGCG CGTTGCGAGC CGTCCCGGGC CCGGGTCTGG TTCACCGGAC CATCGACCGT GGCCGGTCGG GATCCGGTGG TGATCTGGTC GAATCTGGCA GATGAACCGG CCCTGATCAA CGTGCGGGTG CTTTCCGACG GACCGGCGAC GTCGCCGCAG GACATCACCG TGCCCCCGGA GCACGCGGTG ATCCGGCGAC TCGCCTCCCT CGCACCGGAG GCCACCGCGA CCGCGCTCGA CGTCCAGGTG CGCTCCGGCC GGGTTCTCAC CTGGGTCGCG GACCGGCCCA GCGCGGGGCG TACGGACTCG GTGCCGGGGA CGGACCGGGC GGGGCGTGCG GATCCGGTGA GCCTTGTCCC GAGCACCTCC GCCCCGGCCC GGCGTCTCCT GCTTGGCGGG GTCGTCGTCC CGGCGGGGTC AGCGTCCTCG ACGGCGCACC TGGTGCTCGC CGCACCGTTG CGGGAGGCCA CGGTGCGCAT CACGGTCCTC ACGGGTACGG GTCGGCACAC GCCGATCGGC CTGGAGGCCG TCGTCGTGCC CGCCACGGGT GCGATCACCA CGCCCGTGCC GCTGCCGGCG GGCGCCGCCT CGGCGGTCCT GGTCGAATCG GCCGATGACG CACCGGTGCT CGCCGGACTG GCCGCACCGT CGGGCCGGGC GGGAGAGCGG TCGACCGGCT CGGCCGGCGG ATCCACCTGG GTCGGCGCCA CCAGCCTTGA TCCTCCGCTC GACCCGCGCG GGCTCGACGC GTACGGCCAG GCGGTCGGCC TCTCCGGGCC GACGGGGCTG GCCGCCGGGG CGGGATCCGC GCCGGACGCG GTCGTCTCCC TCCCACCGGT GCCGCCCGGC GCGACCGGCG TCGTCGTGCT CGCAGCTCCG GCGGGAGCGG TCACGGCGTG GATCGACGGC GAGCTGGTAC GGGTACGGCC GGGCACCGTC GCGACTGCCC GCATGCCGAT CGGACGGGCC GGTGGACGGC TTGTCGCGGT CGGCGGCCCG CTGGAGGCCA CCGCCGTGAT CGGCCCTCCC CCCGGTACGC CGGGGGAGGG CCGGGTATCC GGGGATCCGC CCCGTGGGCC TACCCCCTCC GGTTCCTCCG GCTCCACCGG CTCCGCGTCG TCGACCGCCG CGTCGTCGAC CGCCGCGTCG TCGACCTTCG GGATGCCCAA GCCCGTGATG CCGCGGAGCA TCTCCACCGT CGTTCCGTTG TGGGGGGCGT CGAGGCTCCT CGACGCTCCG GTCTCCATCG AGGATCCCAC CGTCCTGTAT CAGCGCGGCT AG
|
Protein sequence | MTGDPGRSQA ATVAAVTVAD DAGPPVVAVC VLGRGGGGPD VAAQVRAALR AQTRPPDRVV EVRLGGGGCV PRQRASPLDG EGRLPSADHP VDPGTDGADA QAGDPVAEPL VVEVPSATPF GAAVAVGLAR LGNPPEHGFV WLLDDTAIPT PGALHTLLAY ARLDRGAAVL GPKLLARAPE SAPPAEFVPP AGAGGSAGPA GARSRPVTRP RVVEAGVSVD RAGRRHPGIR PDYDDHGQCD GVRDVLAVAC SGMLVRAVVW RLLGGLDPEL DGGHDIDLGW RVARAGLRVV VVPSATVTVT SASAFSGGPA FPRASADRAG VLRVRLANTA TVLFPAMVVA LLVGTVVRTL GLLLRGRRRA ALAELTLAGT VLGRPWQLRR MRRRAARTWA VPHRAVRPLF PARLPWRRGA GADDLLDRAG AAEDVADTTL MVTGPRPDVH PRRPAWPPTT LAALLGAVGL LAMRGMPADM LGGGLPLPAG SRNLWSAVWS GWAGAPDGPL SGPRPAPPFT ALLAAAATLL GGRPAPAAWI LLAAGPALAG LSVYRALARL GPHPIARFGL AAGYGLNPVM TGSVLAGRTD TVGAVIGLPL VLAAADAVLR DRRDGSGFGV GGAPGRPVWT LAICLVGFIA CAPSTLAIAW AGLVVVAFLT ARSRLPGMIV ALLATIIPVL PDLRVDPVAV LTESAITLAR LPQYSLALLA GADSGPRADV AGFVGGCLAC WLFGRLAGRE RGRRRRTAVL VVLPVTLLLL VGPLILAARF LAVDVARPGD TTGANPAGVE GSALAHAVAA VTRAAGPGAR VLVLHQPASA RTIRYTLASA SGPTFPEAGA RTPRRSGRFL ADMVADLATN GGRAAGWLPL LGVTSVAVPT VDSASQLVAR LDATPSLIRD RPRPGAILWR PAVAARGATA LGEPGGAGAP RPDASGWAGG PLAWVMESDR ADRREPVARH SAPAATRPTT RPIGTAPGVP PLLSAPLPPA PSSVRRLASG ERLPALAARS GPARCLVLAV PADAGWHAWL EGRPLAPFRA WGWAAGFQLP PDGGRLRIDH DRTSPHRAVV GQAAALAALV LLGIPVMLGI PVMRSRRMAP PAATQPGRVG KVERPSYQRP SCHVVVVSRT VVSRTVLPLA VLGLLGLTGA FLAGTGTGTS TSTSTSTGAG KGRIWPLSGA MLACPDLTLV APDPDGPGGR ARSLGLDVAA GGGPTGRVRV STVGSEVPTK GLGLRMDARG RRHLDLPRPS ESSAVTGVAG SRGSILVTAQ GASAGGLSAT VTARDKDGVG SMDGVGPIQA RCEPSRARVW FTGPSTVAGR DPVVIWSNLA DEPALINVRV LSDGPATSPQ DITVPPEHAV IRRLASLAPE ATATALDVQV RSGRVLTWVA DRPSAGRTDS VPGTDRAGRA DPVSLVPSTS APARRLLLGG VVVPAGSASS TAHLVLAAPL REATVRITVL TGTGRHTPIG LEAVVVPATG AITTPVPLPA GAASAVLVES ADDAPVLAGL AAPSGRAGER STGSAGGSTW VGATSLDPPL DPRGLDAYGQ AVGLSGPTGL AAGAGSAPDA VVSLPPVPPG ATGVVVLAAP AGAVTAWIDG ELVRVRPGTV ATARMPIGRA GGRLVAVGGP LEATAVIGPP PGTPGEGRVS GDPPRGPTPS GSSGSTGSAS STAASSTAAS STFGMPKPVM PRSISTVVPL WGASRLLDAP VSIEDPTVLY QRG
|
| |