Gene Francci3_0747 details


Gene Information

Locus tag: Francci3_0747
Symbol:
ID: 3905815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 864921
End bp: 870062
Gene Length: 5142 bp
Protein Length: 1713 aa
Translation table: 11
GC content: 75%
IMG OID: 637878080
Product: glycosyltransferases-like
Protein accession: YP_479860
Protein GI: 86739460
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
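
The listed lengths can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 864921
end_bp = 870062

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # prints 5142
print(protein_length)  # prints 1713
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.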


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGCG ATCCGGGCAG GTCACAGGCG GCCACGGTCG CCGCGGTGAC CGTGGCCGAT 
GACGCGGGGC CGCCGGTCGT CGCGGTCTGC GTGCTCGGTC GAGGGGGTGG TGGACCCGAC
GTTGCCGCCC AGGTCCGGGC CGCCCTGCGT GCCCAGACCA GACCGCCGGA CCGGGTCGTC
GAGGTGCGGC TCGGTGGCGG GGGATGCGTT CCCCGGCAAC GCGCGTCTCC CCTGGACGGG
GAGGGTCGCC TCCCGTCCGC CGACCATCCC GTCGACCCGG GCACCGACGG CGCGGATGCT
CAGGCCGGTG ACCCGGTCGC CGAGCCCCTC GTCGTAGAGG TGCCGTCCGC CACACCGTTC
GGCGCCGCCG TAGCCGTGGG GCTGGCTCGG CTGGGGAACC CACCGGAGCA CGGATTCGTC
TGGCTCCTCG ACGACACCGC GATCCCCACC CCGGGCGCGT TGCACACCCT GCTGGCCTAC
GCCCGGCTGG ACCGGGGCGC CGCCGTCCTC GGCCCCAAAC TGCTCGCCCG CGCTCCGGAA
TCCGCCCCAC CCGCCGAATT CGTCCCACCC GCCGGAGCGG GCGGTTCCGC TGGACCGGCA
GGTGCGCGGT CCCGGCCGGT GACGAGACCC CGGGTGGTTG AAGCCGGGGT GAGCGTCGAC
CGGGCCGGGC GACGGCATCC CGGAATTCGC CCGGACTATG ATGATCATGG ACAGTGTGAC
GGGGTACGTG ACGTCCTCGC CGTCGCGTGT TCGGGCATGC TCGTGCGGGC GGTGGTCTGG
CGACTGCTGG GTGGACTCGA CCCCGAGCTC GATGGTGGTC ATGACATCGA TCTCGGCTGG
CGGGTGGCCC GTGCCGGCCT ACGGGTCGTC GTCGTCCCGT CGGCCACGGT GACCGTGACC
TCGGCCTCAG CCTTTTCCGG GGGGCCGGCC TTCCCCAGGG CATCGGCCGA CCGCGCGGGG
GTTCTACGGG TCCGGCTGGC GAACACGGCG ACGGTGCTGT TTCCGGCGAT GGTGGTCGCC
CTGCTCGTCG GAACGGTCGT GCGCACCCTC GGCCTGCTCC TGCGCGGCCG ACGCCGGGCG
GCGCTCGCCG AGCTCACGCT CGCCGGTACC GTGCTGGGGC GGCCATGGCA GCTGCGCCGG
ATGCGTCGGC GCGCCGCCCG CACGTGGGCG GTGCCGCATC GGGCCGTCCG CCCGTTGTTC
CCGGCCCGCC TGCCGTGGCG GCGCGGAGCC GGCGCCGATG ATCTCCTTGA TCGGGCCGGA
GCCGCCGAAG ACGTCGCCGA CACCACCTTG ATGGTCACCG GGCCCAGACC GGATGTGCAC
CCGCGCCGGC CGGCCTGGCC GCCGACGACC CTGGCGGCCC TGCTGGGTGC CGTCGGGCTC
CTGGCCATGC GGGGGATGCC GGCGGACATG CTCGGCGGGG GCCTGCCGCT CCCGGCAGGC
AGCCGGAACC TGTGGTCGGC CGTGTGGTCT GGATGGGCGG GGGCGCCCGA TGGACCTCTG
AGTGGTCCTA GGCCGGCACC CCCTTTCACC GCGCTGCTCG CCGCGGCAGC CACTCTGCTG
GGGGGGAGGC CGGCGCCGGC CGCTTGGATC CTGCTGGCGG CCGGCCCGGC GCTCGCGGGC
CTGAGCGTCT ACCGGGCCCT CGCCCGGCTG GGACCACATC CGATCGCCCG GTTCGGGCTC
GCCGCCGGCT ACGGGTTGAA CCCGGTGATG ACCGGGTCGG TCCTCGCCGG CCGCACCGAC
ACCGTGGGCG CGGTGATCGG CCTTCCCCTG GTGCTCGCCG CCGCCGACGC GGTCCTGCGG
GACCGGCGGG ACGGCTCCGG CTTTGGCGTC GGTGGGGCGC CGGGCCGGCC GGTGTGGACG
CTCGCGATCT GTCTGGTCGG GTTCATCGCC TGCGCCCCGT CGACGCTGGC CATCGCCTGG
GCGGGCCTGG TCGTCGTCGC GTTCCTGACC GCGCGGTCCA GGCTGCCCGG TATGATCGTC
GCTCTGCTCG CGACGATCAT ACCGGTGCTT CCGGACCTGC GGGTGGATCC CGTCGCTGTC
CTCACCGAAT CGGCGATCAC CCTGGCCCGG TTGCCCCAGT ACTCCCTGGC CCTGCTGGCC
GGTGCGGACA GCGGACCCAG GGCCGATGTG GCCGGCTTCG TCGGTGGATG TCTGGCCTGC
TGGTTGTTCG GCCGCCTCGC GGGCCGGGAG CGGGGCCGCC GGCGGCGCAC GGCCGTGCTG
GTGGTCCTTC CCGTGACGCT GCTCCTCCTC GTCGGGCCGC TGATCCTTGC CGCCCGCTTC
CTGGCCGTGG ACGTGGCCCG GCCCGGCGAC ACCACAGGGG CGAACCCGGC CGGCGTCGAG
GGGTCCGCCC TCGCCCACGC GGTCGCGGCC GTGACCCGCG CCGCCGGGCC GGGTGCCCGC
GTCCTCGTCC TGCACCAACC GGCCTCGGCC AGAACGATCC GGTACACCCT GGCCTCCGCG
TCCGGGCCGA CGTTTCCCGA GGCCGGCGCC CGCACCCCGC GGCGGTCCGG CCGGTTCCTC
GCCGACATGG TCGCGGACCT CGCCACCAAC GGTGGCCGGG CAGCCGGCTG GCTCCCGCTT
CTGGGCGTGA CATCGGTCGC TGTACCTACG GTCGACTCGG CCTCTCAGCT CGTCGCCCGC
CTGGACGCGA CGCCGTCCTT GATCCGGGAT CGTCCCCGCC CGGGGGCGAT CCTCTGGCGC
CCCGCCGTTG CGGCCCGTGG TGCGACGGCC TTGGGAGAGC CCGGTGGGGC CGGCGCGCCG
CGACCGGACG CATCCGGGTG GGCGGGCGGA CCCCTGGCCT GGGTGATGGA GTCCGATCGG
GCCGATCGTC GTGAGCCGGT GGCGCGGCAC TCCGCTCCCG CGGCGACGAG GCCGACAACC
CGCCCGATCG GTACGGCGCC GGGTGTTCCT CCCCTCCTCT CGGCGCCGCT GCCGCCAGCG
CCCTCCTCGG TACGTCGGTT GGCCTCCGGC GAGCGGTTGC CCGCCCTCGC AGCCCGGTCC
GGCCCGGCAC GCTGCCTGGT GCTCGCCGTG CCGGCGGATG CCGGATGGCA CGCGTGGCTG
GAAGGCCGGC CGCTGGCTCC GTTCCGGGCG TGGGGCTGGG CAGCCGGATT CCAACTGCCG
CCCGACGGCG GTCGGCTCCG CATCGACCAC GACCGGACTT CACCCCACCG GGCAGTGGTG
GGCCAGGCTG CGGCACTCGC CGCGCTCGTC CTGCTGGGAA TCCCGGTCAT GCTGGGAATC
CCGGTCATGC GGTCGCGGCG GATGGCTCCC CCGGCTGCGA CGCAGCCGGG GCGGGTCGGG
AAGGTCGAAC GGCCGTCATA CCAGCGGCCG TCGTGTCACG TCGTGGTCGT TTCGCGGACG
GTCGTTTCGC GGACGGTGCT GCCGCTGGCG GTCCTCGGAC TGCTCGGGCT GACCGGGGCG
TTCCTCGCCG GCACCGGCAC CGGCACCAGC ACCAGCACCA GCACCAGCAC CGGTGCCGGG
AAGGGACGGA TCTGGCCGCT GTCAGGCGCC ATGCTGGCCT GCCCGGACCT GACGCTGGTG
GCACCCGATC CGGACGGGCC GGGCGGGCGG GCCCGTTCGC TCGGCCTGGA CGTGGCCGCC
GGCGGTGGAC CGACGGGCCG GGTGCGGGTG TCCACGGTGG GGTCCGAGGT TCCAACCAAA
GGGCTTGGAC TACGGATGGA CGCCCGTGGT CGACGTCACC TGGACCTTCC GCGGCCCTCG
GAGAGCAGCG CCGTCACCGG CGTGGCGGGC AGCCGCGGAT CGATCCTGGT GACCGCCCAG
GGTGCGTCCG CCGGGGGACT GTCGGCCACC GTGACGGCCC GGGACAAAGA CGGCGTCGGG
TCAATGGACG GCGTCGGGCC GATTCAGGCG CGTTGCGAGC CGTCCCGGGC CCGGGTCTGG
TTCACCGGAC CATCGACCGT GGCCGGTCGG GATCCGGTGG TGATCTGGTC GAATCTGGCA
GATGAACCGG CCCTGATCAA CGTGCGGGTG CTTTCCGACG GACCGGCGAC GTCGCCGCAG
GACATCACCG TGCCCCCGGA GCACGCGGTG ATCCGGCGAC TCGCCTCCCT CGCACCGGAG
GCCACCGCGA CCGCGCTCGA CGTCCAGGTG CGCTCCGGCC GGGTTCTCAC CTGGGTCGCG
GACCGGCCCA GCGCGGGGCG TACGGACTCG GTGCCGGGGA CGGACCGGGC GGGGCGTGCG
GATCCGGTGA GCCTTGTCCC GAGCACCTCC GCCCCGGCCC GGCGTCTCCT GCTTGGCGGG
GTCGTCGTCC CGGCGGGGTC AGCGTCCTCG ACGGCGCACC TGGTGCTCGC CGCACCGTTG
CGGGAGGCCA CGGTGCGCAT CACGGTCCTC ACGGGTACGG GTCGGCACAC GCCGATCGGC
CTGGAGGCCG TCGTCGTGCC CGCCACGGGT GCGATCACCA CGCCCGTGCC GCTGCCGGCG
GGCGCCGCCT CGGCGGTCCT GGTCGAATCG GCCGATGACG CACCGGTGCT CGCCGGACTG
GCCGCACCGT CGGGCCGGGC GGGAGAGCGG TCGACCGGCT CGGCCGGCGG ATCCACCTGG
GTCGGCGCCA CCAGCCTTGA TCCTCCGCTC GACCCGCGCG GGCTCGACGC GTACGGCCAG
GCGGTCGGCC TCTCCGGGCC GACGGGGCTG GCCGCCGGGG CGGGATCCGC GCCGGACGCG
GTCGTCTCCC TCCCACCGGT GCCGCCCGGC GCGACCGGCG TCGTCGTGCT CGCAGCTCCG
GCGGGAGCGG TCACGGCGTG GATCGACGGC GAGCTGGTAC GGGTACGGCC GGGCACCGTC
GCGACTGCCC GCATGCCGAT CGGACGGGCC GGTGGACGGC TTGTCGCGGT CGGCGGCCCG
CTGGAGGCCA CCGCCGTGAT CGGCCCTCCC CCCGGTACGC CGGGGGAGGG CCGGGTATCC
GGGGATCCGC CCCGTGGGCC TACCCCCTCC GGTTCCTCCG GCTCCACCGG CTCCGCGTCG
TCGACCGCCG CGTCGTCGAC CGCCGCGTCG TCGACCTTCG GGATGCCCAA GCCCGTGATG
CCGCGGAGCA TCTCCACCGT CGTTCCGTTG TGGGGGGCGT CGAGGCTCCT CGACGCTCCG
GTCTCCATCG AGGATCCCAC CGTCCTGTAT CAGCGCGGCT AG
 
Protein sequence
MTGDPGRSQA ATVAAVTVAD DAGPPVVAVC VLGRGGGGPD VAAQVRAALR AQTRPPDRVV 
EVRLGGGGCV PRQRASPLDG EGRLPSADHP VDPGTDGADA QAGDPVAEPL VVEVPSATPF
GAAVAVGLAR LGNPPEHGFV WLLDDTAIPT PGALHTLLAY ARLDRGAAVL GPKLLARAPE
SAPPAEFVPP AGAGGSAGPA GARSRPVTRP RVVEAGVSVD RAGRRHPGIR PDYDDHGQCD
GVRDVLAVAC SGMLVRAVVW RLLGGLDPEL DGGHDIDLGW RVARAGLRVV VVPSATVTVT
SASAFSGGPA FPRASADRAG VLRVRLANTA TVLFPAMVVA LLVGTVVRTL GLLLRGRRRA
ALAELTLAGT VLGRPWQLRR MRRRAARTWA VPHRAVRPLF PARLPWRRGA GADDLLDRAG
AAEDVADTTL MVTGPRPDVH PRRPAWPPTT LAALLGAVGL LAMRGMPADM LGGGLPLPAG
SRNLWSAVWS GWAGAPDGPL SGPRPAPPFT ALLAAAATLL GGRPAPAAWI LLAAGPALAG
LSVYRALARL GPHPIARFGL AAGYGLNPVM TGSVLAGRTD TVGAVIGLPL VLAAADAVLR
DRRDGSGFGV GGAPGRPVWT LAICLVGFIA CAPSTLAIAW AGLVVVAFLT ARSRLPGMIV
ALLATIIPVL PDLRVDPVAV LTESAITLAR LPQYSLALLA GADSGPRADV AGFVGGCLAC
WLFGRLAGRE RGRRRRTAVL VVLPVTLLLL VGPLILAARF LAVDVARPGD TTGANPAGVE
GSALAHAVAA VTRAAGPGAR VLVLHQPASA RTIRYTLASA SGPTFPEAGA RTPRRSGRFL
ADMVADLATN GGRAAGWLPL LGVTSVAVPT VDSASQLVAR LDATPSLIRD RPRPGAILWR
PAVAARGATA LGEPGGAGAP RPDASGWAGG PLAWVMESDR ADRREPVARH SAPAATRPTT
RPIGTAPGVP PLLSAPLPPA PSSVRRLASG ERLPALAARS GPARCLVLAV PADAGWHAWL
EGRPLAPFRA WGWAAGFQLP PDGGRLRIDH DRTSPHRAVV GQAAALAALV LLGIPVMLGI
PVMRSRRMAP PAATQPGRVG KVERPSYQRP SCHVVVVSRT VVSRTVLPLA VLGLLGLTGA
FLAGTGTGTS TSTSTSTGAG KGRIWPLSGA MLACPDLTLV APDPDGPGGR ARSLGLDVAA
GGGPTGRVRV STVGSEVPTK GLGLRMDARG RRHLDLPRPS ESSAVTGVAG SRGSILVTAQ
GASAGGLSAT VTARDKDGVG SMDGVGPIQA RCEPSRARVW FTGPSTVAGR DPVVIWSNLA
DEPALINVRV LSDGPATSPQ DITVPPEHAV IRRLASLAPE ATATALDVQV RSGRVLTWVA
DRPSAGRTDS VPGTDRAGRA DPVSLVPSTS APARRLLLGG VVVPAGSASS TAHLVLAAPL
REATVRITVL TGTGRHTPIG LEAVVVPATG AITTPVPLPA GAASAVLVES ADDAPVLAGL
AAPSGRAGER STGSAGGSTW VGATSLDPPL DPRGLDAYGQ AVGLSGPTGL AAGAGSAPDA
VVSLPPVPPG ATGVVVLAAP AGAVTAWIDG ELVRVRPGTV ATARMPIGRA GGRLVAVGGP
LEATAVIGPP PGTPGEGRVS GDPPRGPTPS GSSGSTGSAS STAASSTAAS STFGMPKPVM
PRSISTVVPL WGASRLLDAP VSIEDPTVLY QRG
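
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only, not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_block = "ATGACGGGCG ATCCGGGCAG GTCACAGGCG"
print(translate(first_block))  # prints MTGDPGRSQA
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would give values to compare against the GC content field and the protein listing.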