Gene Francci3_1050 details

Gene Information

Locus tag: Francci3_1050
Symbol:
ID: 3905296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1244128
End bp: 1248954
Gene Length: 4827 bp
Protein Length: 1608 aa
Translation table: 11
GC content: 73%
IMG OID: 637878384
Product: hypothetical protein
Protein accession: YP_480161
Protein GI: 86739761
COG category:
COG ID:
TIGRFAM ID:
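
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1244128
end_bp = 1248954
gene_length_bp = 4827
protein_length_aa = 1608

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codons, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)
print(leftover_bases == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span equals the stated gene length, the length is a multiple of three, and the codon count minus the stop codon equals the stated protein length.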


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGTGA CCACGTCGCC CTCCGGTCCG GCCTCGTCCG ATCGCCCCGC TTCCGGTCGC 
CCCGAGTCCG GTCGCCCCGA GTCCGGTCCG GCCTCGGCGG CGATCCGCCG GCGGCGACCG
CCCTCTTGGC CGACCCTGGT GCTGGCCGCC GTGGCGTATC TGCCGCTGCT GGCGACGGCA
CCGGGAAAGA TCGGCGCCGA CACCAAGGCA TACCTCTACC TCGATCCGAG CCGGATGCTG
CGCCGAGCGG TCTCCATGTG GGACCCCGGC ATCGGCATGG GCACCGTCAC CCATCAGAAC
ATCGGCTATC TGTTTCCGCA GGGCGCCTTC TACTGGCTGC TGGATCTCGT CGGCCTGCCC
GACTGGGTCG CGCAGCGGCT GTGGACGGGC TCCATCCTGT TCGGCGCCGG CACCGGCGTG
CTGTTCTTGC TCCGCTCGTT GCGCTGGCCC GACCGCTTCG CGTTCGTCGC CGCGTTCGGC
TACATGCTCT CGCCCTACAT CCTGGAATAC GAGGCGCGGA TCTCGGCGAT CCTGCTGCCC
TACGCCGGAC TGGGCTGGCT GATCGGCATC ACCGTGCGCG GGTTGCGCGA GGCGGAGCCC
GGCCGGGCCG GCTGGCGTCC CGCCGGATGG CGCTTCCCCG CCGCCTTCGC CCTCGTGGTC
ACGACGATCG GCAGCATCAA CGCCTCCAGC CTGATCTTCA TCCTGTTCGC CCCACTGCTG
TGGATACCGT TCGCCGTCTG GGGTACCCGC GAGGTCCGAC TCGGCGCCGC CGTGAGTATG
TGCACCCGTG CGGTGCTGGT CGTCCTCGTC ACGTCGGCGT GGTGGATCGC GGGGCTCTAC
ACCCAGGCCG GTTATGGCCT CAACGTCCTC GCCTTCACCG AGACGATCGA GACGGTCGCC
AGCAGCTCGC AGGCCTCCGA GGTGCTGCGG GGCCTGGGCA ACTGGTTCTT CTATGGACAG
GACGCGCTCG GTCCGTGGAT CGGGCCGGCC ACCCACTACA CCCAGAGCAT CTGGCTGATC
GCGGTCAGCT TCACCATACC CATCCTCGCC CTGATCGGCG CCTCGGCCAT CCGCTGGGGC
CAGCGCGGCT ACTTCGTCGC GCTGATCCTG CTGGGCACCA CGATCGCCGT CGGCGTCTAC
CCGTACGACG ATCCCTCTCC CTGGGGACGG CTGTTCAAGG ACTTCGCCGA GGGATCGACG
GCCGGTCTCG CGCTGCGTTC GCTGCCCCGG GCGGTCCCCA TGGTGGCACT GGGCATGTCG
GTGCTGCTCG CGGGCGGGCT CGCAGCTCTC TGGGAGCGGT ACGCGGCGGC CGGTCCGACC
GCTACGGCTG CCCCCGGTGG GCGGCCGGTC CGGTCGGGCC GGCGCCGACG GCTGGTTCCG
CCGCTCGCCA CCGGGGTGGT CCTGCTGTTG CTGGCCCTCG ACATGTCACC GCTGTTCCGC
GGCGAGTTCG TCGAGCCGTT ACTCGAACGC CCCGAGCACG TCCCCGCCTA CGAGACCGAA
CTCGCGCGCG CCCTGGACAC CAGGAACAAC GGCACCCGGG TGCTGGAGCT TCCCGGTGCC
GACTTCTCCC ACTACCGCTG GGGAAGCACC CTCGACCCGG TCACCGAGGG GCTGATGGAC
CGTCCGCTGG TCATGCGGGA ACTCATCCCC TACGGCGAGG CGGGCAGCGT CGACCTGGTC
CGCTCGCTGG ACCGCCGGCT GCAGGAGGGG GTGTTCGAGA CCTCCGCCCT GCCGGACCTC
GCCCGGTTGA TGGGCGTAGG CGACGTGGTG CTGCGCAGCA ACCTCGCCTA CGAGCGGTAC
CGGACCCCGC GCCCCCGGGC CACCTGGAAC CTGTTCACCT CGAACAGACC GGACGGGCTG
GGTGTACCAC AGACCTTCGG ACCGCCGGTT GCCGAGGATC CCGGCATCCC GTTCACCGAC
GAGGTCACCC TCGGCACCGA CCCGTCGGTG CCCGATCCGC CGGCGCTGGC CGTCTTCCCG
GTCAGCGGCG CGCAGCCGAT CGTGCGCACC GCCACCACCG CGCGGCCCCT GCTCGTCAGC
GGCAACGGGG AGGCGCTGGT GGACGCGGCG GCCTCCGGCC TGCTCGCCGA GCCAATCGCC
GACGGGCGGG CGATCCTCTA CGCCCCGGAA CTCGCCACCG ATCCCACCGC GATGCGCCAA
GCCCTCGACG ACGGTGCGGA CCTGCTCGTC ACCGACACCA ACCGGCGGCG CGCCGAACGG
TGGACGGGCA TCCGGGAGAA CTTCGGCTAC GTCGAGCAAC CCGGAGTCAG GCCGCTGGCG
AAGGACCCGA ACGACAACCG GCTGCCGCTG TTCCCCGATC AGAACGTCAC CAGCGAGACC
ACAGCCACCC TGCACGCCCC GGGATCCCCG GCGAAGATCG CCGAGGTGAC CGCCACCAGC
TACGGGAACA GCTTCTCCTA CGGGGCGTCC GACCGACCGG TCCGCGCGAT CGACGGCGAC
CCGGGCACCG CGTGGCGGGT CGGCGCCTTC ACCGACCCGA CCGGGGAGGC CTGGCAGACC
ACGCTGGCCG AGCCGACGAC GACGAACCAG ATCCGCCTCG TCCAGCCGCT GACCGGGCCG
CGCAACCGGT GGATCACCCG GGCGACGCTG ACCTTCGACG GCGGCTCCCC GGTGACCGTC
GCGCTCACCG ACGCCTCCCG CACCGCCGCC GGGCAGGTCG TGCGCTTCCC CACCCGGGCC
TTCCGAACGT TACGCATCCA CATCGACGCG ACGAACACCG GCCGGCAGCG CAGCTACGAC
AACGTCTCCG CGGTCGGGTT CGCCGAGGTC GCCATCCCCG GAGCGAACGG GGCACCGCTG
ACGGCCGAGG AGGTGCTGCG GATGCCGAGT GACAGCCTCG ACGCCGCCGG TGCCGCCTCC
CTCGGCCACC GGCTGGCACT GGAGGTGTCC CGGGACCGGG CCAACCCGTC GGAACCGTTC
AAGCAGGACA CCGAGTCGGT CATCAGCCGG TCGTTCTCCC TGCCCACGGC CCGGACGTTC
GCGCTCACCG GCACCGCCCG CGTCTCCGCC TACTCCCCCG ACGATCGCGT CGACCAGGTG
CTGGGCCGCC CGGCCACCCT CCCGGTGGTG ACCTCCTCCG GCCGGCTGCC GGGATCGCTC
GCCGCGCGGG CGTCGAGCGC CTTCGACGGA AACCCGGCCA CCGCGTGGAG CCCGGGCATC
GGCAACCCGC GAGGCAGCTG GATCCAGGTC GCCTCGCCGA CCCCGTTGAC GGTCTCGTCG
ATGACGATGT CGATCGTCGC CGACGGTCGG CACTCGATCC CGACCCGCAT CGGGATCGAG
GTCGACGGCC AGCGGGTCGG CGCGGTGACC GTGCCTGCGG TCACCGACAC CTCGGCGCGC
GGGGGCACCC GCGAGGTGAC CCTCACCTTC CCGGCCGTCA CCGGCAGCAC GTGGAAGTTC
GTCATCGACG ATGCCCGGAC CGTCACCAGC ATCGACCCCA TCAGTCGCTC GCCGCTGGCG
ATGCCGGTGG GCCTCGCCGA GATCGGGATA CCCGGCCTCG CGACCGTGCC TGCCGCGGGC
ACCGGTCAGC CCACGGCGGG CACCGGTCAG CCCACGGCAG GCACGGCAGG GAGCACCGCG
ACGGGGCTGT TGCCGGCCCA GATCCCCGCG CCCTGCCGGA CCGACCTGCT GAGCATCGAC
GGTCAGCCGG TGGGGGTGCA GGTCACCGGG GGCACCGCCG ACGCGGTCAA CCGGCTCGGG
TTGACCGTCG CCACCTGCGG CCAGCCGGTC ACCCTCGGCC CGGGGGAGCA CGTGATCCGG
ACCGCCGACG GGGCCCTCGC CGGCATCGAT CTCGATCGCC TGCTGCTGGC CTCGGACGTC
GGCGGCGGCC CGTGGCTGAA CGCGACCGGC GCCGCCTCCG CCGTCTCCCC CGCCGGGACC
CCACCGGCCG CGACGGGCAC CACGGGAACG ACGTCGGCCG CCACCGGAAC GACGGGCACC
ACCGGAACGA CGGGCACCAC CGGAACGACG GGCACCACCG GAACGACGTC GGCCGCCACG
CCACGGGTGA CGGTCCGATC GGCCGGCGAC ACATCCTTCA CCGTCGACGT GACGGGGGCG
CGGCCCGGTA CCCCGTTCTG GCTGGTGCTC GCGGAGAGCC TCTCCCCGGG TTGGAAGGCG
ACCGTGGGCG GCGCCGACCT GGGCACCCCC AGGCTGGTGG ACGGCTACGC GAACGGCTGG
CGGATCACCC CGACGGCCGC CGGGTTCACC GTCACGCTGA CCTGGACCCC ACAACGGATC
GTGTGGTTCG CCCTGGCGCT GTCCGCGGTG ACGGTGCTGC TCTCCCTAGC ACTGGTGATC
TTCTACACCC GCCGAGCCCG CGCGACCGTG GCCACGCGGG CGTCCCGGTC GTCCCGGTCG
TCCCGGGCAC CGGAGCCGGA TCTGCCGACA GCCGAGCCGT GGTCCGTCCG GGCGGGGCGT
GTCGGCCCGC GCACCACCGT GCTCGCCGTG TTGGGGATGG GTCTGCTCTC GGCGGTGCTG
GTCTCCCCGG TAGCGGGGGT CATCGTCGCC TTGGCGACGG CCGTGGCGCT GCTCGTGCCC
CGCGGCCGGC TGCTGACCCG CGTCGGCCCG GTGGTCTGCC TGGGCGTCAG CGCCCTGTAC
GTGCTGCAGG TGCAGGCCCG GCATGCGCTG CCGACGGACG GAGACTGGGT CGCCGCGTTC
GGGAAGGTGG CGACGATCTC CTGGCTCACG GTGCTGCTAC TGGCGAGCGA CCAGCTCGTC
GCCCAGCTGC AACGACGGCG CGAGGCGGGG CAGCCGGGCC CGCCGTCTCC CCGGACGGGC
CCGGGGGAGG AGCCTCCGGG CCACTGA
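
The GC content field reported for this gene is the share of G and C bases in the sequence above. The following is a minimal sketch of that calculation; `gc_percent` is a hypothetical helper, not a function of the source database, and it assumes the sequence contains only A, C, G and T, with whitespace used purely for layout.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in the
    listing above) is ignored; case is ignored.
    """
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Example input: the first line of the gene sequence listed above.
first_line = "GTGACCGTGA CCACGTCGCC CTCCGGTCCG GCCTCGTCCG ATCGCCCCGC TTCCGGTCGC"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of a single line gives the value to compare with the GC content field.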
 
Protein sequence
MTVTTSPSGP ASSDRPASGR PESGRPESGP ASAAIRRRRP PSWPTLVLAA VAYLPLLATA 
PGKIGADTKA YLYLDPSRML RRAVSMWDPG IGMGTVTHQN IGYLFPQGAF YWLLDLVGLP
DWVAQRLWTG SILFGAGTGV LFLLRSLRWP DRFAFVAAFG YMLSPYILEY EARISAILLP
YAGLGWLIGI TVRGLREAEP GRAGWRPAGW RFPAAFALVV TTIGSINASS LIFILFAPLL
WIPFAVWGTR EVRLGAAVSM CTRAVLVVLV TSAWWIAGLY TQAGYGLNVL AFTETIETVA
SSSQASEVLR GLGNWFFYGQ DALGPWIGPA THYTQSIWLI AVSFTIPILA LIGASAIRWG
QRGYFVALIL LGTTIAVGVY PYDDPSPWGR LFKDFAEGST AGLALRSLPR AVPMVALGMS
VLLAGGLAAL WERYAAAGPT ATAAPGGRPV RSGRRRRLVP PLATGVVLLL LALDMSPLFR
GEFVEPLLER PEHVPAYETE LARALDTRNN GTRVLELPGA DFSHYRWGST LDPVTEGLMD
RPLVMRELIP YGEAGSVDLV RSLDRRLQEG VFETSALPDL ARLMGVGDVV LRSNLAYERY
RTPRPRATWN LFTSNRPDGL GVPQTFGPPV AEDPGIPFTD EVTLGTDPSV PDPPALAVFP
VSGAQPIVRT ATTARPLLVS GNGEALVDAA ASGLLAEPIA DGRAILYAPE LATDPTAMRQ
ALDDGADLLV TDTNRRRAER WTGIRENFGY VEQPGVRPLA KDPNDNRLPL FPDQNVTSET
TATLHAPGSP AKIAEVTATS YGNSFSYGAS DRPVRAIDGD PGTAWRVGAF TDPTGEAWQT
TLAEPTTTNQ IRLVQPLTGP RNRWITRATL TFDGGSPVTV ALTDASRTAA GQVVRFPTRA
FRTLRIHIDA TNTGRQRSYD NVSAVGFAEV AIPGANGAPL TAEEVLRMPS DSLDAAGAAS
LGHRLALEVS RDRANPSEPF KQDTESVISR SFSLPTARTF ALTGTARVSA YSPDDRVDQV
LGRPATLPVV TSSGRLPGSL AARASSAFDG NPATAWSPGI GNPRGSWIQV ASPTPLTVSS
MTMSIVADGR HSIPTRIGIE VDGQRVGAVT VPAVTDTSAR GGTREVTLTF PAVTGSTWKF
VIDDARTVTS IDPISRSPLA MPVGLAEIGI PGLATVPAAG TGQPTAGTGQ PTAGTAGSTA
TGLLPAQIPA PCRTDLLSID GQPVGVQVTG GTADAVNRLG LTVATCGQPV TLGPGEHVIR
TADGALAGID LDRLLLASDV GGGPWLNATG AASAVSPAGT PPAATGTTGT TSAATGTTGT
TGTTGTTGTT GTTGTTSAAT PRVTVRSAGD TSFTVDVTGA RPGTPFWLVL AESLSPGWKA
TVGGADLGTP RLVDGYANGW RITPTAAGFT VTLTWTPQRI VWFALALSAV TVLLSLALVI
FYTRRARATV ATRASRSSRS SRAPEPDLPT AEPWSVRAGR VGPRTTVLAV LGMGLLSAVL
VSPVAGVIVA LATAVALLVP RGRLLTRVGP VVCLGVSALY VLQVQARHAL PTDGDWVAAF
GKVATISWLT VLLLASDQLV AQLQRRREAG QPGPPSPRTG PGEEPPGH
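
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The following is a minimal sketch of that rule; `translate` is a hypothetical helper written for illustration, not the tool that produced this record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 assigns the
# same amino acids as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_gene_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is translated as methionine.
        if i == 0 and is_gene_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "GTGACCGTGA CCACGTCGCC CTCCGGTCCG GCCTCGTCCG ATCGCCCCGC TTCCGGTCGC"
last_line = "CCGGGGGAGG AGCCTCCGGG CCACTGA"

print(translate(first_line))
print(translate(last_line, is_gene_start=False))
```

Each full line of the gene listing holds 60 bases, so every line begins in frame and can be translated on its own, as done here for the first and last lines.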