Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1050 |
Symbol | |
ID | 3905296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1244128 |
End bp | 1248954 |
Gene Length | 4827 bp |
Protein Length | 1608 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878384 |
Product | hypothetical protein |
Protein accession | YP_480161 |
Protein GI | 86739761 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTGA CCACGTCGCC CTCCGGTCCG GCCTCGTCCG ATCGCCCCGC TTCCGGTCGC CCCGAGTCCG GTCGCCCCGA GTCCGGTCCG GCCTCGGCGG CGATCCGCCG GCGGCGACCG CCCTCTTGGC CGACCCTGGT GCTGGCCGCC GTGGCGTATC TGCCGCTGCT GGCGACGGCA CCGGGAAAGA TCGGCGCCGA CACCAAGGCA TACCTCTACC TCGATCCGAG CCGGATGCTG CGCCGAGCGG TCTCCATGTG GGACCCCGGC ATCGGCATGG GCACCGTCAC CCATCAGAAC ATCGGCTATC TGTTTCCGCA GGGCGCCTTC TACTGGCTGC TGGATCTCGT CGGCCTGCCC GACTGGGTCG CGCAGCGGCT GTGGACGGGC TCCATCCTGT TCGGCGCCGG CACCGGCGTG CTGTTCTTGC TCCGCTCGTT GCGCTGGCCC GACCGCTTCG CGTTCGTCGC CGCGTTCGGC TACATGCTCT CGCCCTACAT CCTGGAATAC GAGGCGCGGA TCTCGGCGAT CCTGCTGCCC TACGCCGGAC TGGGCTGGCT GATCGGCATC ACCGTGCGCG GGTTGCGCGA GGCGGAGCCC GGCCGGGCCG GCTGGCGTCC CGCCGGATGG CGCTTCCCCG CCGCCTTCGC CCTCGTGGTC ACGACGATCG GCAGCATCAA CGCCTCCAGC CTGATCTTCA TCCTGTTCGC CCCACTGCTG TGGATACCGT TCGCCGTCTG GGGTACCCGC GAGGTCCGAC TCGGCGCCGC CGTGAGTATG TGCACCCGTG CGGTGCTGGT CGTCCTCGTC ACGTCGGCGT GGTGGATCGC GGGGCTCTAC ACCCAGGCCG GTTATGGCCT CAACGTCCTC GCCTTCACCG AGACGATCGA GACGGTCGCC AGCAGCTCGC AGGCCTCCGA GGTGCTGCGG GGCCTGGGCA ACTGGTTCTT CTATGGACAG GACGCGCTCG GTCCGTGGAT CGGGCCGGCC ACCCACTACA CCCAGAGCAT CTGGCTGATC GCGGTCAGCT TCACCATACC CATCCTCGCC CTGATCGGCG CCTCGGCCAT CCGCTGGGGC CAGCGCGGCT ACTTCGTCGC GCTGATCCTG CTGGGCACCA CGATCGCCGT CGGCGTCTAC CCGTACGACG ATCCCTCTCC CTGGGGACGG CTGTTCAAGG ACTTCGCCGA GGGATCGACG GCCGGTCTCG CGCTGCGTTC GCTGCCCCGG GCGGTCCCCA TGGTGGCACT GGGCATGTCG GTGCTGCTCG CGGGCGGGCT CGCAGCTCTC TGGGAGCGGT ACGCGGCGGC CGGTCCGACC GCTACGGCTG CCCCCGGTGG GCGGCCGGTC CGGTCGGGCC GGCGCCGACG GCTGGTTCCG CCGCTCGCCA CCGGGGTGGT CCTGCTGTTG CTGGCCCTCG ACATGTCACC GCTGTTCCGC GGCGAGTTCG TCGAGCCGTT ACTCGAACGC CCCGAGCACG TCCCCGCCTA CGAGACCGAA CTCGCGCGCG CCCTGGACAC CAGGAACAAC GGCACCCGGG TGCTGGAGCT TCCCGGTGCC GACTTCTCCC ACTACCGCTG GGGAAGCACC CTCGACCCGG TCACCGAGGG GCTGATGGAC CGTCCGCTGG TCATGCGGGA ACTCATCCCC TACGGCGAGG CGGGCAGCGT CGACCTGGTC CGCTCGCTGG ACCGCCGGCT GCAGGAGGGG GTGTTCGAGA CCTCCGCCCT GCCGGACCTC GCCCGGTTGA TGGGCGTAGG CGACGTGGTG CTGCGCAGCA ACCTCGCCTA CGAGCGGTAC CGGACCCCGC GCCCCCGGGC CACCTGGAAC CTGTTCACCT CGAACAGACC GGACGGGCTG GGTGTACCAC AGACCTTCGG ACCGCCGGTT GCCGAGGATC CCGGCATCCC GTTCACCGAC GAGGTCACCC TCGGCACCGA CCCGTCGGTG CCCGATCCGC CGGCGCTGGC CGTCTTCCCG GTCAGCGGCG CGCAGCCGAT CGTGCGCACC GCCACCACCG CGCGGCCCCT GCTCGTCAGC GGCAACGGGG AGGCGCTGGT GGACGCGGCG GCCTCCGGCC TGCTCGCCGA GCCAATCGCC GACGGGCGGG CGATCCTCTA CGCCCCGGAA CTCGCCACCG ATCCCACCGC GATGCGCCAA GCCCTCGACG ACGGTGCGGA CCTGCTCGTC ACCGACACCA ACCGGCGGCG CGCCGAACGG TGGACGGGCA TCCGGGAGAA CTTCGGCTAC GTCGAGCAAC CCGGAGTCAG GCCGCTGGCG AAGGACCCGA ACGACAACCG GCTGCCGCTG TTCCCCGATC AGAACGTCAC CAGCGAGACC ACAGCCACCC TGCACGCCCC GGGATCCCCG GCGAAGATCG CCGAGGTGAC CGCCACCAGC TACGGGAACA GCTTCTCCTA CGGGGCGTCC GACCGACCGG TCCGCGCGAT CGACGGCGAC CCGGGCACCG CGTGGCGGGT CGGCGCCTTC ACCGACCCGA CCGGGGAGGC CTGGCAGACC ACGCTGGCCG AGCCGACGAC GACGAACCAG ATCCGCCTCG TCCAGCCGCT GACCGGGCCG CGCAACCGGT GGATCACCCG GGCGACGCTG ACCTTCGACG GCGGCTCCCC GGTGACCGTC GCGCTCACCG ACGCCTCCCG CACCGCCGCC GGGCAGGTCG TGCGCTTCCC CACCCGGGCC TTCCGAACGT TACGCATCCA CATCGACGCG ACGAACACCG GCCGGCAGCG CAGCTACGAC AACGTCTCCG CGGTCGGGTT CGCCGAGGTC GCCATCCCCG GAGCGAACGG GGCACCGCTG ACGGCCGAGG AGGTGCTGCG GATGCCGAGT GACAGCCTCG ACGCCGCCGG TGCCGCCTCC CTCGGCCACC GGCTGGCACT GGAGGTGTCC CGGGACCGGG CCAACCCGTC GGAACCGTTC AAGCAGGACA CCGAGTCGGT CATCAGCCGG TCGTTCTCCC TGCCCACGGC CCGGACGTTC GCGCTCACCG GCACCGCCCG CGTCTCCGCC TACTCCCCCG ACGATCGCGT CGACCAGGTG CTGGGCCGCC CGGCCACCCT CCCGGTGGTG ACCTCCTCCG GCCGGCTGCC GGGATCGCTC GCCGCGCGGG CGTCGAGCGC CTTCGACGGA AACCCGGCCA CCGCGTGGAG CCCGGGCATC GGCAACCCGC GAGGCAGCTG GATCCAGGTC GCCTCGCCGA CCCCGTTGAC GGTCTCGTCG ATGACGATGT CGATCGTCGC CGACGGTCGG CACTCGATCC CGACCCGCAT CGGGATCGAG GTCGACGGCC AGCGGGTCGG CGCGGTGACC GTGCCTGCGG TCACCGACAC CTCGGCGCGC GGGGGCACCC GCGAGGTGAC CCTCACCTTC CCGGCCGTCA CCGGCAGCAC GTGGAAGTTC GTCATCGACG ATGCCCGGAC CGTCACCAGC ATCGACCCCA TCAGTCGCTC GCCGCTGGCG ATGCCGGTGG GCCTCGCCGA GATCGGGATA CCCGGCCTCG CGACCGTGCC TGCCGCGGGC ACCGGTCAGC CCACGGCGGG CACCGGTCAG CCCACGGCAG GCACGGCAGG GAGCACCGCG ACGGGGCTGT TGCCGGCCCA GATCCCCGCG CCCTGCCGGA CCGACCTGCT GAGCATCGAC GGTCAGCCGG TGGGGGTGCA GGTCACCGGG GGCACCGCCG ACGCGGTCAA CCGGCTCGGG TTGACCGTCG CCACCTGCGG CCAGCCGGTC ACCCTCGGCC CGGGGGAGCA CGTGATCCGG ACCGCCGACG GGGCCCTCGC CGGCATCGAT CTCGATCGCC TGCTGCTGGC CTCGGACGTC GGCGGCGGCC CGTGGCTGAA CGCGACCGGC GCCGCCTCCG CCGTCTCCCC CGCCGGGACC CCACCGGCCG CGACGGGCAC CACGGGAACG ACGTCGGCCG CCACCGGAAC GACGGGCACC ACCGGAACGA CGGGCACCAC CGGAACGACG GGCACCACCG GAACGACGTC GGCCGCCACG CCACGGGTGA CGGTCCGATC GGCCGGCGAC ACATCCTTCA CCGTCGACGT GACGGGGGCG CGGCCCGGTA CCCCGTTCTG GCTGGTGCTC GCGGAGAGCC TCTCCCCGGG TTGGAAGGCG ACCGTGGGCG GCGCCGACCT GGGCACCCCC AGGCTGGTGG ACGGCTACGC GAACGGCTGG CGGATCACCC CGACGGCCGC CGGGTTCACC GTCACGCTGA CCTGGACCCC ACAACGGATC GTGTGGTTCG CCCTGGCGCT GTCCGCGGTG ACGGTGCTGC TCTCCCTAGC ACTGGTGATC TTCTACACCC GCCGAGCCCG CGCGACCGTG GCCACGCGGG CGTCCCGGTC GTCCCGGTCG TCCCGGGCAC CGGAGCCGGA TCTGCCGACA GCCGAGCCGT GGTCCGTCCG GGCGGGGCGT GTCGGCCCGC GCACCACCGT GCTCGCCGTG TTGGGGATGG GTCTGCTCTC GGCGGTGCTG GTCTCCCCGG TAGCGGGGGT CATCGTCGCC TTGGCGACGG CCGTGGCGCT GCTCGTGCCC CGCGGCCGGC TGCTGACCCG CGTCGGCCCG GTGGTCTGCC TGGGCGTCAG CGCCCTGTAC GTGCTGCAGG TGCAGGCCCG GCATGCGCTG CCGACGGACG GAGACTGGGT CGCCGCGTTC GGGAAGGTGG CGACGATCTC CTGGCTCACG GTGCTGCTAC TGGCGAGCGA CCAGCTCGTC GCCCAGCTGC AACGACGGCG CGAGGCGGGG CAGCCGGGCC CGCCGTCTCC CCGGACGGGC CCGGGGGAGG AGCCTCCGGG CCACTGA
|
Protein sequence | MTVTTSPSGP ASSDRPASGR PESGRPESGP ASAAIRRRRP PSWPTLVLAA VAYLPLLATA PGKIGADTKA YLYLDPSRML RRAVSMWDPG IGMGTVTHQN IGYLFPQGAF YWLLDLVGLP DWVAQRLWTG SILFGAGTGV LFLLRSLRWP DRFAFVAAFG YMLSPYILEY EARISAILLP YAGLGWLIGI TVRGLREAEP GRAGWRPAGW RFPAAFALVV TTIGSINASS LIFILFAPLL WIPFAVWGTR EVRLGAAVSM CTRAVLVVLV TSAWWIAGLY TQAGYGLNVL AFTETIETVA SSSQASEVLR GLGNWFFYGQ DALGPWIGPA THYTQSIWLI AVSFTIPILA LIGASAIRWG QRGYFVALIL LGTTIAVGVY PYDDPSPWGR LFKDFAEGST AGLALRSLPR AVPMVALGMS VLLAGGLAAL WERYAAAGPT ATAAPGGRPV RSGRRRRLVP PLATGVVLLL LALDMSPLFR GEFVEPLLER PEHVPAYETE LARALDTRNN GTRVLELPGA DFSHYRWGST LDPVTEGLMD RPLVMRELIP YGEAGSVDLV RSLDRRLQEG VFETSALPDL ARLMGVGDVV LRSNLAYERY RTPRPRATWN LFTSNRPDGL GVPQTFGPPV AEDPGIPFTD EVTLGTDPSV PDPPALAVFP VSGAQPIVRT ATTARPLLVS GNGEALVDAA ASGLLAEPIA DGRAILYAPE LATDPTAMRQ ALDDGADLLV TDTNRRRAER WTGIRENFGY VEQPGVRPLA KDPNDNRLPL FPDQNVTSET TATLHAPGSP AKIAEVTATS YGNSFSYGAS DRPVRAIDGD PGTAWRVGAF TDPTGEAWQT TLAEPTTTNQ IRLVQPLTGP RNRWITRATL TFDGGSPVTV ALTDASRTAA GQVVRFPTRA FRTLRIHIDA TNTGRQRSYD NVSAVGFAEV AIPGANGAPL TAEEVLRMPS DSLDAAGAAS LGHRLALEVS RDRANPSEPF KQDTESVISR SFSLPTARTF ALTGTARVSA YSPDDRVDQV LGRPATLPVV TSSGRLPGSL AARASSAFDG NPATAWSPGI GNPRGSWIQV ASPTPLTVSS MTMSIVADGR HSIPTRIGIE VDGQRVGAVT VPAVTDTSAR GGTREVTLTF PAVTGSTWKF VIDDARTVTS IDPISRSPLA MPVGLAEIGI PGLATVPAAG TGQPTAGTGQ PTAGTAGSTA TGLLPAQIPA PCRTDLLSID GQPVGVQVTG GTADAVNRLG LTVATCGQPV TLGPGEHVIR TADGALAGID LDRLLLASDV GGGPWLNATG AASAVSPAGT PPAATGTTGT TSAATGTTGT TGTTGTTGTT GTTGTTSAAT PRVTVRSAGD TSFTVDVTGA RPGTPFWLVL AESLSPGWKA TVGGADLGTP RLVDGYANGW RITPTAAGFT VTLTWTPQRI VWFALALSAV TVLLSLALVI FYTRRARATV ATRASRSSRS SRAPEPDLPT AEPWSVRAGR VGPRTTVLAV LGMGLLSAVL VSPVAGVIVA LATAVALLVP RGRLLTRVGP VVCLGVSALY VLQVQARHAL PTDGDWVAAF GKVATISWLT VLLLASDQLV AQLQRRREAG QPGPPSPRTG PGEEPPGH
|
| |