Gene Information |
Locus tag | Francci3_3449 |
Symbol | |
ID | 3905689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4106416 |
End bp | 4109586 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 637880772 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_482532 |
Protein GI | 86742132 |
COG category | |
COG ID | |
TIGRFAM ID | |
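The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 4106416, 4109586
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 3171 1056
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the table.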
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0502975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGGCG GGGGCGATGG GCACGGCGCT CCCGACTGTT TCGTGTCGTA CGCGGACGAC GGGCGGGCCT GGGCCGAATG GATCGCCGGG ACGTTGGAGG ATGCCGGCTA CCAGGTCCTG CTTGAGTCCT GGGCGCCGGC GGGGACGCAT CGGGTGGGGT GGCTGGACCG GGCGGTGAGC CAGGCCCGGC ACACGATCGC GGTGGTCTCC GACGGCTACC TGCGCTCCTC GCCCGCGGTC GCCGAATGGG CGGCGGCGTG GTCGGCCGAC CCGACCGGAG GCGAACGCCG GCTGCTGGTG ACCAGGGTGG CGGACCGGCC GCTGCCCGGT CTGCTAGGCC AACTCGTCCC GGTCGACCTG TTCGACCGCA GCGAGATCGC CGTCCGGGCC AGCCTGCTCG CGGCCCTGCG CGGCGACCGG CCACAACCGG AACAGCCGCC GGGCTTCCCG GGCCGGCGGG GGATCTTCCC GCCGGAGCTG CCGGCGGTGT GGAACGTGCC GGATCAGCCG GCCCGGTTCG TCGGCCGGAC GGCCGCGCTC GCCCGGCTCG ACGAGGCGCT GTCCCGCTCG CCGCTGGTCA CCGTGACGGG AATCGCGGGG ATCGGGAAGA CCAGCCTGGT CACCGAGTAC GTCCGCCAGC ACCGGGCGGA TCTCGACGCC GTGTGGTGGG TGCCGGCGGA ACGCCCCGAG CTGATCGGCG AGCGGGTGCG CGAGCTCGCC CCGGCGGTGG GCCTGCCCGG CCACGCGGAA CCGGCCGCCG TGATCGCGAA CCTCGGCCGG GCCGACGGCC GCTGGCTGAT CGTCCTCGAC GACGCCGCCG ACGCCGACAC GCTGCCGGAC TGGCTGCGGC CCACCGAACC CGGCCGGCTG CTACTGACCT CCCGCAACCC CGGCTGGGAC CATCTCGGCC CGGTCGTGCC GGTCGGACCG ATGGACCGGG CCGAGTCCGT CACCCTGCTC GCCGGCCGGC TGCCCGCCGT CGAACGGACG GTCGCCGACC GGATCGCCGA GCGGCTCGGG GACCACCCGC TCGCCCTCGA CCAGGCCGCC CACCGCATCA GCACCGGCCG CACCCCCGCC GAGACCTACC TTCAGGTGTT AATCGACCGG CCGGAGGTCC TCCTGGCGCA GGGGGAGGTG AGCGGGCGGC CCGGGGCCAC CGCCGCCACC CTCTGGGACG AGCCGATCCG CCGGCTCGAC GCCGAGGCAC CGGCCGCCGG CCACCTGCTG CGGATCGTCG CGCACGGCGA CAACGAACCG ATCCCGATCC GGCTGTTCAC CGCCGAGCCG GACGTGATCC CCGATTCCGG GCTGCGGGCG GCGGCCGGCG ACCCGCTCGC CCTCGCCGAC ACCGTCGGCG TCCTGGAGCG GTACGGCCTG GCCCACCGCG ACGCGGACAC CGTGACGATG CACCGGCTGG TGCGGGCCGC CGTGCACACC CACACCGGCC CCGACCAGGC CGACGAGATC GTCGCGACCA TGGCCCGGAT GCTGCGCGCG TCGCTGCCCG ACGCCGTCAC GGCCAACCCG GAAGCCTGGC CCGCCTGGCG GGAGCTGCTC CCGCACACGC TCGCCGTGCT CGACGCCACC GCCACCGACT CCACCGACGC CGACTCCACC GACGCCGACG GGCCGGAGGC CGACGGGCCG GAGGCCGACG GTCCGGAGGC CGCCTGGCTG GCGGAACGCT CGGCGGCCTA CCTGCTCGGG CAGGGCCGAG CCGATCAGGC CCTCCCCCTC GCGGCCCGCG CCCTGGCCGC CCGCGAGCGG CTCGACGGCC CGGTGCACGT CGACACCCTG 
GCCTGCCGGG AGACCCTCGC GCGGGTCGCC CTCGGCGCCG GACGGATCGA CGCCGCCGGC CGCCTGGCCG AACACACCGT CGCCGACCGC GAACGCATCC TCGGCCCCGA GCACCCCGAC ACCCTGGTCA GCCGGGACAC CCTAGCCCAG CTATTCCAGA AGGTCGGCCA GACCGACCGC GCGGTCGAGA TGTTCCAGCG CACCCTCGCC ACCCGGGAAC GCATCCTCGG CCCCGAGCAC CCCGACACCC TGGAAGGCCG GCACAGCCTC GGCCGCGCCT ACGACGCCGC CGGCCGCGAC GACGAGGCGG CCCGGCTGCT GCGGGACACC CTCGCCGACC AGCGGCGGAT CCTCGGCCCG GAGCACCCCG CCACCCTCGA CACCCGCCAC AGCCTCGCCG TCGCCTACCG CCGGATCGGC GCGGTGGGCG ACGCCGTGCC GCTGTTCGAG CAGACCCTCG CCGCTCGCGA ACGGGTCCTC GGCCCCGAGC ATCCCGCCGC CCTCGGCACC CGCCACCAGC TCGCCGTCAC CCATCACCAG GCCGGACGGC TCGACGACGC CACGCGCGAG TTCGGCCGCG CGCTGACCGG CCGCGAGCGC ACCCTCGGCC CCGACCACCC CGACACCCTC GAAACGGTCG ACGGACTCGC CCGCGCCCAC CTGTACGCCG GACGGCTCGA CGACGCCGCA CGCGAGTTCG GCCGCGCGCT GACCGGCCGG GAACGCGCCC TCGGCCCCGA CCACCCCGAG ACCACCGAGA CCCGCGAGAG CCTCGCCACC GCCCACCTCA GGCTGGGCCG GCCCGGCGAG GCCGTCCCCC ATCTCGAACG CGCCCTCGAC CGCCACGAGC ACGTCCTCGG CCGCGCGCAC CCTTACACTG TCGAGGCCCG GAACGCCCTC GCGACGACCT ACCGGCGCAC CGGGCAGCTT GAGGCGGCCG TGCCGCTGCT CGAACGGCTG CTCGCCGACC GGAGCCGGGC GCACGGCGTC GGCGACGCGC GCACCCTGCG CACCGCCGAC GGGCTCGCCG AGGTCTACCG GACCACCGGC CGGCTTCCGC AGGCCGTCGG CCTGAACGAA CGTGTCCTCG CCGTCCGCGA GCGGCTCCTC GGCCCCGGCC ACCCCGACAC CCGCGCCACC CGGGGCGCCC TCGCCGACAC CTACCGCCAG GCCGGCCGGC CCGCCGACGC CGTCCCCCTC TACCGCGCCG CGCTCACCGA CGCGCTGCGC GAGCACGGCC CGTTCCATCC GGACAGCACC CGGGCCCGCC GCACCCTCGC CGACACTGTC GGCGAGACCC GCCACGATCA GCCGCCGCCC CTCGAACGGC CGATCCCGGT GGAGCCGAGA TTCCGCCACC GCGACCCCTG A
|
Protein sequence | MTGGGDGHGA PDCFVSYADD GRAWAEWIAG TLEDAGYQVL LESWAPAGTH RVGWLDRAVS QARHTIAVVS DGYLRSSPAV AEWAAAWSAD PTGGERRLLV TRVADRPLPG LLGQLVPVDL FDRSEIAVRA SLLAALRGDR PQPEQPPGFP GRRGIFPPEL PAVWNVPDQP ARFVGRTAAL ARLDEALSRS PLVTVTGIAG IGKTSLVTEY VRQHRADLDA VWWVPAERPE LIGERVRELA PAVGLPGHAE PAAVIANLGR ADGRWLIVLD DAADADTLPD WLRPTEPGRL LLTSRNPGWD HLGPVVPVGP MDRAESVTLL AGRLPAVERT VADRIAERLG DHPLALDQAA HRISTGRTPA ETYLQVLIDR PEVLLAQGEV SGRPGATAAT LWDEPIRRLD AEAPAAGHLL RIVAHGDNEP IPIRLFTAEP DVIPDSGLRA AAGDPLALAD TVGVLERYGL AHRDADTVTM HRLVRAAVHT HTGPDQADEI VATMARMLRA SLPDAVTANP EAWPAWRELL PHTLAVLDAT ATDSTDADST DADGPEADGP EADGPEAAWL AERSAAYLLG QGRADQALPL AARALAARER LDGPVHVDTL ACRETLARVA LGAGRIDAAG RLAEHTVADR ERILGPEHPD TLVSRDTLAQ LFQKVGQTDR AVEMFQRTLA TRERILGPEH PDTLEGRHSL GRAYDAAGRD DEAARLLRDT LADQRRILGP EHPATLDTRH SLAVAYRRIG AVGDAVPLFE QTLAARERVL GPEHPAALGT RHQLAVTHHQ AGRLDDATRE FGRALTGRER TLGPDHPDTL ETVDGLARAH LYAGRLDDAA REFGRALTGR ERALGPDHPE TTETRESLAT AHLRLGRPGE AVPHLERALD RHEHVLGRAH PYTVEARNAL ATTYRRTGQL EAAVPLLERL LADRSRAHGV GDARTLRTAD GLAEVYRTTG RLPQAVGLNE RVLAVRERLL GPGHPDTRAT RGALADTYRQ AGRPADAVPL YRAALTDALR EHGPFHPDST RARRTLADTV GETRHDQPPP LERPIPVEPR FRHRDP
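The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This example is a hedged illustration, not the database's own pipeline: it builds the codon table for translation table 11 from the standard compact layout, treats an initiator codon at position 1 as methionine (which is why the leading GTG reads as M), and removes the spaces that separate the 10-base blocks. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares its amino acid
# assignments with the standard code and differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS; an initiator codon at position 1 is read as M."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = "GTGACCGGCG GGGGCGATGG GCACGGCGCT"
print(translate(head))  # MTGGGDGHGA
print(gc_content(head))
```

Passing the complete gene sequence to `translate` and `gc_content` in the same way allows a comparison with the Protein sequence and GC content fields.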