Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4186 |
Symbol | |
ID | 3907151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4995603 |
End bp | 4998494 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637881514 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_483263 |
Protein GI | 86742863 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCGG CTGGCACGGA TACCACCCAC GACGCCGGGA AGACGGTGGA CTTCTTCGTC TCCTACGCCG GCGTCGACCA GGCGTGGGCG GAGTGGATCG CGGACACCCT GGAGAGGGCT GGCCGCCGCG TGCTGCTCGA GGCATGGGAC TTCGTCCCCG GGTCGAACTG GCCTGCGCTC GTCCAGGCGG GACTGACCAC CGCAGGCCGG CTCCTGGCGG TGCTGACCCC GGCCTATCTC GCCTCGGTCG GCGCGGCGGC GCAGTGGCAG GCGATCTGGG CGGCGGACAT GGGCGGGCGG GGACGCCGGC TGATTCCGGT GCAGGTCGAG CCGTGCGAGC CCGACACGCT CGGCCTGCTC GGCAGCCTCA CCTGGATCGA CCTGACCGAC CTGACCGACC TGGCCGGGCC GGGCGCCGCC GCAGCCGCTC GCGCCCGGCT GCTGGACGGG ATACGTGCCG CCGTGACAGG GCAGGTGCGG TCGGTCAATC CCACGCCGCT CCCCACCCGC GACACCCGCG ACGTGCCCGC GGCGGCGGAC GAGTCCGAAC GGGCAGCGCG GTATCCGGGT CGGCCCCCGC TCGTGTGGGG GCTGCCGCCG GAACTGGTGC GTAACCCCCG CTTTGTCGGG CGTGCATCCG AACTCGCCGA GCTGCGCGCG CGACTGACGG GCTTCGACCG GGCCGTGCCC GGCCCCGGTT CCCCTGTGCC GGGGGTGCGG GCCCAGGTGG TGCATGGCCT GGGCGGGGTC GGCAAGACGG AGCTCGTCGT GGAGTTCGCC TACCGGTACA GCCACGCCTA CGACCTGGTC TGGTGGATCC GCGGCGACAA GATGACGGAG GTCGTCGCCG GCCTCGCCGC CCTCGCGGCC CAGCTGGGAC TCGCCCACGT GGAGGGGGAC GAGGCAGCGG CCGCCGCGGC GGCCGAGGCG TTGCGCGCCG GTCGACCTCA TCGCCGGTGG CTGTTGGTCG TCGACAATGC CGTTATTGGC AATGCCGTTA TTGGCAATGC CGTTATTGGC AATGCCGTCA TGGGCAACAA CGCCGGTGCT ACCGCCACCA ACCCCTACCT GTCCAGGCTT CTGCAGGCCG CGGAGACCGC CGAGTTCGGA CATCTCGTGA TTACCTCGCG CAACCCCGAC TGGGCCGGTC GGGCTACCGC AACCGAGGTC GCGGTGCTGC CCCGGAGGGA CACGATCGAT CTGCTGAGGA GGCACCGAGC CGCGCTGGAG GACGCCGAGG CTGAGCGGCT CGCGGCCGCG GTCGGGGACC TGCCGCTGGC GGCGGAACAG GCCGGAGCCT GGCTCGCCGC CAGCGGCATG ACCGTTTCGG ACTACCTGGC TGCGTTGCGC GTGGAGACTC GGGAACTCCT TACCCGCGGA AAACCGGACC GCTATCCGTT CATTGTCGCC GCGGCGTGGA ACCTGGCCCT GGACGAGATC GGACCGGACG AACCGGCCGT CGTCGAGCTG CTGCAGCTGG CCGGATTCTT CGGCCCGGAG CCCATCCCGC TCGACCTGTT TCCCGCGCTG GTGGTGCTGG GGGAGAGCAA CGGGGAGCTG GGGGAGAGCA ACGGGGAGGG GACGCTCTGG TCGGCGCTGA GCGCGGCCTG TTCCTCCCCG CTGCGCTGGG GGGACGTGGT CGCCCGCCTC CGTGCTCTCG GGCTGGGCAA GGTCGAACAG GGAACCATCA CCTTGCATCG GCTGGTGCAG GCGGTGCTGC GGGACCGGGT TCCGACGTCG CGGCACCTGG AGTTCCGGGC CACCGTCGGC TGGTTGATGA TCAGGGCCCT GCCCACCGCG ATCCAGGGTA ATCCGGCGGC CTGGCCGCGC TGGGCCGCGC TGCTCCCACA CCTTCAGGCC CAGATCGAAG CCGAGCCGTC CCCGGCGGAC GCGAACGCGG CGGGCATGCT GGGGCTGGGC AATCTGGCGG CGCTCTATCT CCAGGAAACC GGACAGCTCG CCGCCGCCGT GGACCTGCAC ACCTGGGTGC TGGCTCGAAC CGAGCAGGTC GTGGGAACCG ACCACCCCAA CACCCTGGTT CTCCGGAACA ACCTCGCCCT CGCCCTGCAG AAGGCCGGAC GCATTTCCGA GGCCATCGGG TTGTACGAGC GGATCCTGGC GCGGGCCAGC CGCATCCTCG AATCCGATGA TCCCAATCTG GGGATCAGCC GTAACAATCT CGCCAGCGCC TATTGGGCCG CCGGACGCAT CGCGGAGGCC GTCGAACTGC TCGAGCAGGT TGTCGACGAC GCTGGACGGA TCCGTGGCCC GGACCATCGC GACACCCTGC TGGCCCGGAG CAACCTGGCC AACGCATACC AGACGGCGGG GCGCTTGGCG GAGGCGATCA GGCTGCACGA AGAGGTTCTC GTCGACGTCG TCCGGGTTCT CGGCCAGGAG CATCTCACGA CCTTCCTCGT GCGCAACAAT CTCGCCAGCT CCTATCAGGC CAGCGGACGG ACGACTGAGG CCCTCGACCT GTACGAGCAG GTACTGACCG GCCGGGAAGG CGTTCTCGGC GACAACCATC CCGACACCCT GCTCTCCCGC AGCAATCTCG CCGACGCCTA CCAGGCGCTC GGACGTCTGC CCGAGGCGAT CGATCTGTAC GAGCGGGTTC TCTCCGATGC CCGCGGTGTT CTCGGCGCTG ACCATCCACA CATCCTGATC ACCTGGAACA ACCTGGCCTG CGCCGTTGCG GCCGCGGGGC GCCTCGCGGA GGCGATCGAC CTGTACGAGC GGGTACTGGC CGACCAGCGG CGTGTCCTCG GACCGGATCA CCGCGACACC CTGACCTCGC AGGGAAACCT CGCCAACGCC TACCGGGCCG CCGGACGCAC GAATGCCGAC GCCGCTGATG GTGCCGACGC CGCTGATGGT GCCGGGGACT GA
|
Protein sequence | MNPAGTDTTH DAGKTVDFFV SYAGVDQAWA EWIADTLERA GRRVLLEAWD FVPGSNWPAL VQAGLTTAGR LLAVLTPAYL ASVGAAAQWQ AIWAADMGGR GRRLIPVQVE PCEPDTLGLL GSLTWIDLTD LTDLAGPGAA AAARARLLDG IRAAVTGQVR SVNPTPLPTR DTRDVPAAAD ESERAARYPG RPPLVWGLPP ELVRNPRFVG RASELAELRA RLTGFDRAVP GPGSPVPGVR AQVVHGLGGV GKTELVVEFA YRYSHAYDLV WWIRGDKMTE VVAGLAALAA QLGLAHVEGD EAAAAAAAEA LRAGRPHRRW LLVVDNAVIG NAVIGNAVIG NAVMGNNAGA TATNPYLSRL LQAAETAEFG HLVITSRNPD WAGRATATEV AVLPRRDTID LLRRHRAALE DAEAERLAAA VGDLPLAAEQ AGAWLAASGM TVSDYLAALR VETRELLTRG KPDRYPFIVA AAWNLALDEI GPDEPAVVEL LQLAGFFGPE PIPLDLFPAL VVLGESNGEL GESNGEGTLW SALSAACSSP LRWGDVVARL RALGLGKVEQ GTITLHRLVQ AVLRDRVPTS RHLEFRATVG WLMIRALPTA IQGNPAAWPR WAALLPHLQA QIEAEPSPAD ANAAGMLGLG NLAALYLQET GQLAAAVDLH TWVLARTEQV VGTDHPNTLV LRNNLALALQ KAGRISEAIG LYERILARAS RILESDDPNL GISRNNLASA YWAAGRIAEA VELLEQVVDD AGRIRGPDHR DTLLARSNLA NAYQTAGRLA EAIRLHEEVL VDVVRVLGQE HLTTFLVRNN LASSYQASGR TTEALDLYEQ VLTGREGVLG DNHPDTLLSR SNLADAYQAL GRLPEAIDLY ERVLSDARGV LGADHPHILI TWNNLACAVA AAGRLAEAID LYERVLADQR RVLGPDHRDT LTSQGNLANA YRAAGRTNAD AADGADAADG AGD
|
| |