Gene Information |
Locus tag | Franean1_6784 |
Symbol | |
ID | 5675097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8261621 |
End bp | 8266201 |
Gene Length | 4581 bp |
Protein Length | 1526 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245633 |
Product | TPR repeat-containing protein |
Protein accession | YP_001511024 |
Protein GI | 158318516 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
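
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 8261621
END_BP = 8266201
GENE_LENGTH_BP = 4581
PROTEIN_LENGTH_AA = 1526


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```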
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACA GCTTCGACGT GTTCGTTTCC TACGGGCATG ACGATCAGGT GTGGGTGCGG GCGCTGGCGG AGAACCTGGA ACGCGCCGGC CTGCACGTTT TCTACGACGA GTGGGAGATC CTCGCCGGGG ATGTGCTGGC CCACCGGCTC GACGCCGGCG TTCTGGGCTC AACGTCAGGG ATCCTGGTGG TCAGTCCGCA TGCGCTGTCG CGGCCGTGGG TGCGCGAGGA GTATGCGGCG ATGCTGACCC GGGCGGTCGC TGGCCAGCAG CGGCTGATCC CGGTGATCCT TGAGGATGCG GAGTTGCCGG CGTTTCTGGC GTCGCGGGTG TGGGTCGATT TCCGTGACAC CCACGACCCG GCGACCTACA CAGTCCGGGT CGGGGAGCTG GTGGCGTCGT TGCGTGGCCA GCGGCGGTCC CGGCCGGCTG CGGACGGCAC CGTACGCCCT CCACCAGGGC AGGCGTATGT GGCCGAAGGG CCCCGCGTTG CAACGTTGCG GATCACCCCG GACCGGGTGG AACTGCATAC CACCGGCGGG ACCGTGGAAG GCCGCCCAGT CGGGATGGAC TGGTCAGCAG GTCAGGTACT GGGCGAGCTG GAACGAGTCC GGACCCGCAG AACGGCCAGC GGGCTGGCCC TGAAAGCGCC AGCCGCTGGC GGTGTGGCCG CCGATCCCCT CGACGGGGCG CTGCGGGCTG TCGGGCGGAT GCTCGGCGAA CGGTTCCTGC CCGCCGACGT CGCGGCCGGC CTCGCGGGTG AGGTCGACGC GGCGGTACGC GGCGGGGTAT CCCTGCAGCT GGCGGTGCAG ACAGCTGATG GGGTGGCGGA TCTGCCGTGG GAGACCCTCA CCCTACCCGC GCACACCCGC CCGCTGGTGC TGCACCCGGG CGTGCAGCTG TACCGGGCGG TCGTCGGGTT GGGAGCGACA CCGGCGATGG GGATCCGGGG GCCGCTACGG ATCCTCGCGG TGATCGCCAG CCCCGACCAC GGTGGCGGGG CGCTGCTGGA CTACGAAGCT GAGCTCGCGC GGATCCTTGA CGCCGTCGAC CCGGCCCGCC GCAAAGGCGA GGCCTATGTG CGGATCTTGA ACGGGGGTAG CCGGGAGGCG ATCCGGGCAG CCCTGTTGCA GGAGCGGTTC CACGTCCTGC ACCTGTCCTG CCACGCCGAG CCCGGCCGCC TCGTGTTGGA AGCCGAGGAC GGCGCCGCCG ACCTTGTCGA CACCCAGAGG TTCGTCGAGG AGGTCCTCGT CGTCGGGCGG GGTGTGCCCC TGGTGGTGCT CGCTGGCTGC TCCACCGCAC TGCATAGTGG CAGCGCCGTG GACCCGAACG AGGAACAAGC GGGGGGTGGC GCGGGGTTGC TGCCGGGGTT GGCCCGGGGG CTGCTCGCGC ACGGGGTGCC CGCGGTGTTG GCGATGACCG CCCCGGTCAG CGATCTGTAT GCCACGGATC TGGCCTCGCG GCTGTATGGC GAACTGGCCG GACGCGCGGT GGTCGCTCCC CTCGCCGCGC TGACGGACGC CCGCCAGGCG GTAGAAACCA GCCGGGTGGG CCTACCCGCC TCCGATCGGC GGGCGGGGCT TGCCGAATGG GCCACCCCCG CGCTGTTCCT CCGCGGCCCG GTACTGCCGT TGTATGACCC GGCTGCGGGT TTCGACACGA TCACCGAACC GGTTACGCCG GTGCTGGCGG AAGGGATCGT GGTCCGCCGG GTCGGGGAGT TCGTCGGCCG CCGCACCGAA CTCCGCACCC TGCACCGCGC CCTCCACGGC GAAGGCGCCG GGGTTGTCCT GCACGGCATC 
GGCGGCGTCG GGAAATCAAC CCTCGCCGCA CAACTCCTCA CCGACCTCGG CGACGACGCA GGCCTGGTCG TATCGCTGAC CGGGCCGGTC GCGGTCGATC AGATCCTCGC CGCGATCGCC GCCCGGCTGG CCTCCTGGTG CTTAAAGCAC AACATCAACA TCGACGATGT ACGCCGGCGC CTGATCGATG CCCTCCGCTC CGGGGCCCCG TGGCAGGAAC GCGTCGCGCT GCTCGCAGAG CATCTGCTGC CGACAGTGCC GGTCACGCTG TTCCTCGACA ACGCCGAGGA CACCCTCACC CCCCACGGTG TCGGTCACCG GTTCACCGAC CCGCAGCTGA ACGCGTTCCT GGCCGCGTGG GTCGGGCTGC CGGGGCGGGC CCGGCTGCTG GTCACCAGCC GCTACCCGCT GCCCGTCGAC ACCGCTACCG CCCACCGGCT GACCGTGCAC CACCTCGGCC CGCTCTCGGT CGCGGAAGCC CGCAAACTGT TGTGGCGGCT CCCCGCCCTC GACGCACTCA CCGCCAACGA ACAGGCCCGC GCGATCGCGG ACGTAGGCGG GCACCCCCGC ACCCTGGAAT ACCTCGACGC GCTCCTCGCC GGCGGCCACG CCCACTTCCC CGAGATCGCC ACCCGCCTCG AGAACACCCT GAAAACCCGC GGTGTCACCG ACCCCGCCGC CTGGTACACC ACCCTCGCCG GGAACATCGA TCGGGCGCTG GCCGAAACCA TCACCCTCGC CGTCGACGAC ATCGTCCTCG ACACCCTCCT CACCCGCCTC GACACCATCC CCCTCGCCCG GCGGCTGCTG ACCGGCGCGG CGGTCTACCG CCTACCCGTC GACCGCAACG GCCTCGCCTG GCAGATCTCC ACAACCGTCG AACCGACACC CGACTCCGCC CAGGACCCGG ACATCGCCAG GATTCTCAAG TTTCTCGCGG AGGCTCGTAC CAGCGGGGTG ACCGCCCTGG AAGACGCCGG CCTCACCGAT ACCCAGATCC AGAGGTATCA GCGGTGGGTG GAACAGCAGC GCCGGCCACC ACTGGCCATC CCCCAAGAAT TCGACACCGC CCTGACCAGC CTGCTCGAGC TCGGACTCCT CTCCCCCACC CGCGATACCG GCGGGCAGCT GGCGTTCACC GTGCACCGCT GGACCGCCAC CGCCCTCCTG TCCCGCGCCA CCCCCCACGA CCACACCGAC GCCCACCACC GGGCCGCCGC CTACTGGGAA TGGCGGGTCG CAGTCTGGCC TCAGGACCAG TACACCGACG TCACCCAGCT GTTGGAGGCC GGCCATCACC ATCTCACCAC AGGCGCTCAC GACCGCATCG ACCAAACCAC CGGCGCCGCC TGTCAGCAGC TACACACCTG GGGCCTGTGG GCTTGGGAAG AACAGATCTG CCGCCACTCC CTGAACCGCC TTCCCACCAC CGACACCAGC CGGCTTACGG CCGCGTTCAC CCAGCAACTC GGCATGATAG CTCAGCTATG GGGGGACTAC ACTACCGCCG AAAATCGCTA CCACAAATCC CTCGCCATCT TCCAAGAGCT CGGAGACCAA GCTGGCATCG CCAACTGTTA CCACCAGCTC GGCAGGATCA CCCAAGAGCG GGCAGACTAC ACCACCGCCG AAAACCGCTA CCAAAAATCC CTCACGATCC GCGAGGAGCT CGGAGACCGA GCCGGCATCG CCACTAGTCA TCACCAGCTT GGCATTATTG CCCAAGACCG AGGGGACCAC ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTTACCATCT TTCAAGAGCT CGGGGACCGA GCTGGCATCG 
CCGGCAGCTA TGGTCAACTC GGCATGATCG CCGAGCTTCG AGGGGACTAC ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTCACGATCC GCGAGGAGCT CGGAGACCGA GCCGGCATCG CCACTAGTCA TCACCAGCTT GGCATTATTG CCCAAGACCG AGGGGACCAC ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTCACGATCC GCGAGGAGCT CGGAGACCGA GCCGGCATCG CCACTAGTCA TCACCAGCTT GGCATTATTG CCCAAGACCG AGGGGACTAC ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTTACCATCT TTCAAGAGCT CGGGGACCGA GCTGGCATCG CCGGCAGCTA TGGTCAACTC GGCATGATCG CCGAGCTTCG AGGGGACTAC ACCACCGCCG AAAACCTCTA CCAACGGACC CTCACTATAT TTAGAGAGCT CGGGAACCGG GTCGGAATTG CCACCAGCTA TCAGAGGCTC GGCATCGTGG CCGAACTGGT AGAGGACTAC GCGACCGCGG AAGAAAACCA CCAGAAATCC CTAGCCATCT TCAAAGAGCT CGGACACCGG ATGGGCATCT CCACTAGCCA TCTCCAGATC GGCGTCATCG CCCAGAATCA GGGAGATTAC ACCACCGCCG AAAAACGCTA CCAGAAAGCA CTCACCATCT TCCAAGAGCT TGGGAAGCGA GCCGACATCG CAGCCACCGC CTCCCAAATC GGCGTTCTGT ACACCACTCG GGACCGCATC GCCGATGCCA TCCCGTACAA CCTAACGGCA CTATCCATTC GCCTCAGCAT CAAATCACCC GAATCGGCCA TCGACATCCA TTGGCTGAAA AAGCAACGCG AAGCACTCAC CGACGCCGGT TTTCGGGCAG CCCTTGCCAA CCATGTATCC CACAATGATA TCAACACCAC CATCGATTTC CTCGACTCGA TCCCCGAATA G
|
Protein sequence | MTDSFDVFVS YGHDDQVWVR ALAENLERAG LHVFYDEWEI LAGDVLAHRL DAGVLGSTSG ILVVSPHALS RPWVREEYAA MLTRAVAGQQ RLIPVILEDA ELPAFLASRV WVDFRDTHDP ATYTVRVGEL VASLRGQRRS RPAADGTVRP PPGQAYVAEG PRVATLRITP DRVELHTTGG TVEGRPVGMD WSAGQVLGEL ERVRTRRTAS GLALKAPAAG GVAADPLDGA LRAVGRMLGE RFLPADVAAG LAGEVDAAVR GGVSLQLAVQ TADGVADLPW ETLTLPAHTR PLVLHPGVQL YRAVVGLGAT PAMGIRGPLR ILAVIASPDH GGGALLDYEA ELARILDAVD PARRKGEAYV RILNGGSREA IRAALLQERF HVLHLSCHAE PGRLVLEAED GAADLVDTQR FVEEVLVVGR GVPLVVLAGC STALHSGSAV DPNEEQAGGG AGLLPGLARG LLAHGVPAVL AMTAPVSDLY ATDLASRLYG ELAGRAVVAP LAALTDARQA VETSRVGLPA SDRRAGLAEW ATPALFLRGP VLPLYDPAAG FDTITEPVTP VLAEGIVVRR VGEFVGRRTE LRTLHRALHG EGAGVVLHGI GGVGKSTLAA QLLTDLGDDA GLVVSLTGPV AVDQILAAIA ARLASWCLKH NINIDDVRRR LIDALRSGAP WQERVALLAE HLLPTVPVTL FLDNAEDTLT PHGVGHRFTD PQLNAFLAAW VGLPGRARLL VTSRYPLPVD TATAHRLTVH HLGPLSVAEA RKLLWRLPAL DALTANEQAR AIADVGGHPR TLEYLDALLA GGHAHFPEIA TRLENTLKTR GVTDPAAWYT TLAGNIDRAL AETITLAVDD IVLDTLLTRL DTIPLARRLL TGAAVYRLPV DRNGLAWQIS TTVEPTPDSA QDPDIARILK FLAEARTSGV TALEDAGLTD TQIQRYQRWV EQQRRPPLAI PQEFDTALTS LLELGLLSPT RDTGGQLAFT VHRWTATALL SRATPHDHTD AHHRAAAYWE WRVAVWPQDQ YTDVTQLLEA GHHHLTTGAH DRIDQTTGAA CQQLHTWGLW AWEEQICRHS LNRLPTTDTS RLTAAFTQQL GMIAQLWGDY TTAENRYHKS LAIFQELGDQ AGIANCYHQL GRITQERADY TTAENRYQKS LTIREELGDR AGIATSHHQL GIIAQDRGDH TTAENRYQKA LTIFQELGDR AGIAGSYGQL GMIAELRGDY TTAENRYQKA LTIREELGDR AGIATSHHQL GIIAQDRGDH TTAENRYQKA LTIREELGDR AGIATSHHQL GIIAQDRGDY TTAENRYQKA LTIFQELGDR AGIAGSYGQL GMIAELRGDY TTAENLYQRT LTIFRELGNR VGIATSYQRL GIVAELVEDY ATAEENHQKS LAIFKELGHR MGISTSHLQI GVIAQNQGDY TTAEKRYQKA LTIFQELGKR ADIAATASQI GVLYTTRDRI ADAIPYNLTA LSIRLSIKSP ESAIDIHWLK KQREALTDAG FRAALANHVS HNDINTTIDF LDSIPE
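
Both sequences are displayed in space-separated blocks of ten characters, and the gene sequence is wrapped over several lines. The following is a minimal sketch of how such a field could be joined back into one string and its GC fraction computed; the helper names are hypothetical, and the short example uses only the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(field: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a blocked sequence."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two ten-base blocks of the gene sequence, as displayed.
example = "GTGACCGACA GCTTCGACGT"
joined = join_blocks(example)
print(len(joined), gc_fraction(joined))
```

Applied to the full gene sequence field, `len(join_blocks(...))` would be expected to match the Gene Length entry and `gc_fraction` to be close to the listed GC content, provided the field was copied without loss.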