Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0791 |
Symbol | |
ID | 5669207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 918872 |
End bp | 922693 |
Gene Length | 3822 bp |
Protein Length | 1273 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641239719 |
Product | O-antigen and teichoic acid-like export protein |
Protein accession | YP_001505155 |
Protein GI | 158312647 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.750231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.326449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGGCG TGACCGATCT CGGTATGGAG CCGAGGGCGG CCGGCGCGGA CGACGCGGAC CTCCCGCCCG AGCCGCCCGC CGGCGACGCC GACACCACGC AGTACGTGCC GCGCGCCCCG CGTCAGCCGC GCACCGGGCC GACCGGCCGG AACGACGAGC CCCAGCCCCG GCCTGGCGCG ACCGATCCCC GGCAGCAGGG GGTCCACGTC GGCACTCCGC TGCCGCGGAT CGTCCTGCCG GGCACGCCCG ATCCGCGGGG CGCCGCCGAT CCCCGCGGTG GTGCTGACCA GTGGGGTGGT GCTGACGGGC GGGCTGGAAC TGATCCGCGC GGGGGGCCGG AGCAGCACCG GCCGGCTCCC GAGCCGTTCG TGGTGGAGAG CCGATCCGGC TACCGGGACG GGCGCACCCC ACTCCCCGGG CGCACTCCAC CAGCCGGGCG TACTCCGCCC CCGGGCTACC GGCCGGAGGC GGGTCACACT CCGCCCCCGG GGCGCACGCC AGCCCCGGGG CGCTCCGAGC CGTACGGGCG TACCCCATCC TCTGGCGCGA CACCGTGGCC AGGCCATGCC GCGCCACCCG CGCACACCCC GCCGCCGGGA CGCACCCCGC CGCCGGGCGG CGCCGCGCCG TTCGAGCGGG CTCCGGCCCC GGGGAACACC CCACCGGCCG GGCAGGAGCC TCCCGGTCAC ACACCGGCAG GGCACGTGCC TCCCGGGCGC ACGCCGTCCG GGCGCACGCC GCCGCCCGGC TACACCCCCC GTCCGGGCGG TACGCCCCTG CCGGGCGTCC GGCCGCCGGA CGGGACGCCG CCGAGCCAGG GCGCGGCGCG CGCGCTGACG CCGCCGCCGG GCTGGACCCC CATGCCCCGG CACTCCGCTG GCAACACGCC CGCCCCCGGC GGCACCCCAA CCGACCCTGC TGCCCGGCAC GACCAGCCGG GACAGGGCGA GCGGGGCGCC GCGCAACGGG CGGGCGGGGC GCCCCCGTCC GGCGGATGGG TTCCGCCCGA GACCACCGTC CGCATGGGCA CCACCGGGCC GCAGGCACCG CCTCCGCCGG GAGGCGTGGC CCGTCAGCCC ACGCCGAGGC CCGCTCCGCC GAGCGGCTTC ACGGTGGGCA CGCCACTGCC GGGGCGCCCG GCCGGCACCC CGCCGCCCGG CGCCGGGCCG GCGCAGACTC CACCACCGCA GATGTCACCA CCGCGGACCT CGCCACCCGT CCCGTCGCCC GGCCTGCCGC CGGCGGGCCG GCGGGGCGAT GACTTCATGC GGGTCGACGT CGCCGGCACG GGTGCCACCG CACCCCCGCC GGCGACCCAG CTCCCGTCCA CACCGCCCGC AGCCCCCCAG GCCCCGGCCG CCCACCCGCC TGCCGGGCAG TCGCCGGCGG ACCAGCGCCA GGCCCCGCCG CCGCCCGCCG CACCGCGTTC CGTGGGAGTC CCCCCCGAGG GGCGCCCTGC TGGCGCGGCC GGCGCGGGGG ACACGCGGCG ATCGGGGCGC GGATGGTCGG CGACGGGGCG CCGGGAGCCG CGCCCACCGG GCCGGGGCGG CGGCGCGGCC GGATCGGACC AGACCGCGCT GACCAGCCGG GCCGGCGACG GCCGTGGCCG GCAGGGCCGT GGCGGGCGCC GCGGGCGGCA GGGCGCGGCC CGCAACCCCG ACATCCACTT CATCAACGCG TTCACCGACA CCATCGACCT GTCGTCGCTG CGGCGCCGGC TGGAGATGGA GGACGGTGCC GCCGCCGTCG ACCAGACGGA CTACATCCCG CGTCTCTACG GCCGTAACCG GCCGGAGTCC GACGCCGCGA GCACGCAGGT CATGCGGCGC TCGGGGCCCG GAACGGGGCC GGGGCAGGCC GGTGGCCGGT GGGCCGGCCC GGCCGGCTCC GGGAGCGCGG CCGAAGCGGC CGCGGGCGAG GGCGACCCGG CGGACGGAAC CGTCCGGCCC GGCCGGGCGC GCCACGCACG CGGCGGAGGT TCCGGCACCG GGGCGCTGGA GGTGGCCGAG CAGCGTCCAG GCACCGGGCC GGCCGCCGCC GAGGAGGCCG TCGTAGGCGG CGGGTCGACG AAGGCGAAGG CGATCTGGAC CCTCGCCGAC CAGGGCGTCT CCAGCGCCAC CAACGCCGCC GTCTCGCTCC TGATCGCCCG GCAGGTGAGC TCGTCCGAGT ACGGCTCGTT CGCGATCGCC TACATCATCT TCTCGGTGAT CATCGGGATC TCCCGGGCGG GTGGCTGCCT GCCGCTGGGG ATCTCGTACT CGGGCAAGTC GGTGTCCGCG TTCAGGTACG CGGCCGCGTC CGCCACCGGA GCGTGCCTGG TCTTCGGTGG ATTGCTGGGG ATCGTGCTCG TCGGCGTGGG GGCCGTCGCC GGTGGGTCGG TCGGCTCCGC CCTCGTCGTG GTGGGCTTCG TCCTGCCCGG CCTGCTGCTC CAGGACGCCT GGCGCTACGT GTTCTTCGCG ATGGGCAAAC CCCTCGGCGC GTTCCTGAAC GATGTCGCCT GGGCCCTGGT GCAGGTGGTC GGGCTGACGG TGCTCATCGA GCGGGGCGTG ACGGCGTCGC CGCCGATGCT GCTCGCCTGG GGCATCTCCG CGCTGGTCGC GGCGCTGCTG GGGGTCGCGC AGGCCGGTCT GTGGCCGGCG CCGTCGCGGG CGTTGACCTG GGTCCGGGAG AACCGGGCCA ACGCGGCCAA CCTCGCGGCC GAGTTCGTCA CCGTCCAGGG CGCGCTGCAG GCGTCGATGC TGCTGATCGG GCTGCTCAGC TCCAAGGAGA CGATCGGGGC GCTGAACGGC GTGCGGACGC TGCTCGGCCC GACCACCGTC ATCGGTGTGG GCATCGTCAG CTTCGCGGTG CCGGAGCTCT CCCGCCGCAT CGACATGTCC GTGCGGGCGC GCGAGCGCGC GGCGGTGCTG CTCACGGTGA TCGTGGTCGG CGTCGGCGGC CTGTGGAGCC TGCTGTTCAT CGCATTCCCG GCCATCGGGG AAACGCTGCT CGGTGACACC TGGCCCGGGG CGCACCACAT CCTCGTGTAC TCCGCGTTCC ACTACGCGGG GACGGCGCTG CCGACCGGGC CGGCCTGCAT CATGTACGCG CTCGGCCGAA CCAAGATCAC GTTTCGGATC AACCTGTCCA TGGCGCCGAT GCTCTTCGCC TTCCCGATTC TCGGCCTGCT GCTCGCGGAC GCCACGGGCG CGGTCATCGG GTACAACCTC GTGTTCTGGG GCATCGCCCC GGTCTGGTGG ATCCTGCTGC GTCGGATCGT CCGCGAACAC GCACGCGGAC GACCGGACGC CCGCGACGCG GCCGAGATGT CCGTCCAGGT CGGGGCGCCC GAACAGGCCG AGACGTTCGA GGAGATCGCG ACGTCCGACC GGGCCGAGAG GTTCGATCAC ACCGGGACGT CCGACCGGGC CGGCCTGCCG GCCGGGAACG TGCGGGTCGT CGAGCCCTCC GGACGGGAGG GCCCCGGCCG GGAACGGGCC CAGGTGTCGA GGCTGCCCGC GCTCGCGCCG CCACCACCAT CCGGTCGTCC GAGCCGCGGC GAGGCGGTAC ATGCCAGGCC GGCACACGGC GGCGAGGCGC ACGGAACCTC CGTGCGCGGC GGCGCGGGTG CTGGGGGCGC GGGCTCCGGT GGGGCGGCGC AGGAGGACGT CGAGCGGACC ACGGTTCTGC AGGGCCCGCC CCCGCGCGGA AGCTCGGTTG ATGTGTCGCC GCAGGCCGGC GAGCGGGACC GCCGTGACCC TGAGCCGCTG GACCTGCTCG AACCGGAACA GCGCGGCGGT GCCGACGGAC CGCGAGGCGG CGGGCGAAAC CCTGGCCGCT AG
|
Protein sequence | MSGVTDLGME PRAAGADDAD LPPEPPAGDA DTTQYVPRAP RQPRTGPTGR NDEPQPRPGA TDPRQQGVHV GTPLPRIVLP GTPDPRGAAD PRGGADQWGG ADGRAGTDPR GGPEQHRPAP EPFVVESRSG YRDGRTPLPG RTPPAGRTPP PGYRPEAGHT PPPGRTPAPG RSEPYGRTPS SGATPWPGHA APPAHTPPPG RTPPPGGAAP FERAPAPGNT PPAGQEPPGH TPAGHVPPGR TPSGRTPPPG YTPRPGGTPL PGVRPPDGTP PSQGAARALT PPPGWTPMPR HSAGNTPAPG GTPTDPAARH DQPGQGERGA AQRAGGAPPS GGWVPPETTV RMGTTGPQAP PPPGGVARQP TPRPAPPSGF TVGTPLPGRP AGTPPPGAGP AQTPPPQMSP PRTSPPVPSP GLPPAGRRGD DFMRVDVAGT GATAPPPATQ LPSTPPAAPQ APAAHPPAGQ SPADQRQAPP PPAAPRSVGV PPEGRPAGAA GAGDTRRSGR GWSATGRREP RPPGRGGGAA GSDQTALTSR AGDGRGRQGR GGRRGRQGAA RNPDIHFINA FTDTIDLSSL RRRLEMEDGA AAVDQTDYIP RLYGRNRPES DAASTQVMRR SGPGTGPGQA GGRWAGPAGS GSAAEAAAGE GDPADGTVRP GRARHARGGG SGTGALEVAE QRPGTGPAAA EEAVVGGGST KAKAIWTLAD QGVSSATNAA VSLLIARQVS SSEYGSFAIA YIIFSVIIGI SRAGGCLPLG ISYSGKSVSA FRYAAASATG ACLVFGGLLG IVLVGVGAVA GGSVGSALVV VGFVLPGLLL QDAWRYVFFA MGKPLGAFLN DVAWALVQVV GLTVLIERGV TASPPMLLAW GISALVAALL GVAQAGLWPA PSRALTWVRE NRANAANLAA EFVTVQGALQ ASMLLIGLLS SKETIGALNG VRTLLGPTTV IGVGIVSFAV PELSRRIDMS VRARERAAVL LTVIVVGVGG LWSLLFIAFP AIGETLLGDT WPGAHHILVY SAFHYAGTAL PTGPACIMYA LGRTKITFRI NLSMAPMLFA FPILGLLLAD ATGAVIGYNL VFWGIAPVWW ILLRRIVREH ARGRPDARDA AEMSVQVGAP EQAETFEEIA TSDRAERFDH TGTSDRAGLP AGNVRVVEPS GREGPGRERA QVSRLPALAP PPPSGRPSRG EAVHARPAHG GEAHGTSVRG GAGAGGAGSG GAAQEDVERT TVLQGPPPRG SSVDVSPQAG ERDRRDPEPL DLLEPEQRGG ADGPRGGGRN PGR
|
| |