Gene Information |
Locus tag | Franean1_6583 |
Symbol | |
ID | 5674898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8009511 |
End bp | 8012456 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245434 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001510826 |
Protein GI | 158318318 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
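The coordinates and lengths in this record agree with each other if two common conventions are assumed: start and end positions are 1-based and inclusive, and the gene length includes the stop codon while the protein length does not. The sketch below checks that arithmetic. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 8009511, 8012456
length_bp = gene_length(start_bp, end_bp)
print(length_bp, "bp")                   # compare with "Gene Length"
print(protein_length(length_bp), "aa")   # compare with "Protein Length"
```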
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.800439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCA CACCTGGCAT CCCCGGGACG GCAAGCACCG CTAGCCCGGA GAGCAGCAAC AACCCGGCGA GCACCGGTAA CCCGGCGGAC GGGCGTCCGG ACCAGCAGGC TCAGGACCCG GGCGGCTTCT CCGGCGGCGT GTCCTCCGAC GCGCGCCACG CGGCCGTCCG CGCCGGTGTG CGCCGCCTCG GCGCGCTGCT CGGCGAGGCC CTCACCCGCC ACGAGGGCCC GGAACTGCTC GCGCTGGTCG AACGGGTCCG AATGCTCGCC CGCGGCGAGG ACGGCGGCAC CGAGCTCGCC GCCGTCCTCG ACGGCGTCGA CGCGCGGCAG GCGATCCTGC TGGCCCGCGC GTTCACCGCC TACTTCCAGC TCGCGAACAT CACCGAGCAG TTGCACCGGG CGCGGGAGAT CTCCGACCGC CCGCGCAGCC GGCTGCGCGA CCTCGCCGAC CGCATCGCCG AAGCGGTGGA CGAGGGTGTC GTCGACCCGG GCCTTCCGGC CGAGGTCATG CGGCGGCTGC AGCTGCGCCC GGTCTTCACC GCGCACCCCA CCGAGGCCAG CCGCCGTTCC GTCCTGGAGA CGCTGCGCGG CATCGCCGAC CTGCTCGACG CCACCGACGA CCCGCGCCGC CCCGGCGCCG ACGACGAGCG GCTGGCCCGC CGCCTCGCCG AGCTGGTGGA CGTCCTCTGG CAGACCGACG AGCTGCGGGT GGAGCGGCCG AAGCCCGCGG ACGAGGCACG CTCGGCCGCG TACTACCTCA CCTCGATCGC CACCGAGGTC CTGCCCGACC TGCTGGAGGA GCTCGACACC GCGTTCGCCG GGATCGGGGT CGAGCTGCCG GTGACCGCCC GTCCGCTGGC CTTCGGCTCG TGGGCCGGCG GCGACCGGGA CGGGAACCCG AACGTCACGC CGGAGGTCAC CCTGGAGGTA CTCAACCTCC AGTACGAGTT CGGCATCCGG GTCCTCACCG CCCTCGTCGA CCGGCTGGTC CGTGAGCTGA CGGCGTCCAC CCGGGTGGTC GGGGAGGTCA GCGGCGACCT GCTCGCCGCA CTGGCCGCGG ACCGCGCCGC GCTGCCCGAG GTGTACGACC GCTTCATCCG GCTCAACGCC GAGGAGCCGT ACCGGCTCAA GTGCAGCTAC ATCCTGGCGC GGCTGGCCGG CACCCGGGCC CGGCTGGCCG CCGGCGCACC GCACGTCCCC GGCAAGGACT ACCTGCGGCC CACCGACCTG GTGGACGAGC TGGAGCTCAT GCGGGTCTCC CTCGCCGCGA GCCACGGCGA GCTCGTCGCG AACGGGCCGC TGCGACGCGC GATCCGCACG GCGAGCGCCG TCGGCCTCTC GCTGGCCACC CTCGACATCC GCGAGCACGC CGACGCCCAC CACGCCGCCC TCGGGGCGAT CTACGACCAG CTCGCCGAGC TGGACGTCCC CTATCGCGAC CTGGACCGGC CGGCCCGGCT CGCGCTGCTG TCGGCCGAGC TGGCGGGGCG CCGGCCGCTG CTCGGCGCCG TCCCGCCGCG GCTGCCCGAG CGGGTGGCGC GCACCCTCGA ACTGCTCACC ACGGTGCGCC ACGCGCTGGA CCAGTACGGC GACGGCGTCA TCGAGAGCTA CATCGTCTCG ATGACCCGCG ACGCCGACGA CATCCTGGCC GTCGCCGTGC TCGCCCGCGA GGCCGGTCTG GTCGGCCTCG GCCGGCCACC GGGGACGTCC AGCGGCCCGG GAATCGGCAC GGAACGTGAC GGGCCCGCGG TGCCGGCGGT CGCGTGGGCG CGGATCGGGT TCGTGCCGCT GTTCGAGACC 
GTCGCCGAAC TGCGCGCCGC GGGGACGCTG CTCGACCGGC TGCTCGCCGA CCCGTCCTAC CGGGCGCTGG TCGCGGCACG CGGTGACGTC CAGGAGGTCA TGCTCGGCTA TTCGGACTCG TCCAAGGACG CCGGCATCAC CGCGTCCCAG TGGGAGATCC ACAAGGCACA GCGTGAGCTG CGGGACACCG CCCGCGCCCA CGGGGTCGTG CTGCGGCTCT TCCACGGCCG CGGCGGATCG GTCGGCCGCG GCGGCGGCCC CTCCGGCGAG GCGATCCTTG CCCAGCCGTT CGGCACCCTG GACGGACCGA TCAAGGTCAC CGAACAGGGC GAGGTGATCT CGGACAAGTA CACCCTGCCG CAGCTGGCCC GGCAGAACCT GGAGATCACC CTGTCGGCGG TGGTGGAGGC GTCCGTCCTG CACCGCACCT CCCGGCTCAC CGGACAGGCG CTCGCCGGCT GGAACGGGAC CATGACGGCG GTCGCGGACG CGGCGAAGGA CGCCTACCGC TCCCTCGTCG CCGACCCGGC CCTGGTTCCG TTCTTCCTGG CGGCGACGCC GGTGGAGGAG CTCGGCAACC TCAACATCGG CTCCCGCCCC TCGCGGCGGC CCGGCGGCTC CGGAGGGCTG GCGGACCTGC GGGCCATCCC CTGGGTGTTC GGCTGGACGC AGGCCCGGAT CATCCTGCCC GGCTGGTTCG GGGTCGGCTC CGGGCTGGCG GCGGCGCGCG AGGCCGGCGC GGGCGGGGAG CTGGCCGAGA TGTACCGCTC ATGGCATTTC TTCCGGACGT TTCTCGGCAA CGTCCAGATG ACCCTGGCGA AGTCGGACCT CGCGATCGCC CAGCGCTACG TCTCGGCGCT GGTCGACCCG GCCACGGCCG CGCTGTTCGA CGTCATCCGG GACGAGCACG AGCGGACGGT GCGGGAGGTG CTGGCCGTCA CCGGGCAGGC CGGCCTGCTC GAGACCGCGC CGGTGCTGCG CCACACGCTG CTGGTCCGGG ACGCCTACCT GGCCCCGCTG CACGCGTTGC AGGTCTCCCT GCTGGCCCGC ACCAGGGCGG CCGGTGACAT CGCCGACCCT CAGCTACGCC GCGGCCTCCT ACTGACCATC AATGGAATAG CCGCCGGCCT CCGCAACACG GGCTGA
|
Protein sequence | MTTTPGIPGT ASTASPESSN NPASTGNPAD GRPDQQAQDP GGFSGGVSSD ARHAAVRAGV RRLGALLGEA LTRHEGPELL ALVERVRMLA RGEDGGTELA AVLDGVDARQ AILLARAFTA YFQLANITEQ LHRAREISDR PRSRLRDLAD RIAEAVDEGV VDPGLPAEVM RRLQLRPVFT AHPTEASRRS VLETLRGIAD LLDATDDPRR PGADDERLAR RLAELVDVLW QTDELRVERP KPADEARSAA YYLTSIATEV LPDLLEELDT AFAGIGVELP VTARPLAFGS WAGGDRDGNP NVTPEVTLEV LNLQYEFGIR VLTALVDRLV RELTASTRVV GEVSGDLLAA LAADRAALPE VYDRFIRLNA EEPYRLKCSY ILARLAGTRA RLAAGAPHVP GKDYLRPTDL VDELELMRVS LAASHGELVA NGPLRRAIRT ASAVGLSLAT LDIREHADAH HAALGAIYDQ LAELDVPYRD LDRPARLALL SAELAGRRPL LGAVPPRLPE RVARTLELLT TVRHALDQYG DGVIESYIVS MTRDADDILA VAVLAREAGL VGLGRPPGTS SGPGIGTERD GPAVPAVAWA RIGFVPLFET VAELRAAGTL LDRLLADPSY RALVAARGDV QEVMLGYSDS SKDAGITASQ WEIHKAQREL RDTARAHGVV LRLFHGRGGS VGRGGGPSGE AILAQPFGTL DGPIKVTEQG EVISDKYTLP QLARQNLEIT LSAVVEASVL HRTSRLTGQA LAGWNGTMTA VADAAKDAYR SLVADPALVP FFLAATPVEE LGNLNIGSRP SRRPGGSGGL ADLRAIPWVF GWTQARIILP GWFGVGSGLA AAREAGAGGE LAEMYRSWHF FRTFLGNVQM TLAKSDLAIA QRYVSALVDP ATAALFDVIR DEHERTVREV LAVTGQAGLL ETAPVLRHTL LVRDAYLAPL HALQVSLLAR TRAAGDIADP QLRRGLLLTI NGIAAGLRNT G
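The protein sequence corresponds to the gene sequence translated with translation table 11, in which GTG can act as a start codon and is read as methionine at the first position. The sketch below illustrates this in plain Python. The helper `translate_cds` is hypothetical, the codon assignments are the standard ones (which table 11 shares for elongation), and forcing the first codon to `M` is a simplifying assumption rather than a full model of start-codon rules.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 uses the same
# assignments for internal codons and differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator and is emitted as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above, spaces included.
print(translate_cds("GTGACCACCA CACCTGGCAT CCCCGGGACG"))
```

Passing the full gene sequence from this record, with its 10-base spacing and line break left in, should work the same way because whitespace is stripped before translation.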