Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6501 |
Symbol | |
ID | 5674816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7905379 |
End bp | 7908762 |
Gene Length | 3384 bp |
Protein Length | 1127 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245349 |
Product | pyruvate carboxylase |
Protein accession | YP_001510744 |
Protein GI | 158318236 |
COG category | [C] Energy production and conversion |
COG ID | [COG1038] Pyruvate carboxylase |
TIGRFAM ID | [TIGR01235] pyruvate carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.697042 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCAAGG TACTGGTGGC AAACCGCAGT GAGATCGCGG TTCGGGTGTT CCGGGCCGCG CACGAGCTCG GGTTGCGGAC GGTGGCTGTC TACACCCCCG AGGATGTCTC CGCGCTGCAC CGCACGAAGG CGTCCGAGGC CTACGAGCTG TGCGAGCCGG GTCATCCGGT GCGTGGCTAC CTCGACATCG ACGCGTTGTT GCGGGTCGCC AAGCAGGCGG ACGCCGACAC GCTGCACCCG GGTTACGGGT TCCTGTCGGA GTCGGCGGTG CTGGCGGAGG CATGCGCGGC GGCTGGCGTG ACGTTCGTCG GCCCGCCGCC GGCTGTGCTG CGGCTGACGG GTGACAAGGT GGCGGCCCGT AACGCGGCGA TCGCCGCGGG GCTGCCGGTG CTGCGGGCCT CGGAGCCGCT GCCGGACGGC GTCGGCGCGC AGGAGGCCGC GGCCGAGGTC GGGTTCCCGC TGTTCGTGAA GGCGTCGGCG GGTGGTGGCG GGCGCGGGCT GCGCCGGGTG CTGCGTCCGG AGGACCTCGC GGAGGCGGTG GCGAGTGCGT CGCGTGAGGC GGCGGCCGCG TTCGGGGACG GCACGGTCTT CCTGGAGCAG GCGGTGGAGC GGCCGCGTCA TGTCGAGGTG CAGGTCTTCG CCGACGGCTA CGGGAACGTG ATCCACCTGT TCGAGCGGGA CTGCTCGGTG CAGCGCCGGC ACCAGAAGGT CGTCGAGATC GCCCCGGCGC CGGCGCTGGA CGAACCGATC CGCGACGCCC TGTGCGAGGC CGCGGTGCGC TTCGCCCGTC ACGTCGGCTA TGTGAACGCC GGGACGGTGG AGTTCCTCGT CGGGGCCGAC GGCGCGTTCA CGTTCATGGA GATGAACCCG CGCATCCAGG TCGAGCACAC GGTGACCGAG GAGGTCACCG GTGTCGACAT CGTCGCCGCG CAGCTGCGGG TCGCCGCCGG GGAGAGCCTG ATCGACCTGA ACCTCACCCA GGACGCGATC GCTCTTCGGG GCACCGCGAT CCAGTGCCGG GTGACGACCG AGGATCCCTC GGACGGCTTC CGCCCCGACA CCGGGACGAT CACCTTCTAC CAGTCGCCGG GCGGGCCCGG GGTCCGCCTC GACGGTGCCA CCTACCCGGG GGCGGAGGTC AGCCCGTACT TCGACTCGCT GCTGGTGAAG CTGACCGCGC GTGGTTCGAC GCTGGAGAAG GCGTCCCGGC GCGCCCGGCG GGCGCTGGGC GAGTTCCGGG TCCGCGGCGT GCGCACCAAC ATCGAGTTCC TCGGCCGGCT GCTCGAAGAC CCCGACTTCC TGGCCGGCGG GGTCACGACC TCGTTCATCG ACGAGCGCCC CGGCCTGGTG AACGCCCTCG ACGACGGGGA CGCGACGAGC CGCATGCTGG CCCGGCTGGC GGAGAGCACG GTGAACGGGC ACGCCCGCCC GGCGGGGGTG GCTCTGACGG AGCCGCGCAC GCTGCTCCCG CCGCTGCACC CGACCGACAC CCCGGCGGCC GGGTCGCGTC AGCTCCTGAC CGAGCTGGGG CCGGCCGGGT GGGCGGCGGA CCTGCGGGGG CGGACGGCGC TGGCAGTCAC CGACACCACG CTGCGTGACG CGCACCAGTC GTTGCTGGCG ACCCGGCTGC GCAGCTTCGA CATCCTGGCC GCCGCGCCCG CGCTCGCCGA GCTCACCCCG AACCTGCTGA GCCTGGAGTG CTGGGGTGGG GCCACCTACG ACGTCGCGTT GCGCTTCCTC GCCGAGGACC CGTGGGAGCG CCTGACGGCG GTGCGGGCGG CGGTGCCGAA CATCTGCCTG CAGATGCTGC TGCGCGGGCG CAACGCCGTC GGCTACACCC CCTACCCGGA CGACGTCGTG CGCGCGTTCG TCGCCGAGGC GGCGGCCGCC GGGCTCGACA TCTTCCGGAT CTTCGACGCG CTCAACGACA TCGAGCAGAT GCGCCCGGCG ATCGCCGCCG TGCTGGAGAC GAACGCCGTC GCCGAGGGCA CCCTGTGCTA CACCGGCGAC CTGACCGACC CCGGCGAGCG GATCTACACC CTCGACTACT ACCTGCGCCT GGCCGACCAG CTCGTGCAGG CCGGCGTGCA CGTGTTGGCG ATCAAGGACA TGGCCGGGCT GCTGCGCCCG GCCGCCGCCC ACGCGCTGGT CGCCGCGCTG CGGGAGCGCT TCGACGTACC CGTCCACCTG CACACCCACG ACACCGCCGG TGGTCAGCTC GCCACCCTGA TCGCGGCCAA CGAGGCCGGG GTGGACGCCG TCGACGCGGC GGCCGCGCCG ATGTCCGGCG GCACCAGCCA GCCGAACCTC TCCGCCCTGG TCGCCGCCAC CGACCACACC CCGCGGGCCA CCGGCATCCG GCTCGGCGCG CTGTCGGCGA TGGAGTCGTA CTGGGAGGCC GTGCGCGATC TCTACTCGCC GTTCGAGGCC GGGCTGCGCG CGCCGACCGG CGCGGTGTAC CGGCACGAGA TCCCCGGCGG GCAGCTCACC AACCTGCGCC AGCAGGCCAT CAGCCTCGGC CTCGGCGACC GCTGGGCCCA GGTGACCGAG TGCTACGCCG TGGCGAACGA CCTGCTCGGC AAGCCGATCA AGGTCACCCC GACCAGCAAG GTCGTCGGTG ACCTGGCGCT GTTCATCGCC GGCGGCTCGG TGGACGTCGA CCGGCTGCGT GAGCATCCCG AGGAGTACGA CCTGCCCGCC AGCGTCCTGG GCTACCTGGC CGGCGAACTC GGCACACCCC CGGCCGGCTT CGCCGAGCCG TTCCGGGAGC GGGCGCTGGC CGGCCGCCGT CCCGAGCCGC CCGCGGCCAC CCTCGCCACC GAGGACGCCG CCGCGCTGGC CGAGCCCGGC GCCGGCCGGC GGGCGGCGCT GTCCCGGCTG CTGTTCCCCG GCCCGTGGAA GGACTACCGC AAGGCCGTCG CCACCTACGG CGACTCCTCG GTGATCCCCA CCGAGGCGTT CCTGTTCGGC CTGGTCCCCG GCCGCCCGGC GAGCGTCATG CTCGAACCCG GAGTCGAGAT CATCGTGACG CTGGAGACGG TGGGCGAGCC GGACAGCGGC GGCATGCGCA CCCTCTACCT GCGGGTCAAC GGCCAGCCCC GCCCGGTTCG GGTCCGCGAC AACTCGATCA AGGCGACGGC CACCGCGGCC CGGCGCGCCG ACCCGTCCGA CCCCACCCAC GTCGGCGCCG GCCTGCCCGG GATCGTCACC TTCGCCGTCG CGGCCGGCGA CGAGATCGAG AAGGGCCAGA AACTCGCCGT GGTCGAGGCC ATGAAGATGG AGGCCGCGGT CACCAGCCCG GCCGCCGGGA AGATCGCCGA ACTCGTCCGC TCCAGCGGCG AATCCGTAGA GGTCGGCGAC CTCCTCCTGA TCCTGCGCCC CTGA
|
Protein sequence | MRKVLVANRS EIAVRVFRAA HELGLRTVAV YTPEDVSALH RTKASEAYEL CEPGHPVRGY LDIDALLRVA KQADADTLHP GYGFLSESAV LAEACAAAGV TFVGPPPAVL RLTGDKVAAR NAAIAAGLPV LRASEPLPDG VGAQEAAAEV GFPLFVKASA GGGGRGLRRV LRPEDLAEAV ASASREAAAA FGDGTVFLEQ AVERPRHVEV QVFADGYGNV IHLFERDCSV QRRHQKVVEI APAPALDEPI RDALCEAAVR FARHVGYVNA GTVEFLVGAD GAFTFMEMNP RIQVEHTVTE EVTGVDIVAA QLRVAAGESL IDLNLTQDAI ALRGTAIQCR VTTEDPSDGF RPDTGTITFY QSPGGPGVRL DGATYPGAEV SPYFDSLLVK LTARGSTLEK ASRRARRALG EFRVRGVRTN IEFLGRLLED PDFLAGGVTT SFIDERPGLV NALDDGDATS RMLARLAEST VNGHARPAGV ALTEPRTLLP PLHPTDTPAA GSRQLLTELG PAGWAADLRG RTALAVTDTT LRDAHQSLLA TRLRSFDILA AAPALAELTP NLLSLECWGG ATYDVALRFL AEDPWERLTA VRAAVPNICL QMLLRGRNAV GYTPYPDDVV RAFVAEAAAA GLDIFRIFDA LNDIEQMRPA IAAVLETNAV AEGTLCYTGD LTDPGERIYT LDYYLRLADQ LVQAGVHVLA IKDMAGLLRP AAAHALVAAL RERFDVPVHL HTHDTAGGQL ATLIAANEAG VDAVDAAAAP MSGGTSQPNL SALVAATDHT PRATGIRLGA LSAMESYWEA VRDLYSPFEA GLRAPTGAVY RHEIPGGQLT NLRQQAISLG LGDRWAQVTE CYAVANDLLG KPIKVTPTSK VVGDLALFIA GGSVDVDRLR EHPEEYDLPA SVLGYLAGEL GTPPAGFAEP FRERALAGRR PEPPAATLAT EDAAALAEPG AGRRAALSRL LFPGPWKDYR KAVATYGDSS VIPTEAFLFG LVPGRPASVM LEPGVEIIVT LETVGEPDSG GMRTLYLRVN GQPRPVRVRD NSIKATATAA RRADPSDPTH VGAGLPGIVT FAVAAGDEIE KGQKLAVVEA MKMEAAVTSP AAGKIAELVR SSGESVEVGD LLLILRP
|
| |