Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0317 |
Symbol | |
ID | 3903349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 366037 |
End bp | 369447 |
Gene Length | 3411 bp |
Protein Length | 1136 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637877646 |
Product | pyruvate carboxylase |
Protein accession | YP_479433 |
Protein GI | 86739033 |
COG category | [C] Energy production and conversion |
COG ID | [COG1038] Pyruvate carboxylase |
TIGRFAM ID | [TIGR01235] pyruvate carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.557231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTAAGG TACTGGTTGC AAATCGTAGC GAGATCGCCG TCCGTGTTTT CCGAGCCGCT CAGGAACTGG GCCTGCGCAC GGTGGCGGTC TACACGCCCG AAGACGTTTC CGCGCTGCAC CGGACAAAGG CGTCCGAGGC CTACGAGATC GGTGGCCCCG GGCACCCGGT GCGCGGATAC CTGGACATCG ATGCCCTGCT CACCGTGGCC AAGCAGGCGG AGGCCGACGC GCTGCACCCC GGCTACGGCT TCCTGTCGGA GTCGGCGGTG CTCGCCGATG CGTGCGCCAC CGCCGGCGTC ACCTTCGTCG GACCGCCCCC GGCGGTCCTG CGGCTGACCG GAGACAAGGT CGCCGCCCGG GACGCCGCGC TGGCCGCCGG GCTGCCGGTG CTGCGCGCCT CGGTGCCGCT GCCGGAGGGC GCCGGGGCGC TCGCCGCCGC CGAGGAGGTC GGCTTCCCGC TGTTCGTCAA GGCCTCCGCC GGTGGTGGGG GGCGCGGCCT GCGGCGGGTG GAACGCCCCG CCGACCTCGC CGACGCGGTG GCGAGCGCGT CCCGGGAGGC CGCCGCCGCC TTCGGGGACG GCACGATCTT CCTGGAGCAG GCGGTGGACC GGCCGCGCCA CATCGAGGTC CAGATCCTGG CCGACGCCTA CGGCAACATC ATCCATCTGT ACGAGCGGGA CTGCTCGGTG CAGCGCCGGC ACCAGAAGGT CGTGGAGATC GCCCCGGCGC CGCAGCTCGA CGAGGCGGTC CGGGCGAGCA TCTGCGCCGA CGCGGTGCGG TTCGCCCGGC ATGTCGGCTA TGTCAATGCG GGCACGGTCG AGTTTCTTCT CGGCGCCGAC GACCGGTACA CGTTCATGGA GATGAATCCC CGGATCCAGG TCGAGCACAC GGTCACCGAG GAGGTCACCG GGGTTGATCT GGTCGCGGCG CAGCTGCGGA TCGCTGATGG GGAGAGCCTG GCCGGCCTCA ACCTCACCCA GGAGCAGATC GTCCTGCGCA GCACCGCCAT CCAGTGCCGG GTCACCACCG AGGATCCCTC GGATGGCTTC CGGCCCGACA TCGGGACGAT CAGCTTCTAC CAGTCACCCG GCGGTCCGGG GGTGCGCCTC GACGGCGCCA CGTACCCGGG GGCCGAAGTC AGCCCGTACT TCGACTCGTT GCTGGTGAAG CTGACGACCC GGGGTAACAC GCTGGAGAAG GCGGCGCGGC GCGCCCGGCG GGCGCTGAAC GAGTTCCGGG TCCGCGGGGT GCGCACGAAC ATCGACTTCC TGGTGCGGCT CCTCGCTGAC CCCGGCTTCC TCGCCGGCGG GGTGAGCACG TCGTTCATCG ATGAACGGCC GGAGCTGCTC GCTCCCGGCA AGGGGGCGGA CCCGACCAGC CGGCTGCTGG CTCGCCTCGC CGAGAGCACG GTGAACGGAT TCCCGCGCCC GGCGGTCGCC CTGACCGACC CCCGGGAGTT GCTGCCGCCC GCGCCCGCGC AGGTGGCGGC GTCCGGCGGC ACGGCGTCCG GCGGCACGCA GCCGCCGGCG GGATCCCGGC AGCTGCTGAC GGAGCTCGGG CCGGCGGGCT GGGCGGCGTG GCTGCGCGCC CAGGAGGCGC TGGCCGTCAC CGACACCACC CTGCGTGACG CCCACCAGTC GCTGCTGGCC ACGAGGCTGC GCAGCTTCGA CATCCTGGCG GCCGCCCCGT CGTACGCCGC GCTGACGCCG AACCTGCTGA GCCTGGAGGC CTGGGGCGGG GCGACCTACG ATGTCGCGCT GCGCTTTCTC GCGGAGGATC CGTGGGAGCG GCTCGCCGCG CTGCGCCAGG CCGCGCCGAA CATCTGCCTG CAGATGCTGC TGCGGGGGCG CAACGCGGTC GGGTACACCC CCTACCCGGA CGACGTCGTC CGGGCGTTCG TCGCCGAGGC CGCGGCGACC GGCGTCGACA TCTTCCGGAT CTTCGATGCG CTGAACGACA TCGAGCAGAT GCGCCCGGCG ATCGAGGCGG TGCTCGGTAC CGGGGCGATC GCCGAAGGGA CGCTCTGCTA CACCGGTGAC CTCTCCGACC CGGCGGAGCG GATCTACACC CTCGACTACT ACCTGCGCCT CGCCGAAGAG CTCGTCGAGA CGGGTGTGCA TGTGCTTGCC GTGAAGGACA TGGCGGGACT GCTGCGCCCG GCGGCGGCGG CCACGCTGGT GACGGCCCTG CGGGAGCGTT TCGACCGGCC CGTCCACCTG CATACCCACG ACACCGCGGG CGGCCAGCTC GCCACCCTGC TCGCGGCGTC CGCGGCCGGG GTGGACGCGG TGGACGCCGC CGCCGCGCCG ATGTCCGGCG GCACCAGCCA GGTCAACATG TCCGCGCTGG TGGCGGCGAC CGACCACACT CCCCGGGCGA CCGGCATCGC GCTGTCCGCG CTGTCCGCGA TGGAGCCGTA CTGGGAGGCT GTGCGCGACC TCTACGCCCC GTTCGAGGCG GGGCTGCGGG CCCCGACCGG GGCCGTGTAC CGGCACGAGA TTCCCGGCGG CCAGCTGACG AACCTGCGTC AGCAGGCGAT CGCGCTGGGA CTGGGGGACC GGTGGGCCGA GGTCACCGAG TGCTACGCGA TCGCCAACGA GCTGCTCGGC AAGCCCATCA AGGTGACGCC GACCAGCAAG GTCGTCGGTG ACCTGGCTCT GTTCATCGCC GGCGGATCCG TCGACGTGGA CCGGTTGCGT GCGCATCCCG AGGAGTTCGA CCTGCCGGCC AGCGTGCTCG GCTACCTCGC GGGTGAACTC GGCACGCCGC CGGCAGGCTT CGCCGAACCG TTCCGGGAAC GGGCGCTGGC CGGGCGCCGT CCGTCGCCGC CGTCCGTCGA CCTCGACGCG GCGGACGCCG ACGAGCTCTC CTTCCCGGGG GCCCGGCGGG TGGCCCTGTC CAGGCTGCTC TTCCCCGGCC CGTGGAAGGA CTATCTGCGG GCGGTGGACG CCTACGGGGA CTCCTCCGTG GTCCCGACCG ACGGCTTTTT GTACGGGCTG CGGCCTGGCG TTCCGCTCAC CGTCACCCTG GAGCCCGGGG TAGAGATCAT CGTGGAGCTG GAGACGCTCT CCGAACCGGA CGACTCCGCG ATGCGCACCC TGTACCTGCG GGTCAACGGC CAGCCCCGGC CGGTGCGGGT CCGGGACGCG TCCATCACGG CCACCACCAC GGCGGCCCGT CGGGCGGACG CCGCCGATCC GAACCAGGTG GGCGCCGGGC TGCCGGGCAT CGTCACGTTC AGCGTCGCGG TGGGCGACAC GGTGACGAAG GGCCAGCGGC TGGCGGTGGT GGAGGCGATG AAGATGGAGG CCGCGGTGAC CAGCCCGGTG GGCGGGACCG TCGTGGAGCT CGTCCGCTCC AGCGGCGACT CCGTCGAGGT CGGCGACCTG CTGCTTACTC TGCGTTCCTG A
|
Protein sequence | MRKVLVANRS EIAVRVFRAA QELGLRTVAV YTPEDVSALH RTKASEAYEI GGPGHPVRGY LDIDALLTVA KQAEADALHP GYGFLSESAV LADACATAGV TFVGPPPAVL RLTGDKVAAR DAALAAGLPV LRASVPLPEG AGALAAAEEV GFPLFVKASA GGGGRGLRRV ERPADLADAV ASASREAAAA FGDGTIFLEQ AVDRPRHIEV QILADAYGNI IHLYERDCSV QRRHQKVVEI APAPQLDEAV RASICADAVR FARHVGYVNA GTVEFLLGAD DRYTFMEMNP RIQVEHTVTE EVTGVDLVAA QLRIADGESL AGLNLTQEQI VLRSTAIQCR VTTEDPSDGF RPDIGTISFY QSPGGPGVRL DGATYPGAEV SPYFDSLLVK LTTRGNTLEK AARRARRALN EFRVRGVRTN IDFLVRLLAD PGFLAGGVST SFIDERPELL APGKGADPTS RLLARLAEST VNGFPRPAVA LTDPRELLPP APAQVAASGG TASGGTQPPA GSRQLLTELG PAGWAAWLRA QEALAVTDTT LRDAHQSLLA TRLRSFDILA AAPSYAALTP NLLSLEAWGG ATYDVALRFL AEDPWERLAA LRQAAPNICL QMLLRGRNAV GYTPYPDDVV RAFVAEAAAT GVDIFRIFDA LNDIEQMRPA IEAVLGTGAI AEGTLCYTGD LSDPAERIYT LDYYLRLAEE LVETGVHVLA VKDMAGLLRP AAAATLVTAL RERFDRPVHL HTHDTAGGQL ATLLAASAAG VDAVDAAAAP MSGGTSQVNM SALVAATDHT PRATGIALSA LSAMEPYWEA VRDLYAPFEA GLRAPTGAVY RHEIPGGQLT NLRQQAIALG LGDRWAEVTE CYAIANELLG KPIKVTPTSK VVGDLALFIA GGSVDVDRLR AHPEEFDLPA SVLGYLAGEL GTPPAGFAEP FRERALAGRR PSPPSVDLDA ADADELSFPG ARRVALSRLL FPGPWKDYLR AVDAYGDSSV VPTDGFLYGL RPGVPLTVTL EPGVEIIVEL ETLSEPDDSA MRTLYLRVNG QPRPVRVRDA SITATTTAAR RADAADPNQV GAGLPGIVTF SVAVGDTVTK GQRLAVVEAM KMEAAVTSPV GGTVVELVRS SGDSVEVGDL LLTLRS
|
| |