Gene Francci3_0317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0317 
Symbol 
ID3903349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp366037 
End bp369447 
Gene Length3411 bp 
Protein Length1136 aa 
Translation table11 
GC content73% 
IMG OID637877646 
Productpyruvate carboxylase 
Protein accessionYP_479433 
Protein GI86739033 
COG category[C] Energy production and conversion 
COG ID[COG1038] Pyruvate carboxylase 
TIGRFAM ID[TIGR01235] pyruvate carboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.557231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTAAGG TACTGGTTGC AAATCGTAGC GAGATCGCCG TCCGTGTTTT CCGAGCCGCT 
CAGGAACTGG GCCTGCGCAC GGTGGCGGTC TACACGCCCG AAGACGTTTC CGCGCTGCAC
CGGACAAAGG CGTCCGAGGC CTACGAGATC GGTGGCCCCG GGCACCCGGT GCGCGGATAC
CTGGACATCG ATGCCCTGCT CACCGTGGCC AAGCAGGCGG AGGCCGACGC GCTGCACCCC
GGCTACGGCT TCCTGTCGGA GTCGGCGGTG CTCGCCGATG CGTGCGCCAC CGCCGGCGTC
ACCTTCGTCG GACCGCCCCC GGCGGTCCTG CGGCTGACCG GAGACAAGGT CGCCGCCCGG
GACGCCGCGC TGGCCGCCGG GCTGCCGGTG CTGCGCGCCT CGGTGCCGCT GCCGGAGGGC
GCCGGGGCGC TCGCCGCCGC CGAGGAGGTC GGCTTCCCGC TGTTCGTCAA GGCCTCCGCC
GGTGGTGGGG GGCGCGGCCT GCGGCGGGTG GAACGCCCCG CCGACCTCGC CGACGCGGTG
GCGAGCGCGT CCCGGGAGGC CGCCGCCGCC TTCGGGGACG GCACGATCTT CCTGGAGCAG
GCGGTGGACC GGCCGCGCCA CATCGAGGTC CAGATCCTGG CCGACGCCTA CGGCAACATC
ATCCATCTGT ACGAGCGGGA CTGCTCGGTG CAGCGCCGGC ACCAGAAGGT CGTGGAGATC
GCCCCGGCGC CGCAGCTCGA CGAGGCGGTC CGGGCGAGCA TCTGCGCCGA CGCGGTGCGG
TTCGCCCGGC ATGTCGGCTA TGTCAATGCG GGCACGGTCG AGTTTCTTCT CGGCGCCGAC
GACCGGTACA CGTTCATGGA GATGAATCCC CGGATCCAGG TCGAGCACAC GGTCACCGAG
GAGGTCACCG GGGTTGATCT GGTCGCGGCG CAGCTGCGGA TCGCTGATGG GGAGAGCCTG
GCCGGCCTCA ACCTCACCCA GGAGCAGATC GTCCTGCGCA GCACCGCCAT CCAGTGCCGG
GTCACCACCG AGGATCCCTC GGATGGCTTC CGGCCCGACA TCGGGACGAT CAGCTTCTAC
CAGTCACCCG GCGGTCCGGG GGTGCGCCTC GACGGCGCCA CGTACCCGGG GGCCGAAGTC
AGCCCGTACT TCGACTCGTT GCTGGTGAAG CTGACGACCC GGGGTAACAC GCTGGAGAAG
GCGGCGCGGC GCGCCCGGCG GGCGCTGAAC GAGTTCCGGG TCCGCGGGGT GCGCACGAAC
ATCGACTTCC TGGTGCGGCT CCTCGCTGAC CCCGGCTTCC TCGCCGGCGG GGTGAGCACG
TCGTTCATCG ATGAACGGCC GGAGCTGCTC GCTCCCGGCA AGGGGGCGGA CCCGACCAGC
CGGCTGCTGG CTCGCCTCGC CGAGAGCACG GTGAACGGAT TCCCGCGCCC GGCGGTCGCC
CTGACCGACC CCCGGGAGTT GCTGCCGCCC GCGCCCGCGC AGGTGGCGGC GTCCGGCGGC
ACGGCGTCCG GCGGCACGCA GCCGCCGGCG GGATCCCGGC AGCTGCTGAC GGAGCTCGGG
CCGGCGGGCT GGGCGGCGTG GCTGCGCGCC CAGGAGGCGC TGGCCGTCAC CGACACCACC
CTGCGTGACG CCCACCAGTC GCTGCTGGCC ACGAGGCTGC GCAGCTTCGA CATCCTGGCG
GCCGCCCCGT CGTACGCCGC GCTGACGCCG AACCTGCTGA GCCTGGAGGC CTGGGGCGGG
GCGACCTACG ATGTCGCGCT GCGCTTTCTC GCGGAGGATC CGTGGGAGCG GCTCGCCGCG
CTGCGCCAGG CCGCGCCGAA CATCTGCCTG CAGATGCTGC TGCGGGGGCG CAACGCGGTC
GGGTACACCC CCTACCCGGA CGACGTCGTC CGGGCGTTCG TCGCCGAGGC CGCGGCGACC
GGCGTCGACA TCTTCCGGAT CTTCGATGCG CTGAACGACA TCGAGCAGAT GCGCCCGGCG
ATCGAGGCGG TGCTCGGTAC CGGGGCGATC GCCGAAGGGA CGCTCTGCTA CACCGGTGAC
CTCTCCGACC CGGCGGAGCG GATCTACACC CTCGACTACT ACCTGCGCCT CGCCGAAGAG
CTCGTCGAGA CGGGTGTGCA TGTGCTTGCC GTGAAGGACA TGGCGGGACT GCTGCGCCCG
GCGGCGGCGG CCACGCTGGT GACGGCCCTG CGGGAGCGTT TCGACCGGCC CGTCCACCTG
CATACCCACG ACACCGCGGG CGGCCAGCTC GCCACCCTGC TCGCGGCGTC CGCGGCCGGG
GTGGACGCGG TGGACGCCGC CGCCGCGCCG ATGTCCGGCG GCACCAGCCA GGTCAACATG
TCCGCGCTGG TGGCGGCGAC CGACCACACT CCCCGGGCGA CCGGCATCGC GCTGTCCGCG
CTGTCCGCGA TGGAGCCGTA CTGGGAGGCT GTGCGCGACC TCTACGCCCC GTTCGAGGCG
GGGCTGCGGG CCCCGACCGG GGCCGTGTAC CGGCACGAGA TTCCCGGCGG CCAGCTGACG
AACCTGCGTC AGCAGGCGAT CGCGCTGGGA CTGGGGGACC GGTGGGCCGA GGTCACCGAG
TGCTACGCGA TCGCCAACGA GCTGCTCGGC AAGCCCATCA AGGTGACGCC GACCAGCAAG
GTCGTCGGTG ACCTGGCTCT GTTCATCGCC GGCGGATCCG TCGACGTGGA CCGGTTGCGT
GCGCATCCCG AGGAGTTCGA CCTGCCGGCC AGCGTGCTCG GCTACCTCGC GGGTGAACTC
GGCACGCCGC CGGCAGGCTT CGCCGAACCG TTCCGGGAAC GGGCGCTGGC CGGGCGCCGT
CCGTCGCCGC CGTCCGTCGA CCTCGACGCG GCGGACGCCG ACGAGCTCTC CTTCCCGGGG
GCCCGGCGGG TGGCCCTGTC CAGGCTGCTC TTCCCCGGCC CGTGGAAGGA CTATCTGCGG
GCGGTGGACG CCTACGGGGA CTCCTCCGTG GTCCCGACCG ACGGCTTTTT GTACGGGCTG
CGGCCTGGCG TTCCGCTCAC CGTCACCCTG GAGCCCGGGG TAGAGATCAT CGTGGAGCTG
GAGACGCTCT CCGAACCGGA CGACTCCGCG ATGCGCACCC TGTACCTGCG GGTCAACGGC
CAGCCCCGGC CGGTGCGGGT CCGGGACGCG TCCATCACGG CCACCACCAC GGCGGCCCGT
CGGGCGGACG CCGCCGATCC GAACCAGGTG GGCGCCGGGC TGCCGGGCAT CGTCACGTTC
AGCGTCGCGG TGGGCGACAC GGTGACGAAG GGCCAGCGGC TGGCGGTGGT GGAGGCGATG
AAGATGGAGG CCGCGGTGAC CAGCCCGGTG GGCGGGACCG TCGTGGAGCT CGTCCGCTCC
AGCGGCGACT CCGTCGAGGT CGGCGACCTG CTGCTTACTC TGCGTTCCTG A
 
Protein sequence
MRKVLVANRS EIAVRVFRAA QELGLRTVAV YTPEDVSALH RTKASEAYEI GGPGHPVRGY 
LDIDALLTVA KQAEADALHP GYGFLSESAV LADACATAGV TFVGPPPAVL RLTGDKVAAR
DAALAAGLPV LRASVPLPEG AGALAAAEEV GFPLFVKASA GGGGRGLRRV ERPADLADAV
ASASREAAAA FGDGTIFLEQ AVDRPRHIEV QILADAYGNI IHLYERDCSV QRRHQKVVEI
APAPQLDEAV RASICADAVR FARHVGYVNA GTVEFLLGAD DRYTFMEMNP RIQVEHTVTE
EVTGVDLVAA QLRIADGESL AGLNLTQEQI VLRSTAIQCR VTTEDPSDGF RPDIGTISFY
QSPGGPGVRL DGATYPGAEV SPYFDSLLVK LTTRGNTLEK AARRARRALN EFRVRGVRTN
IDFLVRLLAD PGFLAGGVST SFIDERPELL APGKGADPTS RLLARLAEST VNGFPRPAVA
LTDPRELLPP APAQVAASGG TASGGTQPPA GSRQLLTELG PAGWAAWLRA QEALAVTDTT
LRDAHQSLLA TRLRSFDILA AAPSYAALTP NLLSLEAWGG ATYDVALRFL AEDPWERLAA
LRQAAPNICL QMLLRGRNAV GYTPYPDDVV RAFVAEAAAT GVDIFRIFDA LNDIEQMRPA
IEAVLGTGAI AEGTLCYTGD LSDPAERIYT LDYYLRLAEE LVETGVHVLA VKDMAGLLRP
AAAATLVTAL RERFDRPVHL HTHDTAGGQL ATLLAASAAG VDAVDAAAAP MSGGTSQVNM
SALVAATDHT PRATGIALSA LSAMEPYWEA VRDLYAPFEA GLRAPTGAVY RHEIPGGQLT
NLRQQAIALG LGDRWAEVTE CYAIANELLG KPIKVTPTSK VVGDLALFIA GGSVDVDRLR
AHPEEFDLPA SVLGYLAGEL GTPPAGFAEP FRERALAGRR PSPPSVDLDA ADADELSFPG
ARRVALSRLL FPGPWKDYLR AVDAYGDSSV VPTDGFLYGL RPGVPLTVTL EPGVEIIVEL
ETLSEPDDSA MRTLYLRVNG QPRPVRVRDA SITATTTAAR RADAADPNQV GAGLPGIVTF
SVAVGDTVTK GQRLAVVEAM KMEAAVTSPV GGTVVELVRS SGDSVEVGDL LLTLRS