Gene Francci3_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2689 
Symbol 
ID3904913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3174911 
End bp3177526 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content68% 
IMG OID637880013 
Productphosphoenolpyruvate synthase 
Protein accessionYP_481779 
Protein GI86741379 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.769187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGACG TACGCCCGCT GCGGGAGTTG CGTTCCGGCG ACCGGGACAC GGTCGGCGCC 
AAGGCCGCCA ATCTCGGCGA GCTCATATCC GCCGGCTTTC CGGTTCCCGA CGGTTTCTGC
CTGCCCCAGG CGGTGTATCA CCGCACGGTC GGGGACAAGG TCCGCCCACT GCTTGCGCAG
CTCGACGCGG CTCTTACCGA GGACGCGACA GATGACCAGA TCCGACCGAT CTCGGCGGCC
ATGCGGGCCA CGGTCGAGGC CACGGACGTT CCCGCCGGGC TCGCTGCGGA TGTCGCGCAG
GCCCTTGCGG CATGGCGTAT CGCTGACGTC CGGGTGTCGG TCCGTTCCTC AGCGACGTGG
GAGGACACCG ACGCGACGAG CTTCGCCGGC CAGTATCGCA GCGAACTGGG GGTCCCGCCG
GCGGCCGTGC TCGATTCCGT ACGGCGTTGT TGGGGCTCGC TGTGGGAGCT TCCGGCCATC
CGCTATCGGC AACGGCATGG AATTCCGCAC GGTGCGGTCG GCATGTCCGT GATCGTCCAG
CTGATGGCCG AGGCCGAGGC CGCCGGCGTG CTCTTCACCG TCGACCCGCG GGACGCCGCC
GCCGACCGTC TGGTGATCGA GGCCACCTGG GGCTTCGGTG AGGCGCTGGT GAGTGGGAAA
GTCGATCCCG ACCGTTTTGA CGTCGACCGG TCCGGCGCCA CGCTGCGCCA CGCGCACGTC
GCCGACAAAC GGCAGATGGT CGCGTATCCG TCGCACAGCG GTGCGGGCGG GGTCGATTTC
GTCGACGTTC CGGACCAGCG ACGGCGGGCA CCTTCGCTCA CGGCGGAGCA GGTTGCCGAG
CTGGCCAGCC TCGGCCGCGC GATCGAAACC CATTTCGGTG CCCCTCAGGA CGTCGAATGG
GCCGTTTCCG GAACGACTCT CACGATTCTG CAGGCCCGTC CCATCAGGCT GCCGGCCGCC
GATGAGCCAC CGCCGCCCGC GGCCGACTGG ACAAGTCCGA TCGAGGGCGC CTGGTGGGCG
CGGATAAGCA TCTGTGATTC GTGGCTGCCC GAACCGCTCT CGCCGCTTTT CGCGTCGACT
CTTTTCCCGT GCCTGGTCCG GCACTGGCAG CGGAACTGGG CCGGACCGGA CTCCGCCCAA
CGCAACAACC GGCTGCTGCC CACACCGATG ACGGGCGTCA TCAACGGCTT CGCCTACCTG
CGCTTCGACT ATCACCTGAA CCGGTATCCC CGACACGCCG CAGCCATGGT GCTTCGATTC
TTCCGGTTCC ATCTCGGCCC GCTGCGCAGG CAGTGGCAGC GGGGCATCCT GCCGCGCCAC
TCCGAACGCA TCGAGGCGGC CAACCGTCGG GACCTCACCC GCCTGGACAA CAACGAGCTG
CTCGGGCTGA TCGACGGGGT GCAAGAACTC AGCGGACGCT ACTGGGGGAT CATCGGCGGT
CTGGCCTGGT ACTGGAACGT ATCGGAATGG CTCCTGGCAA CCGTCTATCC CTGGGTCGCC
AGGGCTGGTA CGGGCGCGGG ACTTCCGATC GGCCCCGGAC CCCTGCTGCA GGGCTACCCG
AGCCGCACTC TCGACGTCGA ACTGGAGCTC GCGGAGCTCG CGCGCCATGA CGCCGACGGG
GCCGAGTACA CGGCCGAGTT CGAGCGGTTC ATCGGCCGGC AGGGCCACCA AGTGTACAGC
CTCGACTTCG CCAGCCCGAC CCCCGCCGAG GACCCCGAAG TCTTCAGGGC AACCATCGAA
GCGTACCGGA GCGGAACGCG TCAGCAACCG CAGGAACGAA TCGACGCGCT GGCCGCGCAA
CGGGAGGACC GGCTACGGAC GATAAGAAAG GCCCTCCGGT TCGCGCCGGT GAGACGCGGC
GTCCTCCACC TGCTTCTGCG GTGGAACCTC CGTCAGGGCC GGCTCCGGGA CGAGGTTCTT
TTTCACTTCA CCCGTGGCTG GCCGGTGCTG CGCCGGGCGT ACCTCGAACT CGGTCGCCGG
CTGGTGGCGG CAGGCGTCCT GACTGAGCCC GACGATGTCT TCTACCTCAC CGGTGACCAG
GTGAAACGGC AGCTTGCCGC GCTTGACACC GGCGTGGCCG GCGACGACCT CACCAGCGTC
GTCCACGAGC GGCGCCGGCT CCGCGAACAA CAACGCCTGC TCAGCCCGCC GATACAGGTC
CCGCAGGACG CCCGGATCTT CCTCGGCAGG AGGGACGTCA CCGCCCTGGC AGTTTTCGGC
CCCCGTCCGA GAGGGGCCGA GGATGACGGG CTGCGAGGCT CTCCCGTCAG CCCCGGTCGA
GCCACAGCAC CGGCCCGCAG AATAAGTTCC ACCGACGACT TCGGCCGGCT ACGGCCCGGC
GAGATCCTAG TCGCCCCTCA TCTCACGCCA GCCTGGTCGC CACTGCTTTC CATCGCGGCT
GGTGTGGTAA CAGATACCGG CGGCGCGCTG TCGCATGGCT CGATCGTCGC GCGTGAGTAC
GGAGTTCCCG CGGTTATGGG AGTTCACGGC GCGACCCACA TTATCCAGGA TGGCCAGGTC
GTTACGGTTG ACGGCGATCG GGGTCTCGTA CTCCTGCAAG GAGTTGAACG TGGCGGCAGA
CTCTCCGCTC AGCAGCTGGA CGCCTCCCCT CGATAA
 
Protein sequence
MRDVRPLREL RSGDRDTVGA KAANLGELIS AGFPVPDGFC LPQAVYHRTV GDKVRPLLAQ 
LDAALTEDAT DDQIRPISAA MRATVEATDV PAGLAADVAQ ALAAWRIADV RVSVRSSATW
EDTDATSFAG QYRSELGVPP AAVLDSVRRC WGSLWELPAI RYRQRHGIPH GAVGMSVIVQ
LMAEAEAAGV LFTVDPRDAA ADRLVIEATW GFGEALVSGK VDPDRFDVDR SGATLRHAHV
ADKRQMVAYP SHSGAGGVDF VDVPDQRRRA PSLTAEQVAE LASLGRAIET HFGAPQDVEW
AVSGTTLTIL QARPIRLPAA DEPPPPAADW TSPIEGAWWA RISICDSWLP EPLSPLFAST
LFPCLVRHWQ RNWAGPDSAQ RNNRLLPTPM TGVINGFAYL RFDYHLNRYP RHAAAMVLRF
FRFHLGPLRR QWQRGILPRH SERIEAANRR DLTRLDNNEL LGLIDGVQEL SGRYWGIIGG
LAWYWNVSEW LLATVYPWVA RAGTGAGLPI GPGPLLQGYP SRTLDVELEL AELARHDADG
AEYTAEFERF IGRQGHQVYS LDFASPTPAE DPEVFRATIE AYRSGTRQQP QERIDALAAQ
REDRLRTIRK ALRFAPVRRG VLHLLLRWNL RQGRLRDEVL FHFTRGWPVL RRAYLELGRR
LVAAGVLTEP DDVFYLTGDQ VKRQLAALDT GVAGDDLTSV VHERRRLREQ QRLLSPPIQV
PQDARIFLGR RDVTALAVFG PRPRGAEDDG LRGSPVSPGR ATAPARRISS TDDFGRLRPG
EILVAPHLTP AWSPLLSIAA GVVTDTGGAL SHGSIVAREY GVPAVMGVHG ATHIIQDGQV
VTVDGDRGLV LLQGVERGGR LSAQQLDASP R