Gene Franean1_6583 details

Gene Information

Locus tag: Franean1_6583
Symbol:
ID: 5674898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8009511
End bp: 8012456
Gene Length: 2946 bp
Protein Length: 981 aa
Translation table: 11
GC content: 74%
IMG OID: 641245434
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_001510826
Protein GI: 158318318
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
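
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 8009511
END_BP = 8012456
GENE_LENGTH_BP = 2946
PROTEIN_LENGTH_AA = 981


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                  # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under these assumptions, the 2946 bp span corresponds to 982 codons, of which the last is the stop codon, leaving 981 amino acids.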


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.800439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACCA CACCTGGCAT CCCCGGGACG GCAAGCACCG CTAGCCCGGA GAGCAGCAAC 
AACCCGGCGA GCACCGGTAA CCCGGCGGAC GGGCGTCCGG ACCAGCAGGC TCAGGACCCG
GGCGGCTTCT CCGGCGGCGT GTCCTCCGAC GCGCGCCACG CGGCCGTCCG CGCCGGTGTG
CGCCGCCTCG GCGCGCTGCT CGGCGAGGCC CTCACCCGCC ACGAGGGCCC GGAACTGCTC
GCGCTGGTCG AACGGGTCCG AATGCTCGCC CGCGGCGAGG ACGGCGGCAC CGAGCTCGCC
GCCGTCCTCG ACGGCGTCGA CGCGCGGCAG GCGATCCTGC TGGCCCGCGC GTTCACCGCC
TACTTCCAGC TCGCGAACAT CACCGAGCAG TTGCACCGGG CGCGGGAGAT CTCCGACCGC
CCGCGCAGCC GGCTGCGCGA CCTCGCCGAC CGCATCGCCG AAGCGGTGGA CGAGGGTGTC
GTCGACCCGG GCCTTCCGGC CGAGGTCATG CGGCGGCTGC AGCTGCGCCC GGTCTTCACC
GCGCACCCCA CCGAGGCCAG CCGCCGTTCC GTCCTGGAGA CGCTGCGCGG CATCGCCGAC
CTGCTCGACG CCACCGACGA CCCGCGCCGC CCCGGCGCCG ACGACGAGCG GCTGGCCCGC
CGCCTCGCCG AGCTGGTGGA CGTCCTCTGG CAGACCGACG AGCTGCGGGT GGAGCGGCCG
AAGCCCGCGG ACGAGGCACG CTCGGCCGCG TACTACCTCA CCTCGATCGC CACCGAGGTC
CTGCCCGACC TGCTGGAGGA GCTCGACACC GCGTTCGCCG GGATCGGGGT CGAGCTGCCG
GTGACCGCCC GTCCGCTGGC CTTCGGCTCG TGGGCCGGCG GCGACCGGGA CGGGAACCCG
AACGTCACGC CGGAGGTCAC CCTGGAGGTA CTCAACCTCC AGTACGAGTT CGGCATCCGG
GTCCTCACCG CCCTCGTCGA CCGGCTGGTC CGTGAGCTGA CGGCGTCCAC CCGGGTGGTC
GGGGAGGTCA GCGGCGACCT GCTCGCCGCA CTGGCCGCGG ACCGCGCCGC GCTGCCCGAG
GTGTACGACC GCTTCATCCG GCTCAACGCC GAGGAGCCGT ACCGGCTCAA GTGCAGCTAC
ATCCTGGCGC GGCTGGCCGG CACCCGGGCC CGGCTGGCCG CCGGCGCACC GCACGTCCCC
GGCAAGGACT ACCTGCGGCC CACCGACCTG GTGGACGAGC TGGAGCTCAT GCGGGTCTCC
CTCGCCGCGA GCCACGGCGA GCTCGTCGCG AACGGGCCGC TGCGACGCGC GATCCGCACG
GCGAGCGCCG TCGGCCTCTC GCTGGCCACC CTCGACATCC GCGAGCACGC CGACGCCCAC
CACGCCGCCC TCGGGGCGAT CTACGACCAG CTCGCCGAGC TGGACGTCCC CTATCGCGAC
CTGGACCGGC CGGCCCGGCT CGCGCTGCTG TCGGCCGAGC TGGCGGGGCG CCGGCCGCTG
CTCGGCGCCG TCCCGCCGCG GCTGCCCGAG CGGGTGGCGC GCACCCTCGA ACTGCTCACC
ACGGTGCGCC ACGCGCTGGA CCAGTACGGC GACGGCGTCA TCGAGAGCTA CATCGTCTCG
ATGACCCGCG ACGCCGACGA CATCCTGGCC GTCGCCGTGC TCGCCCGCGA GGCCGGTCTG
GTCGGCCTCG GCCGGCCACC GGGGACGTCC AGCGGCCCGG GAATCGGCAC GGAACGTGAC
GGGCCCGCGG TGCCGGCGGT CGCGTGGGCG CGGATCGGGT TCGTGCCGCT GTTCGAGACC
GTCGCCGAAC TGCGCGCCGC GGGGACGCTG CTCGACCGGC TGCTCGCCGA CCCGTCCTAC
CGGGCGCTGG TCGCGGCACG CGGTGACGTC CAGGAGGTCA TGCTCGGCTA TTCGGACTCG
TCCAAGGACG CCGGCATCAC CGCGTCCCAG TGGGAGATCC ACAAGGCACA GCGTGAGCTG
CGGGACACCG CCCGCGCCCA CGGGGTCGTG CTGCGGCTCT TCCACGGCCG CGGCGGATCG
GTCGGCCGCG GCGGCGGCCC CTCCGGCGAG GCGATCCTTG CCCAGCCGTT CGGCACCCTG
GACGGACCGA TCAAGGTCAC CGAACAGGGC GAGGTGATCT CGGACAAGTA CACCCTGCCG
CAGCTGGCCC GGCAGAACCT GGAGATCACC CTGTCGGCGG TGGTGGAGGC GTCCGTCCTG
CACCGCACCT CCCGGCTCAC CGGACAGGCG CTCGCCGGCT GGAACGGGAC CATGACGGCG
GTCGCGGACG CGGCGAAGGA CGCCTACCGC TCCCTCGTCG CCGACCCGGC CCTGGTTCCG
TTCTTCCTGG CGGCGACGCC GGTGGAGGAG CTCGGCAACC TCAACATCGG CTCCCGCCCC
TCGCGGCGGC CCGGCGGCTC CGGAGGGCTG GCGGACCTGC GGGCCATCCC CTGGGTGTTC
GGCTGGACGC AGGCCCGGAT CATCCTGCCC GGCTGGTTCG GGGTCGGCTC CGGGCTGGCG
GCGGCGCGCG AGGCCGGCGC GGGCGGGGAG CTGGCCGAGA TGTACCGCTC ATGGCATTTC
TTCCGGACGT TTCTCGGCAA CGTCCAGATG ACCCTGGCGA AGTCGGACCT CGCGATCGCC
CAGCGCTACG TCTCGGCGCT GGTCGACCCG GCCACGGCCG CGCTGTTCGA CGTCATCCGG
GACGAGCACG AGCGGACGGT GCGGGAGGTG CTGGCCGTCA CCGGGCAGGC CGGCCTGCTC
GAGACCGCGC CGGTGCTGCG CCACACGCTG CTGGTCCGGG ACGCCTACCT GGCCCCGCTG
CACGCGTTGC AGGTCTCCCT GCTGGCCCGC ACCAGGGCGG CCGGTGACAT CGCCGACCCT
CAGCTACGCC GCGGCCTCCT ACTGACCATC AATGGAATAG CCGCCGGCCT CCGCAACACG
GGCTGA
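
A GC content figure like the one listed for this gene can be derived from a sequence printed in this layout by removing the spaces and line breaks and counting G and C bases. The following is a minimal sketch with hypothetical helper names; it is demonstrated only on the first 10-base block of the gene sequence, not on the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 10-base block of the gene sequence, used here only as a small example.
first_block = clean_sequence("GTGACCACCA ")
print(len(first_block))          # 10
print(gc_fraction(first_block))  # 0.6
```

To check the reported GC content, the whole gene sequence would be passed through `clean_sequence` and `gc_fraction` in the same way.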
 
Protein sequence
MTTTPGIPGT ASTASPESSN NPASTGNPAD GRPDQQAQDP GGFSGGVSSD ARHAAVRAGV 
RRLGALLGEA LTRHEGPELL ALVERVRMLA RGEDGGTELA AVLDGVDARQ AILLARAFTA
YFQLANITEQ LHRAREISDR PRSRLRDLAD RIAEAVDEGV VDPGLPAEVM RRLQLRPVFT
AHPTEASRRS VLETLRGIAD LLDATDDPRR PGADDERLAR RLAELVDVLW QTDELRVERP
KPADEARSAA YYLTSIATEV LPDLLEELDT AFAGIGVELP VTARPLAFGS WAGGDRDGNP
NVTPEVTLEV LNLQYEFGIR VLTALVDRLV RELTASTRVV GEVSGDLLAA LAADRAALPE
VYDRFIRLNA EEPYRLKCSY ILARLAGTRA RLAAGAPHVP GKDYLRPTDL VDELELMRVS
LAASHGELVA NGPLRRAIRT ASAVGLSLAT LDIREHADAH HAALGAIYDQ LAELDVPYRD
LDRPARLALL SAELAGRRPL LGAVPPRLPE RVARTLELLT TVRHALDQYG DGVIESYIVS
MTRDADDILA VAVLAREAGL VGLGRPPGTS SGPGIGTERD GPAVPAVAWA RIGFVPLFET
VAELRAAGTL LDRLLADPSY RALVAARGDV QEVMLGYSDS SKDAGITASQ WEIHKAQREL
RDTARAHGVV LRLFHGRGGS VGRGGGPSGE AILAQPFGTL DGPIKVTEQG EVISDKYTLP
QLARQNLEIT LSAVVEASVL HRTSRLTGQA LAGWNGTMTA VADAAKDAYR SLVADPALVP
FFLAATPVEE LGNLNIGSRP SRRPGGSGGL ADLRAIPWVF GWTQARIILP GWFGVGSGLA
AAREAGAGGE LAEMYRSWHF FRTFLGNVQM TLAKSDLAIA QRYVSALVDP ATAALFDVIR
DEHERTVREV LAVTGQAGLL ETAPVLRHTL LVRDAYLAPL HALQVSLLAR TRAAGDIADP
QLRRGLLLTI NGIAAGLRNT G
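
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code and that an alternative start codon such as GTG is read as methionine in the first position; the set of start codons listed is an assumption made for illustration, and the helper names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed start codons for table 11 (illustrative, not verified against this record).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(raw: str) -> str:
    """Translate a CDS, reading a recognised first codon as M and stopping at a stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases and last 12 bases of the gene sequence.
print(translate("GTGACCACCA CACCTGGCAT C"))  # MTTTPGI
print(translate("AACACGGGCTGA"))             # NTG
```

The first call reproduces the opening residues of the protein sequence (MTTTPGI), even though GTG would otherwise encode valine; the second shows the final residues followed by the TGA stop codon.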