Gene Franean1_6501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6501 
Symbol 
ID5674816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7905379 
End bp7908762 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content74% 
IMG OID641245349 
Productpyruvate carboxylase 
Protein accessionYP_001510744 
Protein GI158318236 
COG category[C] Energy production and conversion 
COG ID[COG1038] Pyruvate carboxylase 
TIGRFAM ID[TIGR01235] pyruvate carboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.697042 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCAAGG TACTGGTGGC AAACCGCAGT GAGATCGCGG TTCGGGTGTT CCGGGCCGCG 
CACGAGCTCG GGTTGCGGAC GGTGGCTGTC TACACCCCCG AGGATGTCTC CGCGCTGCAC
CGCACGAAGG CGTCCGAGGC CTACGAGCTG TGCGAGCCGG GTCATCCGGT GCGTGGCTAC
CTCGACATCG ACGCGTTGTT GCGGGTCGCC AAGCAGGCGG ACGCCGACAC GCTGCACCCG
GGTTACGGGT TCCTGTCGGA GTCGGCGGTG CTGGCGGAGG CATGCGCGGC GGCTGGCGTG
ACGTTCGTCG GCCCGCCGCC GGCTGTGCTG CGGCTGACGG GTGACAAGGT GGCGGCCCGT
AACGCGGCGA TCGCCGCGGG GCTGCCGGTG CTGCGGGCCT CGGAGCCGCT GCCGGACGGC
GTCGGCGCGC AGGAGGCCGC GGCCGAGGTC GGGTTCCCGC TGTTCGTGAA GGCGTCGGCG
GGTGGTGGCG GGCGCGGGCT GCGCCGGGTG CTGCGTCCGG AGGACCTCGC GGAGGCGGTG
GCGAGTGCGT CGCGTGAGGC GGCGGCCGCG TTCGGGGACG GCACGGTCTT CCTGGAGCAG
GCGGTGGAGC GGCCGCGTCA TGTCGAGGTG CAGGTCTTCG CCGACGGCTA CGGGAACGTG
ATCCACCTGT TCGAGCGGGA CTGCTCGGTG CAGCGCCGGC ACCAGAAGGT CGTCGAGATC
GCCCCGGCGC CGGCGCTGGA CGAACCGATC CGCGACGCCC TGTGCGAGGC CGCGGTGCGC
TTCGCCCGTC ACGTCGGCTA TGTGAACGCC GGGACGGTGG AGTTCCTCGT CGGGGCCGAC
GGCGCGTTCA CGTTCATGGA GATGAACCCG CGCATCCAGG TCGAGCACAC GGTGACCGAG
GAGGTCACCG GTGTCGACAT CGTCGCCGCG CAGCTGCGGG TCGCCGCCGG GGAGAGCCTG
ATCGACCTGA ACCTCACCCA GGACGCGATC GCTCTTCGGG GCACCGCGAT CCAGTGCCGG
GTGACGACCG AGGATCCCTC GGACGGCTTC CGCCCCGACA CCGGGACGAT CACCTTCTAC
CAGTCGCCGG GCGGGCCCGG GGTCCGCCTC GACGGTGCCA CCTACCCGGG GGCGGAGGTC
AGCCCGTACT TCGACTCGCT GCTGGTGAAG CTGACCGCGC GTGGTTCGAC GCTGGAGAAG
GCGTCCCGGC GCGCCCGGCG GGCGCTGGGC GAGTTCCGGG TCCGCGGCGT GCGCACCAAC
ATCGAGTTCC TCGGCCGGCT GCTCGAAGAC CCCGACTTCC TGGCCGGCGG GGTCACGACC
TCGTTCATCG ACGAGCGCCC CGGCCTGGTG AACGCCCTCG ACGACGGGGA CGCGACGAGC
CGCATGCTGG CCCGGCTGGC GGAGAGCACG GTGAACGGGC ACGCCCGCCC GGCGGGGGTG
GCTCTGACGG AGCCGCGCAC GCTGCTCCCG CCGCTGCACC CGACCGACAC CCCGGCGGCC
GGGTCGCGTC AGCTCCTGAC CGAGCTGGGG CCGGCCGGGT GGGCGGCGGA CCTGCGGGGG
CGGACGGCGC TGGCAGTCAC CGACACCACG CTGCGTGACG CGCACCAGTC GTTGCTGGCG
ACCCGGCTGC GCAGCTTCGA CATCCTGGCC GCCGCGCCCG CGCTCGCCGA GCTCACCCCG
AACCTGCTGA GCCTGGAGTG CTGGGGTGGG GCCACCTACG ACGTCGCGTT GCGCTTCCTC
GCCGAGGACC CGTGGGAGCG CCTGACGGCG GTGCGGGCGG CGGTGCCGAA CATCTGCCTG
CAGATGCTGC TGCGCGGGCG CAACGCCGTC GGCTACACCC CCTACCCGGA CGACGTCGTG
CGCGCGTTCG TCGCCGAGGC GGCGGCCGCC GGGCTCGACA TCTTCCGGAT CTTCGACGCG
CTCAACGACA TCGAGCAGAT GCGCCCGGCG ATCGCCGCCG TGCTGGAGAC GAACGCCGTC
GCCGAGGGCA CCCTGTGCTA CACCGGCGAC CTGACCGACC CCGGCGAGCG GATCTACACC
CTCGACTACT ACCTGCGCCT GGCCGACCAG CTCGTGCAGG CCGGCGTGCA CGTGTTGGCG
ATCAAGGACA TGGCCGGGCT GCTGCGCCCG GCCGCCGCCC ACGCGCTGGT CGCCGCGCTG
CGGGAGCGCT TCGACGTACC CGTCCACCTG CACACCCACG ACACCGCCGG TGGTCAGCTC
GCCACCCTGA TCGCGGCCAA CGAGGCCGGG GTGGACGCCG TCGACGCGGC GGCCGCGCCG
ATGTCCGGCG GCACCAGCCA GCCGAACCTC TCCGCCCTGG TCGCCGCCAC CGACCACACC
CCGCGGGCCA CCGGCATCCG GCTCGGCGCG CTGTCGGCGA TGGAGTCGTA CTGGGAGGCC
GTGCGCGATC TCTACTCGCC GTTCGAGGCC GGGCTGCGCG CGCCGACCGG CGCGGTGTAC
CGGCACGAGA TCCCCGGCGG GCAGCTCACC AACCTGCGCC AGCAGGCCAT CAGCCTCGGC
CTCGGCGACC GCTGGGCCCA GGTGACCGAG TGCTACGCCG TGGCGAACGA CCTGCTCGGC
AAGCCGATCA AGGTCACCCC GACCAGCAAG GTCGTCGGTG ACCTGGCGCT GTTCATCGCC
GGCGGCTCGG TGGACGTCGA CCGGCTGCGT GAGCATCCCG AGGAGTACGA CCTGCCCGCC
AGCGTCCTGG GCTACCTGGC CGGCGAACTC GGCACACCCC CGGCCGGCTT CGCCGAGCCG
TTCCGGGAGC GGGCGCTGGC CGGCCGCCGT CCCGAGCCGC CCGCGGCCAC CCTCGCCACC
GAGGACGCCG CCGCGCTGGC CGAGCCCGGC GCCGGCCGGC GGGCGGCGCT GTCCCGGCTG
CTGTTCCCCG GCCCGTGGAA GGACTACCGC AAGGCCGTCG CCACCTACGG CGACTCCTCG
GTGATCCCCA CCGAGGCGTT CCTGTTCGGC CTGGTCCCCG GCCGCCCGGC GAGCGTCATG
CTCGAACCCG GAGTCGAGAT CATCGTGACG CTGGAGACGG TGGGCGAGCC GGACAGCGGC
GGCATGCGCA CCCTCTACCT GCGGGTCAAC GGCCAGCCCC GCCCGGTTCG GGTCCGCGAC
AACTCGATCA AGGCGACGGC CACCGCGGCC CGGCGCGCCG ACCCGTCCGA CCCCACCCAC
GTCGGCGCCG GCCTGCCCGG GATCGTCACC TTCGCCGTCG CGGCCGGCGA CGAGATCGAG
AAGGGCCAGA AACTCGCCGT GGTCGAGGCC ATGAAGATGG AGGCCGCGGT CACCAGCCCG
GCCGCCGGGA AGATCGCCGA ACTCGTCCGC TCCAGCGGCG AATCCGTAGA GGTCGGCGAC
CTCCTCCTGA TCCTGCGCCC CTGA
 
Protein sequence
MRKVLVANRS EIAVRVFRAA HELGLRTVAV YTPEDVSALH RTKASEAYEL CEPGHPVRGY 
LDIDALLRVA KQADADTLHP GYGFLSESAV LAEACAAAGV TFVGPPPAVL RLTGDKVAAR
NAAIAAGLPV LRASEPLPDG VGAQEAAAEV GFPLFVKASA GGGGRGLRRV LRPEDLAEAV
ASASREAAAA FGDGTVFLEQ AVERPRHVEV QVFADGYGNV IHLFERDCSV QRRHQKVVEI
APAPALDEPI RDALCEAAVR FARHVGYVNA GTVEFLVGAD GAFTFMEMNP RIQVEHTVTE
EVTGVDIVAA QLRVAAGESL IDLNLTQDAI ALRGTAIQCR VTTEDPSDGF RPDTGTITFY
QSPGGPGVRL DGATYPGAEV SPYFDSLLVK LTARGSTLEK ASRRARRALG EFRVRGVRTN
IEFLGRLLED PDFLAGGVTT SFIDERPGLV NALDDGDATS RMLARLAEST VNGHARPAGV
ALTEPRTLLP PLHPTDTPAA GSRQLLTELG PAGWAADLRG RTALAVTDTT LRDAHQSLLA
TRLRSFDILA AAPALAELTP NLLSLECWGG ATYDVALRFL AEDPWERLTA VRAAVPNICL
QMLLRGRNAV GYTPYPDDVV RAFVAEAAAA GLDIFRIFDA LNDIEQMRPA IAAVLETNAV
AEGTLCYTGD LTDPGERIYT LDYYLRLADQ LVQAGVHVLA IKDMAGLLRP AAAHALVAAL
RERFDVPVHL HTHDTAGGQL ATLIAANEAG VDAVDAAAAP MSGGTSQPNL SALVAATDHT
PRATGIRLGA LSAMESYWEA VRDLYSPFEA GLRAPTGAVY RHEIPGGQLT NLRQQAISLG
LGDRWAQVTE CYAVANDLLG KPIKVTPTSK VVGDLALFIA GGSVDVDRLR EHPEEYDLPA
SVLGYLAGEL GTPPAGFAEP FRERALAGRR PEPPAATLAT EDAAALAEPG AGRRAALSRL
LFPGPWKDYR KAVATYGDSS VIPTEAFLFG LVPGRPASVM LEPGVEIIVT LETVGEPDSG
GMRTLYLRVN GQPRPVRVRD NSIKATATAA RRADPSDPTH VGAGLPGIVT FAVAAGDEIE
KGQKLAVVEA MKMEAAVTSP AAGKIAELVR SSGESVEVGD LLLILRP