Gene Franean1_6970 details

Gene Information

Locus tag: Franean1_6970
Symbol:
ID: 5675282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8494921
End bp: 8497098
Gene Length: 2178 bp
Protein Length: 725 aa
Translation table: 11
GC content: 72%
IMG OID: 641245818
Product: malate synthase G
Protein accession: YP_001511209
Protein GI: 158318701
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01345] malate synthase G
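
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(8494921, 8497098)
print(length)                     # 2178
print(protein_length_aa(length))  # 725
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.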


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.965318
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTGACT ACATCGACGT CGTCGGACTG CGCGTCGCCA GCGAGCTGCA CCAGTTCGTC 
GCCGAGGAGG CGCTGCCCGG GTCGGGTGTC GACCAGGCCG CGTTCTGGGC CGGCGCCACG
GAGATCATCA ACGATCTGAC CCCGCGCAAC CGGGAGCTAC TGGCGCGGCG CGACGAGCTC
CAGCAGGCTG TGGACGACTA CCACCGGCGC ACGCCGGGCC GGCCGGCGCA GAGCGACTAC
GCCGAGTTCC TGACCTCCAT CGGCTACCTG CTCGACGAGC CCGCGCCGTT CACCATCACC
ACCGACGGCG TGGACGACGA GATCGCGACC CAGGCCGGCC CGCAGCTGGT GGTGCCGCTG
CTCAACGCGC GGTACGCGGT CAACGCCGCC AACGCCCGGT GGGGCTCGCT CTACGACGCC
CTGTACGGCA GCGACGTCAT CGACGAGACA GACGGCAGGG AGCGCGCCGG CGGCTACAAC
CCGACCCGCG GCGCCGCGGT CATCGCCCGG GTCCGGGAGA TCCTCGACCG GAGCTTCCCG
CTGACCAGCG GCTCGCACGC GGACGCGACC CGGTACGCGG TCGACGCCGA CGGGCTCGTC
GTCACCGTCG ACGGGGCCGC CGTCCGGCTG GCCGACCCCG CGCTGTTCGT CGGCTTCCGC
GGCGAGGGCG ACGAGCCGAC GGCTGTCCTG CTGACCCATC ACGGGCTGCA CGTCGAGATC
CAGGTCGACC GGAACCACCC GGTCGGGGCG GGCGACCGCG CGGGCGTCAG CGACGTCGTC
GTCGAGGCCG CCGTCACCAC GATCATGGAC CTGGAGGACT CGGTCGCGGC CGTCGACGCC
GCGGACAAGG TGCTCGGCTA CCGCAACTGG CTGCAGCTCA TGCAGCGCCG GCTCACCGCC
GAGGTGGACA AGGGCGGCCG CACCTTCACC CGGGTCCTCG CCGCCGACCG CGAGTACACG
ACCGCCGACG GCGGCACAGT CACCCTGCCC GGCCGGTCGA TGTTGCTCAT CCGCCAGGTC
GGGCTGCTGA TGACCACCGA TGCGGTCCTC GACCGCGACG GCCGGCCGGT GCCCGAGGGC
ATCCTCGACG CGCTGGTCAC CGGCCTGGGC AGCGTGCATG ACCTGCGCGG CGACACCGCC
GGCGGCAACT CCCGCACCGG CTCGGCGTAC GTGGTCAAGC CCAAGATGCA CGGCCCGGAC
GAGGTCGCCT TCACCGTCGA GCTGATGTCC CGCGTCGAGC GCCTCCTCCA GCTCCCGCCG
GCGACGATCA AGCTCGGCAT CATGGACGAG GAACGCCGCA CCTCGGTCAA CCTCAGGCGC
TGCGTGTACG AGGCCCGGGA CCGCGTCGTC TTCATCAACA CCGGGTTCCT GGACCGGACC
GGCGACGAGA TCCACACCTC GATGCTCGCC GGGCCGATGG TCCGCAAGGC CGCCATGCGC
GAGGCGACGT GGATCCGCGT CTACGAGGAC AACAACGTCG ACGCCGGCCT CGCCCTCGGT
TTCCCCGGGC GGGCGCAGAT CGGCAAGGGC ATGTGGGCCG CGCCGGACAA TCTGGCCGAC
ATGATGACGC AGAAGATCGG GCACCCGCTG GCCGGCGCGT CCTGCGCCTG GGTGCCCTCG
CCTACCGCCG CGGCGCTGCA CGCACTGCAC TACCACGAGG TCTCCGTGGC GGACCGGCAG
CGCGAGCTGG CCGGCCGCCG CCCGGCCGAC CGGCTGGAGC TGCTCACCGT CCCGCTCGCG
GAACCGGCGG CCGCCTGGTC GCCGGAGGAG GTCGCCGCCG AGGTTGACAA CAACGTCCAG
GGCGTGCTCG GCTACGTCGT GCGCTGGGTC GAGCTCGGCA TCGGCTGCTC CAAGGTGCCC
GACCTGACCG GCACACCCCT GATGGAGGAC CGCGCCACCT GCCGCATCTC CTCCCAGCAC
GTGGCCAACT GGCTGCGGCA CGGCATCGTC TCCCGCGAGC AGGTCGAGGA GTCGCTGCGC
CGGATGGCCG CGCTCGTCGA CGCGCAGAAC GCCGGCGAGC CGGGGTACCG GCCGATGGCC
CCGGCCTTCG AGGAGCTGGC CTTCGCCGCC GCGCGGGCGC TGCTACTCGA GGGCGCCGAC
CAGCCGAACG GCTACACCGA ACCGCTGTTG CACGAGCACC GGCGCGCTCA GAAGAACAAG
GAGATGGTCC GCTCATGA
 
Protein sequence
MPDYIDVVGL RVASELHQFV AEEALPGSGV DQAAFWAGAT EIINDLTPRN RELLARRDEL 
QQAVDDYHRR TPGRPAQSDY AEFLTSIGYL LDEPAPFTIT TDGVDDEIAT QAGPQLVVPL
LNARYAVNAA NARWGSLYDA LYGSDVIDET DGRERAGGYN PTRGAAVIAR VREILDRSFP
LTSGSHADAT RYAVDADGLV VTVDGAAVRL ADPALFVGFR GEGDEPTAVL LTHHGLHVEI
QVDRNHPVGA GDRAGVSDVV VEAAVTTIMD LEDSVAAVDA ADKVLGYRNW LQLMQRRLTA
EVDKGGRTFT RVLAADREYT TADGGTVTLP GRSMLLIRQV GLLMTTDAVL DRDGRPVPEG
ILDALVTGLG SVHDLRGDTA GGNSRTGSAY VVKPKMHGPD EVAFTVELMS RVERLLQLPP
ATIKLGIMDE ERRTSVNLRR CVYEARDRVV FINTGFLDRT GDEIHTSMLA GPMVRKAAMR
EATWIRVYED NNVDAGLALG FPGRAQIGKG MWAAPDNLAD MMTQKIGHPL AGASCAWVPS
PTAAALHALH YHEVSVADRQ RELAGRRPAD RLELLTVPLA EPAAAWSPEE VAAEVDNNVQ
GVLGYVVRWV ELGIGCSKVP DLTGTPLMED RATCRISSQH VANWLRHGIV SREQVEESLR
RMAALVDAQN AGEPGYRPMA PAFEELAFAA ARALLLEGAD QPNGYTEPLL HEHRRAQKNK
EMVRS
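
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: it assumes NCBI translation table 11 (the table listed for this gene), under which the amino-acid assignments match the standard code and alternative start codons such as GTG are read as methionine when they initiate the CDS. The helper names `translate_table11` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Initiation codons permitted by NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate_table11(cds: str) -> str:
    """Translate a CDS from its first codon, stopping at the first stop codon."""
    dna = clean(cds)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate_table11("GTGCCTGACT ACATCGACGT CGTCGGACTG"))  # MPDYIDVVGL
```

Applied to the first 30 bases of the gene sequence, the sketch reproduces the first ten residues of the listed protein sequence, including the GTG start read as M. Calling `gc_fraction` on the full gene sequence gives a value to compare against the GC content field.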