Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6970 |
Symbol | |
ID | 5675282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8494921 |
End bp | 8497098 |
Gene Length | 2178 bp |
Protein Length | 725 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245818 |
Product | malate synthase G |
Protein accession | YP_001511209 |
Protein GI | 158318701 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.965318 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTGACT ACATCGACGT CGTCGGACTG CGCGTCGCCA GCGAGCTGCA CCAGTTCGTC GCCGAGGAGG CGCTGCCCGG GTCGGGTGTC GACCAGGCCG CGTTCTGGGC CGGCGCCACG GAGATCATCA ACGATCTGAC CCCGCGCAAC CGGGAGCTAC TGGCGCGGCG CGACGAGCTC CAGCAGGCTG TGGACGACTA CCACCGGCGC ACGCCGGGCC GGCCGGCGCA GAGCGACTAC GCCGAGTTCC TGACCTCCAT CGGCTACCTG CTCGACGAGC CCGCGCCGTT CACCATCACC ACCGACGGCG TGGACGACGA GATCGCGACC CAGGCCGGCC CGCAGCTGGT GGTGCCGCTG CTCAACGCGC GGTACGCGGT CAACGCCGCC AACGCCCGGT GGGGCTCGCT CTACGACGCC CTGTACGGCA GCGACGTCAT CGACGAGACA GACGGCAGGG AGCGCGCCGG CGGCTACAAC CCGACCCGCG GCGCCGCGGT CATCGCCCGG GTCCGGGAGA TCCTCGACCG GAGCTTCCCG CTGACCAGCG GCTCGCACGC GGACGCGACC CGGTACGCGG TCGACGCCGA CGGGCTCGTC GTCACCGTCG ACGGGGCCGC CGTCCGGCTG GCCGACCCCG CGCTGTTCGT CGGCTTCCGC GGCGAGGGCG ACGAGCCGAC GGCTGTCCTG CTGACCCATC ACGGGCTGCA CGTCGAGATC CAGGTCGACC GGAACCACCC GGTCGGGGCG GGCGACCGCG CGGGCGTCAG CGACGTCGTC GTCGAGGCCG CCGTCACCAC GATCATGGAC CTGGAGGACT CGGTCGCGGC CGTCGACGCC GCGGACAAGG TGCTCGGCTA CCGCAACTGG CTGCAGCTCA TGCAGCGCCG GCTCACCGCC GAGGTGGACA AGGGCGGCCG CACCTTCACC CGGGTCCTCG CCGCCGACCG CGAGTACACG ACCGCCGACG GCGGCACAGT CACCCTGCCC GGCCGGTCGA TGTTGCTCAT CCGCCAGGTC GGGCTGCTGA TGACCACCGA TGCGGTCCTC GACCGCGACG GCCGGCCGGT GCCCGAGGGC ATCCTCGACG CGCTGGTCAC CGGCCTGGGC AGCGTGCATG ACCTGCGCGG CGACACCGCC GGCGGCAACT CCCGCACCGG CTCGGCGTAC GTGGTCAAGC CCAAGATGCA CGGCCCGGAC GAGGTCGCCT TCACCGTCGA GCTGATGTCC CGCGTCGAGC GCCTCCTCCA GCTCCCGCCG GCGACGATCA AGCTCGGCAT CATGGACGAG GAACGCCGCA CCTCGGTCAA CCTCAGGCGC TGCGTGTACG AGGCCCGGGA CCGCGTCGTC TTCATCAACA CCGGGTTCCT GGACCGGACC GGCGACGAGA TCCACACCTC GATGCTCGCC GGGCCGATGG TCCGCAAGGC CGCCATGCGC GAGGCGACGT GGATCCGCGT CTACGAGGAC AACAACGTCG ACGCCGGCCT CGCCCTCGGT TTCCCCGGGC GGGCGCAGAT CGGCAAGGGC ATGTGGGCCG CGCCGGACAA TCTGGCCGAC ATGATGACGC AGAAGATCGG GCACCCGCTG GCCGGCGCGT CCTGCGCCTG GGTGCCCTCG CCTACCGCCG CGGCGCTGCA CGCACTGCAC TACCACGAGG TCTCCGTGGC GGACCGGCAG CGCGAGCTGG CCGGCCGCCG CCCGGCCGAC CGGCTGGAGC TGCTCACCGT CCCGCTCGCG GAACCGGCGG CCGCCTGGTC GCCGGAGGAG GTCGCCGCCG AGGTTGACAA CAACGTCCAG GGCGTGCTCG GCTACGTCGT GCGCTGGGTC GAGCTCGGCA TCGGCTGCTC CAAGGTGCCC GACCTGACCG GCACACCCCT GATGGAGGAC CGCGCCACCT GCCGCATCTC CTCCCAGCAC GTGGCCAACT GGCTGCGGCA CGGCATCGTC TCCCGCGAGC AGGTCGAGGA GTCGCTGCGC CGGATGGCCG CGCTCGTCGA CGCGCAGAAC GCCGGCGAGC CGGGGTACCG GCCGATGGCC CCGGCCTTCG AGGAGCTGGC CTTCGCCGCC GCGCGGGCGC TGCTACTCGA GGGCGCCGAC CAGCCGAACG GCTACACCGA ACCGCTGTTG CACGAGCACC GGCGCGCTCA GAAGAACAAG GAGATGGTCC GCTCATGA
|
Protein sequence | MPDYIDVVGL RVASELHQFV AEEALPGSGV DQAAFWAGAT EIINDLTPRN RELLARRDEL QQAVDDYHRR TPGRPAQSDY AEFLTSIGYL LDEPAPFTIT TDGVDDEIAT QAGPQLVVPL LNARYAVNAA NARWGSLYDA LYGSDVIDET DGRERAGGYN PTRGAAVIAR VREILDRSFP LTSGSHADAT RYAVDADGLV VTVDGAAVRL ADPALFVGFR GEGDEPTAVL LTHHGLHVEI QVDRNHPVGA GDRAGVSDVV VEAAVTTIMD LEDSVAAVDA ADKVLGYRNW LQLMQRRLTA EVDKGGRTFT RVLAADREYT TADGGTVTLP GRSMLLIRQV GLLMTTDAVL DRDGRPVPEG ILDALVTGLG SVHDLRGDTA GGNSRTGSAY VVKPKMHGPD EVAFTVELMS RVERLLQLPP ATIKLGIMDE ERRTSVNLRR CVYEARDRVV FINTGFLDRT GDEIHTSMLA GPMVRKAAMR EATWIRVYED NNVDAGLALG FPGRAQIGKG MWAAPDNLAD MMTQKIGHPL AGASCAWVPS PTAAALHALH YHEVSVADRQ RELAGRRPAD RLELLTVPLA EPAAAWSPEE VAAEVDNNVQ GVLGYVVRWV ELGIGCSKVP DLTGTPLMED RATCRISSQH VANWLRHGIV SREQVEESLR RMAALVDAQN AGEPGYRPMA PAFEELAFAA ARALLLEGAD QPNGYTEPLL HEHRRAQKNK EMVRS
|
| |