Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3751 |
Symbol | |
ID | 3906035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4499491 |
End bp | 4500717 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881077 |
Product | malate dehydrogenase |
Protein accession | YP_482831 |
Protein GI | 86742431 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.131564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCAA CCACTGACCA GCCGACTCTC GGTTCCACCA CGTCGGACTT CGCCGGGCTC AGCCCGCGGC CGTCCGACGA CGATCCCGCG TTCGCCCTGC ACCGCGGAGG AAAGATCAAC ATCAACGCTA CCGCTCCATT GAACAGTCGG GAAGACCTGT CCCTGGCCTA TACGCCCGGA GTCGCGCGGG TGTGTACCGC CATCGCCGGG ACTCCGGCCC TTGCCGACGA CTACACCTGG CGGTCTAACA CCGTTGCCGT GGTCACGGAT GGGACGGCCG TGCTCGGCCT CGGTGACATC GGCCCCGCGG CCTCCCTGCC GGTGATGGAG GGCAAGGCCG CGTTGTTCAA GCGCTTCGGT GGTGTGGACG CGGTGCCACT GGCCCTTGCC TGCACGGACG TCGAGGAGAT CGTCGACACC GTCGCGCGCC TCGCTCCGAG CTTCGGCGGG ATCAACCTCG AGGACATCTC GGCACCGCGC TGCTTCGCCA TCGAGCGCCT CCTGCAGGAC CGCGTCGACA TCCCGGTCTT CCACGACGAC CAGCACGGAA CGGCCATCGT CGCGCTGGCC GCGCTGGAGA ATGCGGCGAA GCTCACCGGA CGCACCCTCG GTGATCTGCG TGCCGTGGTG TCCGGCGCGG GCGCTGCCGG CGCCGCCGTG GCCCGCATCC TGCTGGCCGC CGGCATCGGG GACATCGCGG TTGGCGACAG CCGCGGCATC CTCTACGCCG GTCGGGACAA CCTCACCCCG AGCAAGGCCG CGCTCGCCGC CGAGACCAAC CGTGCCGGCC TGGCCGGGTC CATCACCGAC GCGCTCGCCG GCGCCGATGT CTTCCTCGGG CTGTCCGCCG GGCAGGTCCC GGAGGAGGCG GTCGCCACCA TGGCGCCGGA GGCGATCATT TTCGCGATGG CCAACCCCGA TCCCGAGATC GACCCGGCGG TCGCCCACAA GTACGCGCGG ATCGTGGCGA CCGGTCGCAG CGACTACCCG AACCAGATCA ACAACGTCCT CGCCTTCCCC GGAATCTTCC GGGGAGCGCT CGATGTCCGG GCCAGCCGCG TGACGGAGGG CATGAAGCTC GCCGCCGCGA GGGCGCTCGC CGCGGTGATC GCGGACGAGC TGGCCGAGGA TCTCATCATC CCGAGCGTGT TCGACGACCG GGTCGCCCCG GCCGTCGCCG CGGCGACGGC CGCGGCCGCG CGGGCGGACG GGGTGGCCCG CCGCTGA
|
Protein sequence | MTATTDQPTL GSTTSDFAGL SPRPSDDDPA FALHRGGKIN INATAPLNSR EDLSLAYTPG VARVCTAIAG TPALADDYTW RSNTVAVVTD GTAVLGLGDI GPAASLPVME GKAALFKRFG GVDAVPLALA CTDVEEIVDT VARLAPSFGG INLEDISAPR CFAIERLLQD RVDIPVFHDD QHGTAIVALA ALENAAKLTG RTLGDLRAVV SGAGAAGAAV ARILLAAGIG DIAVGDSRGI LYAGRDNLTP SKAALAAETN RAGLAGSITD ALAGADVFLG LSAGQVPEEA VATMAPEAII FAMANPDPEI DPAVAHKYAR IVATGRSDYP NQINNVLAFP GIFRGALDVR ASRVTEGMKL AAARALAAVI ADELAEDLII PSVFDDRVAP AVAAATAAAA RADGVARR
|
| |