Gene Information |
Locus tag | Francci3_1228 |
Symbol | |
ID | 3902973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1470493 |
End bp | 1472118 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878561 |
Product | malate synthase |
Protein accession | YP_480335 |
Protein GI | 86739935 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01344] malate synthase A |
|
|
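The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 1470493, End bp 1472118.
length_bp = gene_length(1470493, 1472118)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the result agrees with the listed Gene Length (1626 bp) and Protein Length (541 aa).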
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.112102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.512222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTCGT TGGACGGGGT GACGGTGCAC GGGGTGTCCG CGGTGCGGAC CGTGCCGGGG CTCACCGCGG AGCAGGTCGA CGCGGTGCTC TCCGACAACG CGCTGGCCTT CGTCGCCGGG TTGCACCGCA CGTTCGCCGG CCGGCGTGCC GAACTGCTCG CCGCCCGGGC CGCCCGCCGC GCCGCGATCG CAGCCGGGGC CACCCTGGAT TTCCTGCCGC AGACCGCCGA CATCCGGGCC GGGAACTGGC GGGTCGCGTC GCCGGCGCCG GGGCTGGTCG ACCGTCGGGC CGAGATCACC GGCCCGACCG ACGCCAAGAT GCTCATCAAC GCGCTCAACA GCGGTGCCCG GGTCTTCATG GCGGACCTCG AGGACGCCAA CGTGCCGACC TGGTCGAACA TGGTCGTCGG ACAGCACAAC CTGAGTGAGG CCGTCGCCGG CACGCTCGCC TTCACCTCGC CCGACGGCCG TCGTTACGAG CTCGACGAGT CCACGGCGAC GCTCGTGGTG CGCCCGCGCG GCTGGCACCT GCCGGAGCGG CACGTCACCG TCGACGGGGA GCCGATCGTC GCCGCGCTCT TCGACGCCGG AATGTACCTG GTCCGCAACG CGCACGCCCT GCGGGCTACG GGCGTGGCGC CGTACTTCTA CCTGCCGAAG CTGGAGAGTC ATCTCGAGGC CCGGCTGTGG AACGACGTGT TCACCGCGGC GCAGGCCGAG CTCGGCCTGC CCGTCGGCAC CATCCGGGCG ACCGTTCTCA TCGAGACGCT GCCCGCCGCC TTCGAGATGG AGGAGATCCT CTACGAGCTG CGGGAGCATT CCGCCGGACT CAACGCTGGC CGCTGGGACT ACATGTTCTC CACCATCAAG ACGTTCGCGT CCCGGCCGAC TGAGTTCCTG CTGCCCGACC GCAACGGCGT GACGATGACG GTGCCGTTTC TGCGCGCCTA CACCGAGCTG CTGGTCTCCA CCTGTCATCG CCGCGGCGCG CACGCGATCG GCGGGATGGC GGCGTTCATC CCGTCGCGGC GCGACCCCGA GATCAACGCC GCGGCCCTGG CGAAGGTGCG CGCCGACAAG GAGCGGGAGT CCGCGGACGG GTTCGACGGG AGCTGGGTGG CGCACCCTGA CCTGGTGCCG GTGTGCACCG AGGTCTTCGA TGCGGTGCTC GGCGACGAGC CGAACCAGCT CACCCGGTTG CGTGACGACG TCAAGGTCGG CGCCGGCGAC CTGCTGGCGG TGCGCGACAC TCCGGGTTCC GTCACCGCGG CCGGGGTCCG CGGGAACATC AGCGTCGGGG TGCGCTACCT GGAGAGCTGG CTGCGCGGGA TCGGGGCGGT CGGCATCGAC AACCTGATGG AGGACGCCGC CACCGCTGAG ATCTCCCGTA GTCAGATTTT CCAGTGGATA GCCGCCGGGG TCGTGCTCGA CGACGGCCGT CCGGTCACGG CCGATCTGGT CCGGACCGCT CTGGCGGAGG TGCTGGACCA GATCCGGCTC TCCATCGGCG CCGCCGCCTT CGACAACGGT CGCTGGAAGG ACGCGGCGGC GGTGTTCGAG GAGACGGCGC TCGGCGAGAC CTTCGTCGAG TTCCTTACTC TTCCCGCCTA CGAGCGGATC GACTGA
|
Protein sequence | MGSLDGVTVH GVSAVRTVPG LTAEQVDAVL SDNALAFVAG LHRTFAGRRA ELLAARAARR AAIAAGATLD FLPQTADIRA GNWRVASPAP GLVDRRAEIT GPTDAKMLIN ALNSGARVFM ADLEDANVPT WSNMVVGQHN LSEAVAGTLA FTSPDGRRYE LDESTATLVV RPRGWHLPER HVTVDGEPIV AALFDAGMYL VRNAHALRAT GVAPYFYLPK LESHLEARLW NDVFTAAQAE LGLPVGTIRA TVLIETLPAA FEMEEILYEL REHSAGLNAG RWDYMFSTIK TFASRPTEFL LPDRNGVTMT VPFLRAYTEL LVSTCHRRGA HAIGGMAAFI PSRRDPEINA AALAKVRADK ERESADGFDG SWVAHPDLVP VCTEVFDAVL GDEPNQLTRL RDDVKVGAGD LLAVRDTPGS VTAAGVRGNI SVGVRYLESW LRGIGAVGID NLMEDAATAE ISRSQIFQWI AAGVVLDDGR PVTADLVRTA LAEVLDQIRL SIGAAAFDNG RWKDAAAVFE ETALGETFVE FLTLPAYERI D
|
| |
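The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch, using only the Python standard library, of how such a grouped gene sequence could be cleaned, translated, and checked for GC content. The function names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here; alternative start codons are not handled.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(grouped: str) -> str:
    """Remove the whitespace used to display the sequence in blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_content(grouped: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(grouped)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(grouped: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(grouped)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown in the record.
prefix = "ATGGGCTCGT TGGACGGGGT GACGGTGCAC"
print(translate(prefix))
```

Translating the first three blocks of the gene sequence gives MGSLDGVTVH, which matches the first block of the listed protein sequence. Applying `gc_content` to the full gene sequence would be the way to compare against the GC content field.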