Gene Information |
Locus tag | Franean1_1090 |
Symbol | |
ID | 5669504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1301369 |
End bp | 1303165 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240022 |
Product | acetolactate synthase 1 catalytic subunit |
Protein accession | YP_001505452 |
Protein GI | 158312944 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
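The length fields in this record follow from its coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither convention is stated in the record itself). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1301369, 1303165)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1797 598
```

Both values agree with the Gene Length (1797 bp) and Protein Length (598 aa) fields.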
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAG AGACCAGACA GATCACCGGA GCGCAGTCGC TCGTCCACTC GCTCGAGGCG GTCGGTGCCG ACGTGGTCTT CGGCATTCCG GGCGGGGCGA TCCTGCCCGC CTACGATCCG CTGTTCGACT CCACCCGGGT CCGGCACGTG CTCGTCCGGC ACGAGCAGGG CGCCGGGCAC GCCGCCGAGG GGTATGCCCA GGCCACCGGG CGTGTCGGGG TCTGCATGGC GACGTCCGGG CCGGGCGCGA CGAACCTCGT GACACCGATC GCGGACGCCT ACATGGACTC GGTGCCCATC GTCGCCATCA CCGGGCAGGT GCCGAGCGCG GCGATCGGCA CCGACGCCTT CCAGGAAGCC GACATCTGCG GCATCACGCT GCCGATCACC AAGCACAACT TCCTGGTGCA GTCGGCGGAC GACATCCCAC GCACGATCGC CGAGGCGTTC CACATCGCCT CGACGGGGCG CCCGGGCCCG GTGCTGGTCG ACCTGCCGAA GGACATCCTG CAGGCGGTCA CGTCGGTCCT GCCGCACGAG GTGTGGCCGC CGACGCTCGA CCTGCCCGGC TACCGCCCGG TCACCCGGCC GCACGCCAAG CAGGTCCGCG AGGCGGCCAA GATGATCAGT GCGGCGCGCC GGCCGGTGCT CTACGTGGGC GGTGGGGTGC TGAAGGCCCG TGCCGCCGCC GAGCTGCGGG TGCTCGCCGA GCTCACCGGC ATCCCGGTGG TCACCACGCT GATGGCGCGC GGCGCGTTCC CCGACTCCCA CCCGCAGCAC CTGGGCATGC CCGGGATGCA CGGCTCCGTC GCCGCGGTCA CCGCGATGCA GAAGGCCGAC CTGCTGATCA CCCTGGGGGC CAGGTTCGAC GACCGGGTCA CCGGCAAGCT GTCGTCATTC GCGCCCGGCG CGGCCGTCAT CCACGCCGAC ATCGACCCGG CCGAGATCGG CAAGAACCGG ATCGCCGACG TGCCGATCGT GGGCGACTGC CGGGAGGTCA TCGTCGACCT GACCGCGGCA CTGCGCACCG AGGAGCGCCC CGACCTCGAG GGCTGGTGGC GGTCGCTGGA CAGGTGGCGC GCGACCTACC CGCTCGGCTA CGACCAGCCC GCCGACGGCT CGCTCGCCCC GCAGCAGGTC ATCGAGCGGA TCGGCCGGAT CGCCGGCCCG GAGACCGTTT TCGCGGCCGG GGTGGGGCAG CACCAGATGT GGGCCGCCCA GTTCATCTCC TACGAGCACC CCTACACCTG GCTGAACTCC GGCGGCGCCG GCACCATGGG CTACGCCGTG CCGGCCGCGA TGGGCGCCAA GGTCGGCCGC CCGGAGGCCA CCGTGTGGGC GATCGACGGC GACGGCTGCT TCCAGATGAC CAACCAGGAG CTCGCCACCT GCGCCCTGGA GGGCATCCCG ATCAAGGTCG CGGTCATCAA CAACGGTTCG CTGGGCATGG TCCGGCAGTG GCAGACGCTC TTCTACGACA AGCGCTACTC GAACACCGAC CTGGGCACCC ACCCGGTGTC GGCGCGCACC GGGGTGACCC GGGTCCCCGA CTTCGTGCGC CTCGCCGAGG CCCTGGGCTG CGTCGGCCTG CGCTGTGAGT CGCCCGCGGA CGTGGACGCC ACGATCGAGA AGGCGATGAG CATCGACGAC GCCCCGGTCG TGGTCGACTT CGTGGTCCAC CCCGACGCGA TGGTGTGGCC GATGGTGGCC GCCGGCTCGA GCAACGACGA GATCCGCGTC GCCCGCGACA TCGCCCCCGA CTTCGACTAC TCCGGCGACG CGGAGGTCAA CCTGTGA
|
Protein sequence | MTRETRQITG AQSLVHSLEA VGADVVFGIP GGAILPAYDP LFDSTRVRHV LVRHEQGAGH AAEGYAQATG RVGVCMATSG PGATNLVTPI ADAYMDSVPI VAITGQVPSA AIGTDAFQEA DICGITLPIT KHNFLVQSAD DIPRTIAEAF HIASTGRPGP VLVDLPKDIL QAVTSVLPHE VWPPTLDLPG YRPVTRPHAK QVREAAKMIS AARRPVLYVG GGVLKARAAA ELRVLAELTG IPVVTTLMAR GAFPDSHPQH LGMPGMHGSV AAVTAMQKAD LLITLGARFD DRVTGKLSSF APGAAVIHAD IDPAEIGKNR IADVPIVGDC REVIVDLTAA LRTEERPDLE GWWRSLDRWR ATYPLGYDQP ADGSLAPQQV IERIGRIAGP ETVFAAGVGQ HQMWAAQFIS YEHPYTWLNS GGAGTMGYAV PAAMGAKVGR PEATVWAIDG DGCFQMTNQE LATCALEGIP IKVAVINNGS LGMVRQWQTL FYDKRYSNTD LGTHPVSART GVTRVPDFVR LAEALGCVGL RCESPADVDA TIEKAMSIDD APVVVDFVVH PDAMVWPMVA AGSSNDEIRV ARDIAPDFDY SGDAEVNL
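The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind behind the GC content field. The helper names are hypothetical, and the demo runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGACCAGAG AGACCAGACA"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_content(first_blocks))           # 0.5
```

Passing the full Gene sequence value to `gc_content` gives the whole-gene fraction, which can be compared against the 72% reported in the record.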