Gene Information |
Locus tag | Franean1_4037 |
Symbol | |
ID | 5672395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4813463 |
End bp | 4814656 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242913 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_001508330 |
Protein GI | 158315822 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases |
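The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(4813463, 4814656)
print(length, protein_length_aa(length))  # prints "1194 397"
```

Under these assumptions the result matches the Gene Length (1194 bp) and Protein Length (397 aa) fields of this record.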
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.301808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCGGAGC ATGAAGCGGT AATCGTCTCG ACCGCACGGA CGGCGATCGG CACCGCGGGC AAGGGTTCCC TCGTCGACGT CGACGCTTTC GAGCTGGGTA CGCAGGCCGT CGCGGAGGCA GTCCGGCGGT CCGGTATCGA CAGCGCGGAC ATTGATGACG TCGTCCTGGG TGAATCGCTT TACGGCGGCG GCGACATCGC GCGTTACGCG GCCATCGAGG CTGGTTTCGC GAACGCCGCC GGCGTCGCGC ACAACCGGCA CTGCGCGTCG GGGCTCGTCG CGGTGCAGAC CGCCGCCGCC TCGATCATCT CCGGGATGGA CCGTGTCATC GTCGCGGGTG GGACGAACTC CTCCTCCACC TCGCCACGGG CGAAGCGGCG CCGGCCCGGA ACCGACGAGG TCGAGGACTG GTACTCGCCG ACCCACCGGA ACACCGCCGA GGCCCCGAAC TTCGACATGT CCATCACCGT CGGGTGGAAC GCGGCCGTCA AGGCCGGCGT CAGCCGCGAG GAGATGGACG CGTGGGCGCT GCGCTCGCAC CAGCGGGCCG TCGCGGGCAT CGACGCGGGC AGCTTCACCG ACGAGATCTT CCCGATCGAG GTGCCCCGGC GGGACGGCAC GAGCTTCACC TTCGCCGTGG ACGAGCACCC GCGGCGCACC ACCAGCATGG AGAAGCTGGC CTCGCTCAAG CCGCTGCACC CCGAGATCGA GGGCTTCAGC ATCACGGCCG GCAACGCGGC GGGCGCGAAC GACGGCGCCG CCGCCATGGT CATCACTGAC GGCGGCTACG CCGCGGAGCA CGGCCTGGAG CCGCTGGGCA TCGTCCGGGC CTGGGCGTCG GTCGGGGTGC CCCCGGCCGA GACCGGCATC GCCCCCACAC TGGCGATCCC GAAGGCCCTC AAGCGCGCCG GCCTCACCGT GGCCGACGTC GACCTGTGGG AGATCAACGA GGCGTTTGCG TCCGTGGCGG TCGCGGCATC CCGGATCCTG GAGATCGACG ACAGCCGCGT CAACTTCCTG GGCAGCGGTT GCAGCCTCGG GCACCCGATC GCGACGACCG GCGCGCGGCA GATCATCACG TTGATCCATG AGCTGCGCCG CCGTGGCGGC GGAGTGGCGG TGTCGGCGAT GTGCGCCGGC GGGGGCATGG CAAGCGCCCT CGTGCTGGAG GTTCCCGCTC CGCGCTCGAG CTGA
|
Protein sequence | MSEHEAVIVS TARTAIGTAG KGSLVDVDAF ELGTQAVAEA VRRSGIDSAD IDDVVLGESL YGGGDIARYA AIEAGFANAA GVAHNRHCAS GLVAVQTAAA SIISGMDRVI VAGGTNSSST SPRAKRRRPG TDEVEDWYSP THRNTAEAPN FDMSITVGWN AAVKAGVSRE EMDAWALRSH QRAVAGIDAG SFTDEIFPIE VPRRDGTSFT FAVDEHPRRT TSMEKLASLK PLHPEIEGFS ITAGNAAGAN DGAAAMVITD GGYAAEHGLE PLGIVRAWAS VGVPPAETGI APTLAIPKAL KRAGLTVADV DLWEINEAFA SVAVAASRIL EIDDSRVNFL GSGCSLGHPI ATTGARQIIT LIHELRRRGG GVAVSAMCAG GGMASALVLE VPAPRSS
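The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a blocked string could be cleaned up and its GC content computed is shown below; `gc_percent` is a hypothetical helper name, and the example input is only the first two blocks of the gene sequence, so it is not expected to reproduce the 71% reported for the full gene.

```python
def gc_percent(blocked_sequence: str) -> float:
    """GC content (in percent) of a nucleotide string printed in spaced blocks."""
    # Remove the spaces between the 10-base blocks and normalise case.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence from this record.
example = "TTGTCGGAGC ATGAAGCGGT"
print(gc_percent(example))
```

Passing the complete Gene sequence string to the same function gives a value that can be compared against the GC content field.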