Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4254 |
Symbol | |
ID | 5672609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5073629 |
End bp | 5074873 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243127 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_001508544 |
Protein GI | 158316036 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.218969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.848752 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC AGCGTCAGGA CGCCGCACCC GCCAGCCAGC CGGCGCCGCC ACCACGGCCA CCATCGGGTC CGCGGCCGAC GCCGGCACCG GCACCGCCGT CGCCGTCGCC GTCGCCGCCG CCGCCGGCGT CGGCCGCGTG GCGCCCCGGC GACCCGGTGG GAGATCGCCG GTTCGTCAGG TCGGCCGGCC CGCTGACCCT GGAACGCGGT GGTTCGCTGC CCGAGGTGAC GATCGCCTAC GAGACGTGGG GCACGCTCGC GCCCGACGCG GGTAACGCCG TGCTGGTGCT GCACGCGCTC ACCGGTGACA GTCACGCCGC CGGCGGCGCC GGGCCGGGGC ATCCGACCCC GGGCTGGTGG GGTGGGCTGG TGGGGCCGGG CGCAGTCCTC GACACCGACC GGTTCTTCGT CGTCTGCCCG AACGTCCTCG GCGGCTGTCA GGGCACGACG GGCCCGGCCT CCGCCGCATC AGACGGACGT CCCTGGGGCG GCCGCTGGCC GGAGATCACC GTCGGCGACC AGGTCAGGGC GGAGGCCCTG CTGGCCGACG AGCTCGGCGT GGGCCGGTGG GCGGCCGTCA TCGGCGGCTC GATGGGCGGG ATGAGGGCGC TGGAATGGGC GCTGGCGTTC CCCGCCCGGG TACGGCGCGC CGTCGTGATC GCCTGCGGAG CGACGGCGAC CGCCGAACAG ATCGCGCTCT ACGCCACCCA GCTCGCGATG ATCCGCGCCG ACCCGAACTG GCACGGCGGT GACTACCACG ACCGCCCGCC CGGGGCCGGG CCGCACGTCG GGCTGGGCCT GGCCCGGCAG ATGGGGCAGG TCAGCTACCG CAGCGAGCGC GAGCTGGCCC ACCGGTTCGG CAACGCGGTG CAGGCGGACG GCCGCTACGC CGCCGCATCC TACGTCGAGC ATCACGGGGC GAAGCTGGCG CACCGTTTCG ACGCCGGCAG TTACCTCACG CTGACCGCGG CGATGATGAG CCAGGACGCC GGCCGCGGCC GTGGCGGCGT GCCGGCGGCG CTGCGGGCCT GCCCCGTCCC GGTGACGGTC GCCGGCATCG ACAGTGACCG CCTCTACCCG CCGCGCCTGC AGGCCGAGCT CGCCCGCCAC CTGGGCACCG AGCTGCGTCT CGTCCCGTCC GCGTCGGGCC ACGACGGCTT CCTGCTGGAG ACGGCCGCCG TCGGCCAGAT CGTGCGCGAC GCGCTCACTC CCGCGGCGAC GGGCCCAAGG ACACCGTCGT CATGA
|
Protein sequence | MTDQRQDAAP ASQPAPPPRP PSGPRPTPAP APPSPSPSPP PPASAAWRPG DPVGDRRFVR SAGPLTLERG GSLPEVTIAY ETWGTLAPDA GNAVLVLHAL TGDSHAAGGA GPGHPTPGWW GGLVGPGAVL DTDRFFVVCP NVLGGCQGTT GPASAASDGR PWGGRWPEIT VGDQVRAEAL LADELGVGRW AAVIGGSMGG MRALEWALAF PARVRRAVVI ACGATATAEQ IALYATQLAM IRADPNWHGG DYHDRPPGAG PHVGLGLARQ MGQVSYRSER ELAHRFGNAV QADGRYAAAS YVEHHGAKLA HRFDAGSYLT LTAAMMSQDA GRGRGGVPAA LRACPVPVTV AGIDSDRLYP PRLQAELARH LGTELRLVPS ASGHDGFLLE TAAVGQIVRD ALTPAATGPR TPSS
|
| |