Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1240 |
Symbol | |
ID | 5669653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1488663 |
End bp | 1491425 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240172 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_001505600 |
Protein GI | 158313092 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0677298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00305841 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACGCCG GCCCCGATCC TGGCCTGCCG CGCGACTACC CCGCCCACTG GGAGGCCGAC GTCATCCTGT CGGACGGCGG GACGGCGCAT ATCCGTCCGA TCCGGCCGTC GGACGGGGCG CTGCTGCGGC CGTTCTGGTC CCGGCTGTCC CAGCGGACGA TCTACTTCCG GTACTTCAAC GTCCGGCGCG GGCTCAGTGA CGAGGACATC GCCCGGACGA CCAACGTCGA TCAGTGGGTC CGCGGCGCCC TGGTGGCGCT GATCAGCGGC GAGATCGTCG CCCTCGCGCA CTGGGAGGGC CGCCCCCGCT CTGCCGCCGA CGCGGCCGGT CCACCGAACG CCCCCGGCGC CCCCGGCACC CCCGACACCA CCAGCACCCC TGACGCTGCC GGAAGCCGCG GGAGCGCCGA TGACCGTCCG GCGCCCGACG CGGAGGTCGC CTTCCTCGTC GAGGACGCCC AGCAGGGCCG CGGGCTCGGC TCGGTGCTGC TGGAGCACCT GGCCGCGGCC GCGGCCGAGC GCGGGGTGCG CCGCTTCGAC GCCGACGTCC TCAGCGAGAA CCAGCAGATG ATCCGGGTGT TCCTCGACGC CGGCTACACC GTGGCGCGCG CCTGGGAGTC CGGCGGGGTC CGGCTGTCGT TCGACATCGC GCCGACGGCC CGCTCGGTGG ACGTCATGCG CGCCCGGGAG CACCGGGCCG AGGCCGCGTC GATGAACCGG CTGCTGCATC CCAGGGCCAT CGCGGTCGTC GGCGCCGGCC GGGACCGCTC CTCACTGGGC AACATCGTCC TGCGCAACCT GCTCGCCGGC GGCTTCGACG GCCCGGTGTA CCCGGTCAAC CCGGCCGCCG CGGCGGGGGA GGGTGCCGTC GCCTCGGTCC GGGCGTACGC CTCGGTGGAG GACACGCCGC GGCCCGTCGA CCTCGCGGTG CTCTGCGTGT CCGCGGAGGT GATCCCGGCG GTCGTCGCCG CCTGCGGCCG GCACGGCGTG CGCGGTCTGG TCGTGGTGAC CGACCAGCGG GACGACGCCG CCGACGCGCG GCTCGCCTCC GACGCCCGCG CGAACGGTAT GCGGGTGGTC GGCCCGGCCA GCCTCGGCAT CCAGAACCCG GCGGTGGGGC TGAACGCCTC GCTGGTCGAG CGGATGCCGC CGGCCGGCCG CATCGGCTGC TACTCGCAGT CCGGGCCGCT CGGCGGGGCG CTGCTGGAGG CCGCCGCGGG CCGTCGGCTG GGGTTCTCGG TCTTCGTCTC CGCGGGCGAC CGCGCGGACG TCAGCGGCAA TGATCTCCTG CAGTACTGGG AGGCGGACCC GTCCACGGGT GTGGCGCTGA TGCACCTGGA GACCTTCGGG AACCCGCGCA AGTTCGCCCG GCTGGCGCGC CGGCTCGGCC GCGACACCCC GGTCGTGGTC GTGCTCTCCG AGCGCACCCC GCTGGACGAG GCCCTGCTGC GCCAGGCCGG GGTGATCGGC GTCGACCGGG TCTCGCAGGG CCTGGACGTG GCGCTGCTGC TCGCCAACCA GCCGCTGCCG GGGGGCAACC GGGTCGCCGT GGTCGGCGAC TCACGGGCCC TGGTGGGGTT CACCGCCCGG GCGGCCGACG CGGCCGGGCT CGCGGTGCGG GAGGTGCTGC TGCCGGTCGG CAGCACCGCC GAGGCGTTCC GCGACGCTCT CGTCTCGGCC TCGGCCGAGG TGGACGCCCT GCTCGTGATC GCGGTGCGGC TGCCGTCGTC GCTGCCCGGC CTCGCGGCCG CGGGAGTCGG CGTGGCCGCG GGAATCGCCG CCGCCGCCGC CGCGCCGCTG GTGCGGGTGC CCCTGCTGGC GACCGTGCGG GCCACGGAGG CGTCGCCGGA GCTCGGCGCG ATACCGGCCT ATCCCTCGCC CGAGGGCGCG GTCGCGGCGC TGCGCCGGGC CGTCGGCTAC GCCCACTGGC GGGCCCTGCC CTCCGGGGCG GTGCCGGCCA CCCAGGTGCG GGCCGAGGAG GCCCGCCGGC TCGTGGCGGG GACGACCGGC CGCCTCACCG ACGGCGCGGC CGGCGAGCTC CTGGCCTGCT ACGGCATCGA GGTGGTGCCC CGCCGGGTCA TCGGCGGCGC TGATGAGGCG GTCGAGGCCG CCGCCCTGCT CGGCTGGCCC GTGGTGCTCA AGGCGCTGTC CGACGGGTAC CGGCACCGGC CGGACCTCGG GGGGCAGCGC CTGGACCTGC CCGACCCGGC GGCCGTGCGC GCCGCGTGGC GCTCGCTGGC CGAGCGGCTC GGGCCGGGCG CGCCGATCGT CGCGCAGCGG ATGGTCCCCG GCGGGGTCGC GGTGGTCGCC GGAGCCGAGC AGCATCCCCG GTTCGGCCCG CTGGTGTCGT TCGGGCTGGC CGGCCCGGCC ACCGAGCTGC TGGGTGACCG GGTGCACCAC ATCCTCCCGC TGACCGACGC CGACGCCGCG CGCCTTGTCC GCTCGGTGCG CGCGGCGCCG CTGCTGTTCG GCTACCGCGG CGCCGAGCCG GTGGACGTCG CCGCGCTCGA GGATCTGCTC CTGCGGCTGG CCCGCCTCGT GGATGACATC GGCGGGGTGA AGCACCTCAC ACTCGAGCCC GTGATCGTCT CGGTGGACCG GGTAAGCGTG CTGTCCGCGG ACATCGTCCT GGCACCGCCA ACCCCTCGCG CGGACGCCGG TCCGCGCCGG TTCTGGCGCC CCGTCGCGGA CCTGCCGGCC AGGCCGGAGC CCCGGGCCGG TTCGGCACAA CGTGTCCCCA CCGTCCACAA TCGTCTGCCA TGA
|
Protein sequence | MNAGPDPGLP RDYPAHWEAD VILSDGGTAH IRPIRPSDGA LLRPFWSRLS QRTIYFRYFN VRRGLSDEDI ARTTNVDQWV RGALVALISG EIVALAHWEG RPRSAADAAG PPNAPGAPGT PDTTSTPDAA GSRGSADDRP APDAEVAFLV EDAQQGRGLG SVLLEHLAAA AAERGVRRFD ADVLSENQQM IRVFLDAGYT VARAWESGGV RLSFDIAPTA RSVDVMRARE HRAEAASMNR LLHPRAIAVV GAGRDRSSLG NIVLRNLLAG GFDGPVYPVN PAAAAGEGAV ASVRAYASVE DTPRPVDLAV LCVSAEVIPA VVAACGRHGV RGLVVVTDQR DDAADARLAS DARANGMRVV GPASLGIQNP AVGLNASLVE RMPPAGRIGC YSQSGPLGGA LLEAAAGRRL GFSVFVSAGD RADVSGNDLL QYWEADPSTG VALMHLETFG NPRKFARLAR RLGRDTPVVV VLSERTPLDE ALLRQAGVIG VDRVSQGLDV ALLLANQPLP GGNRVAVVGD SRALVGFTAR AADAAGLAVR EVLLPVGSTA EAFRDALVSA SAEVDALLVI AVRLPSSLPG LAAAGVGVAA GIAAAAAAPL VRVPLLATVR ATEASPELGA IPAYPSPEGA VAALRRAVGY AHWRALPSGA VPATQVRAEE ARRLVAGTTG RLTDGAAGEL LACYGIEVVP RRVIGGADEA VEAAALLGWP VVLKALSDGY RHRPDLGGQR LDLPDPAAVR AAWRSLAERL GPGAPIVAQR MVPGGVAVVA GAEQHPRFGP LVSFGLAGPA TELLGDRVHH ILPLTDADAA RLVRSVRAAP LLFGYRGAEP VDVAALEDLL LRLARLVDDI GGVKHLTLEP VIVSVDRVSV LSADIVLAPP TPRADAGPRR FWRPVADLPA RPEPRAGSAQ RVPTVHNRLP
|
| |