Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0167 |
Symbol | |
ID | 5668592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 200636 |
End bp | 201874 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239096 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_001504540 |
Protein GI | 158312032 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.10739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.156031 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACTCG GAGTCGCCGT GCCGCCGCTC ATCGTGGCGC CGGGGCGTGC CGGCGGGCCG ATGATGCCGT CCGCGTCGGC GGTGGTCCCG TTCGGGGGAG CCGGCTCGGT AGCCTCATCC TCGTGGTCGA AGTGTCCGCG AGGTGGCCCC GAGGTGTGCC GGCGTGCCCG TCCGGTGTCT CCGGCCACCC GGAAACGAGA ACCCGTCGCG CGGCGGGCGC GCCGGGGTGG CTGCGCGTCC GGGGGCCCCG TGACCGCTGA TCCGGCGAGT GTCGCGGGTG GCGATGCGCT GGTCGCGCCG AAGCGCGTGC TGCCCCCGGA GGACGTCGAG CTGGCCCGCG AGCTTGAGGA GGTCGCACAC CGCGCCTGGC CGCCGCTGCG CGAGTGGACC CACGGCGGGT GGGTGCTGCG GGAGTCCGCG GGCTCCTCCC GCCGGGGTAA CTCGGTGTGG GCCCGTGGCG ACGTCCCCGA CCTGGCCGCG GCGCTGCGCG CGGTCCACTC CTTCTATACG GCGGCCGGCC TGCCGCCGAC GTTCCAGATC ACCCCGGTGG CCCGGCCCGC CGGCCTGCTC GACGCGCTGG ATGCCGCGGG TTACGACGAC GGCGGCCCGA CCGACGTCTG CGTCGCTCCC CTGGCCGCCC TGCGCGCGCC CCGACCGGAC GCCGCGGAGC ACCGCCCCGG GCCGGCGGGC GGGCCGGGCG CTGACCGCCG CGTCGAGTCC GCCGCCGCCG ACCTGCCCGA CGAGCGCTGG CTCGCGGTGG CGGGGGACGT CCTGGCCACG TTCGCCGGCC AGCGCGTCGG CACGCTCGCG GTGGTTCGCG CGATGGCGCT GCCCCAGCGC TACGTGACGG TCTTCGTCGA CGGCCGTCCG GTCGGCGTCG GGCGCGGGGT CCTGGACGGC AGCTGGCTCG GGATCTACAG CATGGCCACC CTCCCGGCCG CCCGCGGCGT GGGCGTCGCC GGCCGCACGC TCGCCGAGCT CGCGCACTGG GCCGGGGCGC GGGGGGCCGA GCGCGCCTAC CTCCAGGTCG AGCGGCACAG CGTGGTGGCG CGCGGGCTCT ACGCCCGGCG CGGGTTCCGT CCCGTCTACG GGTACAGCTA CCGGCGGCTG CCGGCACCGT CCGGGCTCCA GCGCAGCGCC GTGGCGCCCC GCGTGGCGAC GGAAAACGTG GCATCGGAAA ACGTGGCATC GGCGTGGCCT GCGGCCGGCC GAGCTGGCGG CGGGACGGCC AGCCGATGA
|
Protein sequence | MPLGVAVPPL IVAPGRAGGP MMPSASAVVP FGGAGSVASS SWSKCPRGGP EVCRRARPVS PATRKREPVA RRARRGGCAS GGPVTADPAS VAGGDALVAP KRVLPPEDVE LARELEEVAH RAWPPLREWT HGGWVLRESA GSSRRGNSVW ARGDVPDLAA ALRAVHSFYT AAGLPPTFQI TPVARPAGLL DALDAAGYDD GGPTDVCVAP LAALRAPRPD AAEHRPGPAG GPGADRRVES AAADLPDERW LAVAGDVLAT FAGQRVGTLA VVRAMALPQR YVTVFVDGRP VGVGRGVLDG SWLGIYSMAT LPAARGVGVA GRTLAELAHW AGARGAERAY LQVERHSVVA RGLYARRGFR PVYGYSYRRL PAPSGLQRSA VAPRVATENV ASENVASAWP AAGRAGGGTA SR
|
| |