Gene Information |
Locus tag | Franean1_6662 |
Symbol | |
ID | 5674977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8091387 |
End bp | 8092847 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641245513 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_001510905 |
Protein GI | 158318397 |
COG category | [R] General function prediction only |
COG ID | [COG2144] Selenophosphate synthetase-related proteins |
TIGRFAM ID | |
|
|
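The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state outright: that Start bp and End bp are 1-based, inclusive coordinates, and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Protein length, assuming the CDS is unspliced and ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(8091387, 8092847)
    print(length)                     # 1461
    print(protein_length_aa(length))  # 486
```

Under those assumptions both values agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields of this record.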
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0223324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.900242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGTC TGCTGTCGGG GGGGCACACG CCGATCCGGC ACCGCACCGT GGACACACTG GCGATCTGGG GTGATCGCCG GACGCTGGCG GGCCAGCCGC CGTACCGCGT CGAGCAGGCC GAGGATCTGG CCACCTTCAC CGCCTACGCC CGTCTGCGGC GGGAGGTCTT CGTCGACGAG CAGGGCCTGT TCTCCGCGAC CGTGGCCGGC GATCTGGACG AGGTCGACAG CGACCCGCGC AGCATCGTCC TGGTCGCCCG GGTCGTCGGC GGGCCGGATG ACGGCACGGT GATCGGCGGG GTGCGGTTGG CGCCGATCTG GCGCGGCGAG GACATCGGCG CCTGGCAGGG CGGCCGGCTC GTCGTCGCGG CAGCCGCCCG CGGGCGCTAC GCGGGGATCG GTGCCGCACT GGTCCGGGCG GCGTGCGCGC GCGCCGAGAA CGAGGGGGTG CTGCGCTTCG ACGCCGCGGT GCAGCCCGAC CGCGCCCGCT TCTTCGGCCG GCTCGGCTGG ATGATCGCCG GGACGACCAC GGTCGCCGGC CGCCCGCACG TGCTCATGCG CTGGCCCATC AACCGGCTGG CGGCGGTGGC GGCCTCGATC AAGGCCCCGC TGGCGACCCT GCTGGCCGGG ATGCGCCCCG GCGGCCCCGG GTTCGTCGGT GACGACGGGG CACCGGTGCC CGGCACGGAC GTCGTCGCGG CCTGCGACGC GATCGTGCCG TCGATGGTCG AACGCGACCC CTACTGGGCC GGCTGGTGCG GCGTGCTGGT CAACCTCAAC GATCTGGCGG CGATGGGCGC CCGGCCGGTG GGCATGCTCG ACGCGGTGGC CGGCCCGACG GCGTCCCGGG TCGCCCGGGT GATCGGCGGG CTGCGGGCGG CGGCGGAGCG CTACGGCGTG CCCATCCTGG GCGGCCACAC CCAGCTCGGG GTGGCGGCCG CGCTGTCGGT GACGGCGCTG GGCCGCTCCG AACGCCCGAT CCCCGCCGGC GGCGGCCTGC CGGGGCACGC GGTGACCCTG ACCGCCGACC TGGGCGGTGA CTGGCGCCCG GGGTACTCCG GCCGGCAGTG GGACTCGACG TCCAACCGCC GGACGGCGGA GCTGCGCGCG CTGCTGGACC TGCCGCGCCG GCACCGCCCG TGCGCGGCGA AGGACGTCAG CATGGTCGGG ATCGTCGGCA CGCTCGGCAT GCTCGCCGAG GCGAGCGGGT GCGCCGCCGA GCTGGACGTG GCCGCGGTCC CCCGGCCGGC GGGCGCCACC GTCGGCGACT GGCTCACCTG CTTCCCCGGT TACGCAATGC TCACCGCCGA CGTCGACGAC CGCCCCGTTC CCGCGCCGAG CCCGGCGACG TCGCAGCGCT GCGGCCGGCT ACTCAACGGG ACGGGAGTGA CCCTGCGCTG GCCGGACGGT GTGCTGACCC CGGCGCTGTC CGGCCATGTG ACGGGGCTGG GCCACGCGTG A
|
Protein sequence | MSSLLSGGHT PIRHRTVDTL AIWGDRRTLA GQPPYRVEQA EDLATFTAYA RLRREVFVDE QGLFSATVAG DLDEVDSDPR SIVLVARVVG GPDDGTVIGG VRLAPIWRGE DIGAWQGGRL VVAAAARGRY AGIGAALVRA ACARAENEGV LRFDAAVQPD RARFFGRLGW MIAGTTTVAG RPHVLMRWPI NRLAAVAASI KAPLATLLAG MRPGGPGFVG DDGAPVPGTD VVAACDAIVP SMVERDPYWA GWCGVLVNLN DLAAMGARPV GMLDAVAGPT ASRVARVIGG LRAAAERYGV PILGGHTQLG VAAALSVTAL GRSERPIPAG GGLPGHAVTL TADLGGDWRP GYSGRQWDST SNRRTAELRA LLDLPRRHRP CAAKDVSMVG IVGTLGMLAE ASGCAAELDV AAVPRPAGAT VGDWLTCFPG YAMLTADVDD RPVPAPSPAT SQRCGRLLNG TGVTLRWPDG VLTPALSGHV TGLGHA
|
| |
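The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a record might be cleaned, its GC content computed, and the gene sequence translated is given below. The helper names are hypothetical. The codon table is the NCBI standard assignment, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons, which this sketch does not handle). How the database itself derives the GC content field is not stated in the record, so the calculation here is an assumption.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the first position varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    prefix = "ATGTCCAGTC TGCTGTCGGG GGGGCACACG"
    print(translate(prefix))  # MSSLLSGGHT
```

The translated prefix matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.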