Gene Information |
Locus tag | Franean1_4229 |
Symbol | |
ID | 5672584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5035949 |
End bp | 5037175 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243102 |
Product | beta-lactamase domain-containing protein |
Protein accession | YP_001508519 |
Protein GI | 158316011 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
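The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5035949
end_bp = 5037175

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1227
print(protein_length)  # 408
```

Both results agree with the Gene Length (1227 bp) and Protein Length (408 aa) fields.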
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACAGC CCCTGAACGC CCTCATCGTC AAGGAGGGGG AGGGCCAGCA GGACGCCGTC CCCGTCAACG ACCACATCTT CACCTCGAAG GGCATCTCCA ACAGCTACCT GGTCACCACC CCCGACGGCG ACGTGCTGAT CAACACCGGC ATGTACACCG AGGCGGAGCA GATCAAGGCC CGCTTCGGCC GGGTGAGTTC CGGCCCGCTG CGGGTCATCG TCTTCACCCA GGGCCACCCC GACCATGTGG GCGGCTGGTC CCAGATCGCC GCGCCGGGCG TCGAGACGAT CGCCCAGGCC AACCACGCGG ACGTCCGCGA GTACTGGCGA CGGCTGCAGC CGTTCTACTC CAGCCGCAGC ACCCACCTGT GGAAGCGCGA TGTCACGGGC GTCGACCGCA CCTACCAGCC GCCCGAGGCC GTGGTCACCA CCACCTTCCT GGACAACCAC GCCTTCACCC TGGGCGGGCG CCGCTTCGAG CTCTACTCGA CTCCGGGCGG CGAGACGACG GACTCCCTGG TCGTGTGGCT CCCCGACGAG CGCACCGTGT TCACCGGCAA CCTTACCGGA CCTCTGTTCG GCCACGTCCC CAACCTGTAC ACGATCCGCG GCGACAAGAT CCGCGGCTCG CTGTCGTACA TCCACTCCGT CGACCGGGTC ATCGGGCTCG AACCCGAGGT CCTCATCACC GGGCACGGCG AGCCGGTGCG CGGGGCCGAG GAGATCCGCC GCCGCCTCAC CCAGCTGCGC GACGCCACCG AGTACCTGCG CGACCGCACC ATCGAGGGCA TGAACGCGGG CGTCGACCTG TGGACCCTGA TGGGCCAGAT CACGCTGCCG CCCGAGCTGG CCATCCCGCA GGGGCACGGC AAGGTGCCCT GGATCGTCCG GGCGATCTGG GAGGAGCACA CCGGCTGGTT CCGCTACGAG TCGACCACAG AGCTCTACGA CGTGCCCGCC TCCGCCGTCT GGGCCGACCT GCTGGACATG GCCGGTGGGA CCGGCCCGCT GGTCGACCGG GCCCGGGCCC ACCTGGACGC CGGGCGGCCG GTCGAGGCAC TGCACCTGAT CGACATGGTG CTCTCCCGGG AACCGAAGGA TCCCGAGGCC CTGCGGATCA GGCTCGGGGC CCACGAACTG CTGCTCGAGC GCAGCGGCCG GGAGAACTTC AGCGAGGTCC GCTGGCTCGA AGCCGAGATC CGTGACCTCC GGGACATGCT GCCATGA
|
Protein sequence | MKQPLNALIV KEGEGQQDAV PVNDHIFTSK GISNSYLVTT PDGDVLINTG MYTEAEQIKA RFGRVSSGPL RVIVFTQGHP DHVGGWSQIA APGVETIAQA NHADVREYWR RLQPFYSSRS THLWKRDVTG VDRTYQPPEA VVTTTFLDNH AFTLGGRRFE LYSTPGGETT DSLVVWLPDE RTVFTGNLTG PLFGHVPNLY TIRGDKIRGS LSYIHSVDRV IGLEPEVLIT GHGEPVRGAE EIRRRLTQLR DATEYLRDRT IEGMNAGVDL WTLMGQITLP PELAIPQGHG KVPWIVRAIW EEHTGWFRYE STTELYDVPA SAVWADLLDM AGGTGPLVDR ARAHLDAGRP VEALHLIDMV LSREPKDPEA LRIRLGAHEL LLERSGRENF SEVRWLEAEI RDLRDMLP
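The relationship between the two sequences above can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method the database used: it relies on the standard codon assignments (which translation table 11 shares with table 1), assumes the first codon is a valid initiator and reads it as M, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, ignoring the spaces between 10-base groups."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        # Assumption: the first codon is an initiator (here GTG) and is read as M.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAAACAGC CCCTGAACGC CCTCATCGTC"))  # MKQPLNALIV
```

The output matches the first ten residues of the protein sequence. Note that the GTG start codon is read as methionine rather than valine, which is why the protein begins with M.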