Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0544 |
Symbol | |
ID | 5668961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 630384 |
End bp | 631904 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239471 |
Product | glycosidase PH1107-related |
Protein accession | YP_001504909 |
Protein GI | 158312401 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.134151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCGGCA GCACCCGGAC CACAGCTGCC GGGTCGGCCC AGAACGACCT GGTCACGCGG CATCCGCTGG GGCTCTCCCC GAACAGCGAC CGGGTCATCG CGAAGCTCTT CCTGCCCGGT GAGGAGGGCA GCCAGCCGCA CTCCCGGGCC GCCGGCATCG TGGCCCGGGC TCTCGCCCTG CCCGAGCGGG AGGTCGACAG CCTGGTCGCC GACCTCCTCG AACGGTTCGA GCCGCGCCAC CGGGACTACC GGGGCATCCT CGCGCGGCAT GCCGCGGTGG TGACTCCGCG GGTGCCGGTG CCCGCGGAGC TCTCCCCAGC GCGCTGTCTG CTGCTCGGCG CGTGTTTCAC CGCCGAGTAC GCGGTGGAGG GCGCGGCCGT CACCAACCCC TCGGCGGTGC CGCACCCCGA CCAGACCGGG CTGCCGTCCG GCGCGCTGCG ACTGGCGTTG AGCCTGCGCG CTGTCGGGGA GGGCCATCTG TCCTCGATCG GCTTCGCCGT CGCGGTGATC GGTCCGGGCC CGGCGGTGCG CCTGGAGTCG CGGACCGGCC CACTCACCAC CGGCGTCGCC GTACCCGTCG AGTGGGAGGC CGCCCGGCTG CGCGCCGTGC TCACCGAACA CGGGCTCGAC GACGAGGTCA CCCGTGCCGT CCTCCAGAGC CTGGACCCAC GGCGGCCCGG CAGTGAACCC GACCCCGATC TCTCCCGTGC CTTCGCCGCG GTTCCCCCCG ACCTGCTCCG TCGCCCCCAG GCACCCGGGA TCCTCGCCGG GATCCGCTCG ATCGCCGGAT CATTGCGGCG GGTCGAGTTC CCACCCGACA GCGCCCTGCC GCAACGGGTG CTGTGGCCCA CCGTCACCAG CGAGAGCAAC GGCATGGAGG ACGCGCGGTT CGTCCGGTTC ACCGCGCCGG ACGGGACCGC GGACTACCGG GCCACCTACA CCGCCTTCGA CGGCACGGAC ATCTCCCCGC GCCTGCTCAC CAGCCCCGAC CTGCGGGTCT TCACCACCGC ACCGCTCACC GGCCCGGCCG CCCGTAACAA GGGCATGGCG CTGTTCCCCC GGCTGGTCGA CGGTCGCCAT CTCGCGCTCT GTCGCTCCGA CGGCGAGTCC ACCGGTCTGA CCGCATCGGA TGACGGCCAG GTATGGGGGC CGGCGCGTCC GCTCCACGGG CCGCGGGTCG CCTTCGAACT GCTCCAGGTG GGCAACTGCG GGTCCCCGGT CGAGACGTCC GCGGGCTGGC TCACGCTCAC CCACGGCGTC GGGCCGATGC GGACGTACAC CATCGGCGCG ATCCTGCTTG ATCTCGACGA TCCGGGAAGG GTGGTCGCGG CGCTGCCCGA ACCACTGCTC GCTCCGACCG GGCAGGAGAG CACGGGCTAT GTCCCCAACG TCGTCTACTC CTGCGGCAGC CTGATCCACC ACGGTCTGCT GTGGCTGCCG TACGGGATCG GCGACACCCG GATCGGGATG GCCAGCGTGC CCGTCGACCG GCTCCTCGCA CGAATGGTTC CCGTCGGGTG A
|
Protein sequence | MIGSTRTTAA GSAQNDLVTR HPLGLSPNSD RVIAKLFLPG EEGSQPHSRA AGIVARALAL PEREVDSLVA DLLERFEPRH RDYRGILARH AAVVTPRVPV PAELSPARCL LLGACFTAEY AVEGAAVTNP SAVPHPDQTG LPSGALRLAL SLRAVGEGHL SSIGFAVAVI GPGPAVRLES RTGPLTTGVA VPVEWEAARL RAVLTEHGLD DEVTRAVLQS LDPRRPGSEP DPDLSRAFAA VPPDLLRRPQ APGILAGIRS IAGSLRRVEF PPDSALPQRV LWPTVTSESN GMEDARFVRF TAPDGTADYR ATYTAFDGTD ISPRLLTSPD LRVFTTAPLT GPAARNKGMA LFPRLVDGRH LALCRSDGES TGLTASDDGQ VWGPARPLHG PRVAFELLQV GNCGSPVETS AGWLTLTHGV GPMRTYTIGA ILLDLDDPGR VVAALPEPLL APTGQESTGY VPNVVYSCGS LIHHGLLWLP YGIGDTRIGM ASVPVDRLLA RMVPVG
|
| |