Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4377 |
Symbol | |
ID | 5672730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5223940 |
End bp | 5224899 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243246 |
Product | intradiol ring-cleavage dioxygenase |
Protein accession | YP_001508663 |
Protein GI | 158316155 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGACC GTCAGCAGCC AAGCCGGATG CCGACGTACG AGGGGCGTGC ACTGGCCCGT CCTGAGGAGG AGATCGTCGA TCAGGGGCTG GGCTTCGACA TGGGCACGGT GGTGAGCCGG CGGCGGATGC TGGCCTTCTT CGGTGTGGGT GCCGCGGCAG CAAGCCTAGC CGCCTGCACT CCAGGCCAGG TCGGGTCCTC GGGGGCGTCC GCTGCTACGG CGTCCGCGGT TGCAGGGGAG ATCCCCGAAG AGACCGCGGG CCCCTACCCG GGCGACGGGT CCAACGGGCC GGACGTCCTC GAGCAGAGCG GTGTGGTCCG CAGTGACATC CGGTCCAGCT TCGGCGACTC GACCGGCACC GCCGAAGGCG TCCCCATGAC ACTGGCGCTG ACGGTCCGCG ACCTCGCGAA CGGTGGCACG CCCTTCGCCG GGGTGGCCGT GTACGTGTGG CACTGCGACC GCGAGGGCCG CTACTCGCTG TACTCCGACG GCGTCACCGA CCAGAACTAT CTGCGTGGCG TCCAGATCAC CGACACCGCC GGCACGGTCC GTTTCACCAG CATCTTCCCC GCCTGCTACT CAGGACGCTG GCCCCACATC CACTTCGAGG TCTACCCCGA CCAGGGCAGC ATCACCGACG CCACCACGGC CATCGCCACC TCCCAGGTCG CACTCCCCCA AGACGTCTGC ACCACGGTCT ACGCCCAGCA GGGCTACGAG GCGTCCGTGA GCAACCTGGC CCAGGTCAGC CTCTCCAGCG ACAACGTCTT CGGCGACGAC TCCGGCGCCA GCCAACTCGC CACCGTGACT GGCGACGTCA CCGGCGGCTA CACCGTCTCC CTTCCTGTCA GCGTCGACAC CGCCACCACC CCCGGCGGCG GCGGCCAAGC CCCCGGAGGG GGCGGCGGCC AGCCGCCGTC CGGCGGGCCA GGTGGTCAGC CGCCGGCCAC ATCGAGCTGA
|
Protein sequence | MADRQQPSRM PTYEGRALAR PEEEIVDQGL GFDMGTVVSR RRMLAFFGVG AAAASLAACT PGQVGSSGAS AATASAVAGE IPEETAGPYP GDGSNGPDVL EQSGVVRSDI RSSFGDSTGT AEGVPMTLAL TVRDLANGGT PFAGVAVYVW HCDREGRYSL YSDGVTDQNY LRGVQITDTA GTVRFTSIFP ACYSGRWPHI HFEVYPDQGS ITDATTAIAT SQVALPQDVC TTVYAQQGYE ASVSNLAQVS LSSDNVFGDD SGASQLATVT GDVTGGYTVS LPVSVDTATT PGGGGQAPGG GGGQPPSGGP GGQPPATSS
|
| |