Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4056 |
Symbol | |
ID | 5672414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4835802 |
End bp | 4837544 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242932 |
Product | FAD dependent oxidoreductase |
Protein accession | YP_001508349 |
Protein GI | 158315841 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.194203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCGG ACGCGGTCGT CATCGGCGCG GGGGTGAACG GCCTCGTCGC GGCGAACCGG CTCGCCGACG CCGGCTGGGA CGTCGTCGTG TGCGAGGCCG CCGACGAGCC GGGCGGGGCC TGCCGCTCCG CCGAGGTCAC CGCGCCCGGG TTCGGTACCG ACCTGTTCAG CGCCTTCTAC CCGTTCGCCG CGCGGTCCCC GGCGCTGCGC GCGCTCGACC TCACCGACCA CGGGCTGACC TGGCTGCACG CCCCGCGGGT GCTGGCCCAC CCCACCCCGG ACGGCCGCTG CGCGGTGCTG TCCACCGACC TCGACGGGAC GGCCGCATCC CTCGCCGCCT ACGCCCCGGC CGACGCGGCG GCCTGGCGGG CGCAGGTCGC ACTGTGGGAA CGGGTCCGCG ACCCGCTGCT CGAGGCGCTG CTGGCCGCCC CGTTCCCGCC GGTCCGGGCG GGCGCACGGC TCGCCCGCGC GCTCGGGGCC GCCGACGCGC TGCGCTTCGC CCGGTTCGCG ATGCTGCCCG TCCGCCGGTT CGCCGCGGAG GAATTCGCCG GCGCGGGCGC CGGACTGCTG GTCGCCGGCA GCGCGCTGCA CACCGACCTG GCGCCGGAGT CGGCGGGATC GGCGCTGATC GGCTGGCTGC TGGCGATGCT CGGCCAGGAT GTCGGCTTCC CGGTGCCGCG TGGCGGCGCG GGCCGGCTGA CCGCCGCCCT CGTCGACCGG CTGCGCTCCC GCGGCGGGGT GGTGCGCACT CGAGCCGAGG TGGACGCGGT GATCGTCACC GCCGGCCGGG CCCGTGGCGT GCGGCTCACC GACGGGACGG CCGTGCGCGC CCGGCGCGCG GTGCTCGCCG ACGTGGACGC GGTCTCCCTC TATCGGCGCC TGGTCGGCGA CGAGCATCTC CCCGCCCGCC TGCTCACCGA CCTGGCCCGC TTCCAGTGGG ACAGCTCGAC CTTCAAGGTC AACTGGGCGC TCGCCGGGCC GATCCCGTGG TCCGACCGCC GGATCGCGGA CGCCGGCACC GTCCACCTCG GCGGCACCAT GGACGATCTG ACGATGATGT CCGCGCAGCT GGCGTGCGGG CTCGTGCCGG CGGACCCGTT CCTCGTCCTC GGCCAGATGA CCACTGCCGA CCCGGGCCGC TCGCCGGCCG GGACGCAGAG CGCGTGGGCG TACTTCCACC TCCCGCAGTC CCCCCGGGGC GACGCCGGTG GGGCCGGTGT CACCGGCCGC TGGGACGGCG ACGACACCGC CCGCCTCCTC GAGCGGGTGG AACGCAAGCT GGAGGCGGCG GCCCCCGGCT TCGGGTCGCT GATTCTCAGC CGCGACACCC AGTCGCCGCG ACGGCTGGAG GACCAGGACG CCGTGCTGCG CGGCGGCGCC CTCAACGGCG GGACGGCCGC CCTGCACCAG CAGCTGATCT TCCGCCCGGT GCCCGGCCTG GGCCGCCCGG AGACCCCGAT CCCCGGGCTC TATCTGGCGT CGATGTCAGC ACATCCCGGC GGCGGTGCGC ACGGCGGGCC GGGCGCCATG GCGGCCACGG TGGCGCTGCG CGACGCCGGC CCGGCCGGCC CCGTCCGCCG CCGCGCGTCC GCCGCCGCGC ACCGGCTGAT CTACGCTGGG CCGGTGGCGT CGCCCGCGGC GCCAGGCCCG GATTCTCCAG GGCCGGCGAG TCCGGGCCCG GCGAACGAGT CCTTGAGCGC CTCCGACACC TCGCGTGCGA CCCGTCCAAA AGCGGCGTCG TAA
|
Protein sequence | MTADAVVIGA GVNGLVAANR LADAGWDVVV CEAADEPGGA CRSAEVTAPG FGTDLFSAFY PFAARSPALR ALDLTDHGLT WLHAPRVLAH PTPDGRCAVL STDLDGTAAS LAAYAPADAA AWRAQVALWE RVRDPLLEAL LAAPFPPVRA GARLARALGA ADALRFARFA MLPVRRFAAE EFAGAGAGLL VAGSALHTDL APESAGSALI GWLLAMLGQD VGFPVPRGGA GRLTAALVDR LRSRGGVVRT RAEVDAVIVT AGRARGVRLT DGTAVRARRA VLADVDAVSL YRRLVGDEHL PARLLTDLAR FQWDSSTFKV NWALAGPIPW SDRRIADAGT VHLGGTMDDL TMMSAQLACG LVPADPFLVL GQMTTADPGR SPAGTQSAWA YFHLPQSPRG DAGGAGVTGR WDGDDTARLL ERVERKLEAA APGFGSLILS RDTQSPRRLE DQDAVLRGGA LNGGTAALHQ QLIFRPVPGL GRPETPIPGL YLASMSAHPG GGAHGGPGAM AATVALRDAG PAGPVRRRAS AAAHRLIYAG PVASPAAPGP DSPGPASPGP ANESLSASDT SRATRPKAAS
|
| |