Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6785 |
Symbol | |
ID | 5675098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8267263 |
End bp | 8268822 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245634 |
Product | CHAD domain-containing protein |
Protein accession | YP_001511025 |
Protein GI | 158318517 |
COG category | [S] Function unknown |
COG ID | [COG5607] Uncharacterized conserved protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTACAC CGGCCCACGG GCATCGGGAG GTAGAGACCA AATTCGACGT GAGTTCGACG TTCGTCGTCC CGGTCCTGAC CGGTCTTGCC GGGGTCGCCT CGACCGCCGG GCCGACGGAG GAGCATCTCG ACGCCGTCTA CTACGACACC GAAGATCTGC GGCTTGCCCG CAACCGCATC ACGCTGCGAC GGCGCCAGGG CGGGCATGAC GCCGGCTGGC ATCTCAAGCT CCAGAGCTCG GGGGCCGGCC GGGACGAGAT CAGCCGTCCG CTGGGCGCGA TCGAGCGGGA CCCCTCCGCC GAGGGCACTG TCCCGGGCGA GTTCGCCGAC CTCGTGGCCG CCACCACCCG GGGCAGGCCC CTGGCCCCCG TCGCGCGGGT GCGGACCGTC CGCCGCGCGA CCACTCTGCG CGCCCCCGAC GGACGGGACC TGGCGGAACT AGCCGACGAC GAGGTCCACG CCCAGACACT GGGCACGTCG ACGACGCTGT CCCGCTGGCG GGAGATCGAG GTCGAGGCCC TCGGCGACGA CCTCGACGTC CTGCCGGCGG CCGGGGCGGT GCTGTGTGAG GCCGGGGCCC GGCCGGCCGC CGGGCCGTCC AAGCTCGCCC GGGCACTCGG ATCGCGGGCG GCGCGGCCGG AACTCCCCGA GCTCCCCGCC GACGGCGCCC CGGCCGGGGG AACCGCGGGT GAGGCCGTCC GCGGCTATCT GGCGACCCAT ACCCGGGCCC TGCTGGCCGC TGACGCCCGC GTGCGCCTCG GCGATCCCGA GTCTGTTCAC GACATGCGGG TCGCCGCCCG CCGGCTGCGC AGCGCCCTGC GCACGTTCCA GCGGCTGTTC GACCCAGCAC CGGCGCGCGT GCTCCAGGCC CGGCTGCGGG AACTGAACCT CCTGCTCAAC GCCGCCCGCG ACGGCGAGGT CCAGCTCGAG CGGTTCACCA CCGAGATCGA CGCGCTCGAC GAACGGGACC TGCTGGGCCC CGTCGCCGCC CGCGTGCAGG GCCATCTGCG CGCACAGCAC CTGCGCGGCC GGGAGCAGGC CCTGACCTGG ATGCGCGACG CGCAGTACCT GGATTTCCTC GACGACCTGA TCGCCTTCGT CGTCGGACCA CCGTATTCCG CCCTTGGTCG CCGCCCGGCC GGGCCGGCCC TGCGCTCCCC GATCCGCAAA GCCGACCGCA AGCTGCGCCG CCGGGTCGAT CGAGCCCTCC GCACCCCCGC CGGCGACAGC CAGGACGTCG CCCTGCACGC CGCGCGGAAG GCCGCGAAGC AGCTGCGCTA CGCGAGCGAG GCGGCCACGC CGGTCTACGG GGAACACGCC GCGACGCACA CCAGGCGAGC CAAGAAAATC CAGAACAGCC TGGGTGAGCA CCAGGACTGC GTCGTCGCCC AGGGCGTCCT GCGCGAGTTC GCGATCGCCG CCAACCAGGC CGGCGAATCC TCGTTCACCT ACGGCCTTCT CCTCGGCGGC GAGCGGGAAC AGGCTCACCT GACCAGGGAT GTCTTCGCCG CCCGCTGGCC GAAGCTCTCC CGCCGGCGCC ACCGCCGCTG GCTGCACTGA
|
Protein sequence | MRTPAHGHRE VETKFDVSST FVVPVLTGLA GVASTAGPTE EHLDAVYYDT EDLRLARNRI TLRRRQGGHD AGWHLKLQSS GAGRDEISRP LGAIERDPSA EGTVPGEFAD LVAATTRGRP LAPVARVRTV RRATTLRAPD GRDLAELADD EVHAQTLGTS TTLSRWREIE VEALGDDLDV LPAAGAVLCE AGARPAAGPS KLARALGSRA ARPELPELPA DGAPAGGTAG EAVRGYLATH TRALLAADAR VRLGDPESVH DMRVAARRLR SALRTFQRLF DPAPARVLQA RLRELNLLLN AARDGEVQLE RFTTEIDALD ERDLLGPVAA RVQGHLRAQH LRGREQALTW MRDAQYLDFL DDLIAFVVGP PYSALGRRPA GPALRSPIRK ADRKLRRRVD RALRTPAGDS QDVALHAARK AAKQLRYASE AATPVYGEHA ATHTRRAKKI QNSLGEHQDC VVAQGVLREF AIAANQAGES SFTYGLLLGG EREQAHLTRD VFAARWPKLS RRRHRRWLH
|
| |