Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5685 |
Symbol | |
ID | 5674011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6902441 |
End bp | 6903502 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244538 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001509941 |
Protein GI | 158317433 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0319041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGTC GCCTGACCAA CACCGACCCG GCGCTGCGGC ACGGCTGGCA CCCGGTCGCC CGGTCGCCCG AGCTCGCCGA CGAGCCGATC GCCGTCCGGC TGCTCGGCGA GCCGTGGGTG CTGGCCCGGC TCGACGACCA GGTGGCGGCC TTCGCCGACT GGTGCCCGCA CCGGCTCGCC CCGCTGTCGG CGGGACGGGT CGAGGGCCAC GAGCTCGTCT GCGGCTACCA CGGATGGCGG TTCGTCGCGT CCGGGGAGTG CACCGCGGTA CCGGCGCTCG GCCCGGGAAT ACCGGCGCCG CGGCGGGCCC GCGCCATCCC GCCGTGGGGG GTGACCGAGC GGCACGGGCT GGTGTGGATC GCGCCCGCGG AGCCGTTCGC CGACATCATC GAGCTGCCGG AGGCGGCCGA GGACGGGTTC GACGACGCCT GGCTGCCCGC CGCGCGCACG ACGGCCTGCG CGGGCCTGCT CGCCGACAAC TTCCTCGACA CCGCGCACTT CCCGTTCGTG CACGCCGCCA CCATCGGCGC CGGCGAGGAG ACCGTCGTCG CGCCGTACCG GGTGGACGCC GACGGCGACG GCTTCCTGGT CCGGATGGAT CAGGAGGTCG CCAACCCGGA GGATCCGGGG GTCGCGGCCG GGCTCCGTCC GCTCATCCAA CGCCGCACGT CGACGTACGT GTACCGCCCG CCGTTCATGC TGCGGCTGCG GCTGGAGTAC CCCGACGCCG GGATCACCAA CACGATCCTG TTCTGCCTGC AGCCCGAGGA GGCCGCCGCG ACCCGCGTCT ACACCCGCAT CCTGCGCGAC GACCTGGGCG GTGATCCCGC CCGGCTGGCC GAGGCCGTCC GCTTCGAGCA GGCGGTGCTC GACGAGGACC TCGCCCTGCA GGAGCGCTTC ACCATCGACG GACTCCCGCT GATCTCCGGG GACGGTGGGA CGGCCGCGGA GGTCAGCATC CGCGCGGACG CGGCGGGCGT GGCGCTGCGC CGGGTGCTGG CCGCTGTCGT CGCCAGGGCA GCCCGCAGCT CTCCCACCAG GGCAGCGGAT ATCCGACACT GA
|
Protein sequence | MTGRLTNTDP ALRHGWHPVA RSPELADEPI AVRLLGEPWV LARLDDQVAA FADWCPHRLA PLSAGRVEGH ELVCGYHGWR FVASGECTAV PALGPGIPAP RRARAIPPWG VTERHGLVWI APAEPFADII ELPEAAEDGF DDAWLPAART TACAGLLADN FLDTAHFPFV HAATIGAGEE TVVAPYRVDA DGDGFLVRMD QEVANPEDPG VAAGLRPLIQ RRTSTYVYRP PFMLRLRLEY PDAGITNTIL FCLQPEEAAA TRVYTRILRD DLGGDPARLA EAVRFEQAVL DEDLALQERF TIDGLPLISG DGGTAAEVSI RADAAGVALR RVLAAVVARA ARSSPTRAAD IRH
|
| |