Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2969 |
Symbol | |
ID | 5671353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3495909 |
End bp | 3497198 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641241873 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001507293 |
Protein GI | 158314785 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.665588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTC TCGCCAAGCC GTCGGAGGCC AGCTGGACCG AGCGCTACCC GGAGCTGGGC ACCGGCCTCG TGTCGTACGA GGACTCCATC TCCCCGGAGT TCTACGAGCT CGAGCGCGAG GCGATCTTCC GGCGTGCCTG GCTCAATGTC GGCCGGGTCG AACAGTTGCC CCGCAAAGGG AGCTTCTTCA CGAAGGAGCT TGCCGTTGCC AAGACGTCGC TCATCGTCAC GCGCGATCTC GATGGACAGG TTCGCGCCTT CCACAACGTG TGCCGCCACC GCGGCAACAA GCTGGTGTGG GACAAGACGC CGAGGGACGA GACCAGTGGT GTCTGTCGCC AGTTCATGTG CAAGTACCAC GGCTGGCGGT ACGGCCTGGA CGGTCAGCTG AAGTTCGTCC TTCAGGAGGA GGAGTTCTTC GACTTCGACA AGGCGGACTT CCCGCTTGTG TCAGCTCACT GCGAGGTCTG GGAAGGCTTC ATCTTCATCA ACCTGTCGGA CGAACCCAGC CAGTCGCTCC TTGAATTCCT TGGGCCGATG GTGCGCGGGC TCGAGGGCTA CCCGTTCCAT CACGTCACCG AGCGGTATGC CTTCAAGGCG GACATCCTCA GTAACTGGAA GATCTACCTT GACGCATTCC AGGAGTACTA TCATGCGTCG ATCTTGCACT CGCAGCAGCA GGTGCCGAGC CTGCGCAGCT TTGAGTCCGG TTTCAAGGCG CCGCACTACC AGGTCGATGG ACCGCATCGG CTGGTGAGCA CCGGCGGCTG GAAGGGCGTG CCGCGGCACA TGCTGCCGCT CGACCAGATG TACCCGATCG AGCACAACAT CGAGGCCGGC ATGATGGGCC CCTGGCAGCG CCCGGACATC CCCGAGCTCG ACCCGGCCAA CCTGCCGGCG GGGCTCAACC CTGGCGGGCT CGACCCCTGG TCGATCTCGA ACTTCCAGAT CTGGCCCAAT TTCGTGATCC TGGTCTATGA GCGGGGCTGG TACCTCACCT ACCAGTACTG GCCGACTTCC CACAACACGC ATGTGTGGGA GATGTCGTAC TACTTCCCGC CGTCGCGGAA TGCCAGCGAA CGGATCCGGC ACGAGGTCAC CGCCGTCGTG TCCAAGGAGG CCGGTCTCCA GGACGCGGGC ACCCTCGACG GCACCCAGAT GGGCCTGGAA TCCAGGGTTA TCGACAGATA TCCGCTGTCC GATCAGGAGA TTACCGTACG TCACCTGCAC AAGGTCACCG GAGATTGGGT GCAGTCCTAT CTTCGGGACG GAAAGGCAGT CCGGGCATGA
|
Protein sequence | MARLAKPSEA SWTERYPELG TGLVSYEDSI SPEFYELERE AIFRRAWLNV GRVEQLPRKG SFFTKELAVA KTSLIVTRDL DGQVRAFHNV CRHRGNKLVW DKTPRDETSG VCRQFMCKYH GWRYGLDGQL KFVLQEEEFF DFDKADFPLV SAHCEVWEGF IFINLSDEPS QSLLEFLGPM VRGLEGYPFH HVTERYAFKA DILSNWKIYL DAFQEYYHAS ILHSQQQVPS LRSFESGFKA PHYQVDGPHR LVSTGGWKGV PRHMLPLDQM YPIEHNIEAG MMGPWQRPDI PELDPANLPA GLNPGGLDPW SISNFQIWPN FVILVYERGW YLTYQYWPTS HNTHVWEMSY YFPPSRNASE RIRHEVTAVV SKEAGLQDAG TLDGTQMGLE SRVIDRYPLS DQEITVRHLH KVTGDWVQSY LRDGKAVRA
|
| |