Gene Information |
Locus tag | Franean1_3306 |
Symbol | |
ID | 5671678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3917945 |
End bp | 3919216 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641242195 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001507615 |
Protein GI | 158315107 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
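The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3917945
END_BP = 3919216
GENE_LENGTH_BP = 1272
PROTEIN_LENGTH_AA = 423


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both checks agree with the record: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.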
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.478644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.259475 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACGAT GGCCGAAGCC GGCTGAGGGA AGCTGGACGG AACACTACCC GGAGCTGGGA ACCGGGCTGG TGTCGTACGA GGACTCCATC TCGCCCGATT TCTACGCCCT GGAACGGGAT GCGATCTTCA GGCGTGCCTG GCTGAACGTG GGCCGGGTCG AGCAGTTACC GCGCAGCGGG AGCTACTTCA CCAAGGAGAT CGAGGCCGCC AGGGCCTCGA TCGTCGTCGT GCGCGACAAC GACGGTCAGG TCCGTGCCTT CCACAACATC TGTCGCCATC GCGGCAACAA GCTGGTGTGG AACGACTTCC CGCGGGAGGA GACCGCCGGT AGCTGTCGCC AGTTCACCTG CAAGTACCAC GGCTGGCGGT ACGGCCTGGA CGGCGCCGCC ACCTTCGTGC AGCAGGAGGG CGAGTTCTTC AACCTGGACA AGAAGGACTT CGGCCTCGTC CCGGTGCACT GCGACGTCTG GTCCGGATTC ATTTTCGTCA ATCTGGCGAA GGAGCCCGAA CAGTCGCTGC CCGAGTTCCT CGGCCCGATG GTCACCGCGC TCGGCGGATA TCCGTTCGAC ACGATGACCG AGCGCTTTTA CTACCGTGCT GAGGTCGGCG CGAACTGGAA GCTGTTCATG GACGCCTTCC AGGAGTTTTA CCATGCGCCC ATTCTGCACG CCCGGCAGAC GCCGTCGAAG TTCTCGACGG CGGCGCAGCA GGCGGGGTTC GAGGCGCCGC ACTACCGCAT CGACGGCCCG CACCGGCTGG TCAGCACGGC CGGTATCAAG GCGTGGGAAC TGGACCCGGA GATGCGCAAG CCGATGGAGG ACATCACCCG CAGCGGCCTG TTCGGGCCCT GGGACGAACC CGATCTCGGC GTCGAGAAGA TGCCGGCGGG TCTCAACCCG GCCGGCTGCG AGCCGTGGGG TCTGGACTCG TTCAATCTCT GGCCGAACTT CGTGATCCTG ATCTGGGCGG GTGGCTGGTA TCTCACCTAC CATTACTGGC CGACGTCGCA CAACACGCAC ACCTTCGAGG GGAACCTGTA TTTCGTTCCC TCCCGGAACG CGCGGGACCG GGTTGCCCGC GAGATGGCCG CGGTGACCTT CAAGGAGTAC GCGTTGCAGG ACGCCAACAC GCTGGAGGCG ACGCAGTCGA TGCTCGAGTC CCGGGCTGTC GCCGAGTTCC CACTGAACGA CCAGGAGGTT CTCTGCCGGC ACCTGCACAA GGTCGCGGCC GACTGGGTCG GGGAATACCA GCACAACGGC GCGGGGGCGT GA
|
Protein sequence | MARWPKPAEG SWTEHYPELG TGLVSYEDSI SPDFYALERD AIFRRAWLNV GRVEQLPRSG SYFTKEIEAA RASIVVVRDN DGQVRAFHNI CRHRGNKLVW NDFPREETAG SCRQFTCKYH GWRYGLDGAA TFVQQEGEFF NLDKKDFGLV PVHCDVWSGF IFVNLAKEPE QSLPEFLGPM VTALGGYPFD TMTERFYYRA EVGANWKLFM DAFQEFYHAP ILHARQTPSK FSTAAQQAGF EAPHYRIDGP HRLVSTAGIK AWELDPEMRK PMEDITRSGL FGPWDEPDLG VEKMPAGLNP AGCEPWGLDS FNLWPNFVIL IWAGGWYLTY HYWPTSHNTH TFEGNLYFVP SRNARDRVAR EMAAVTFKEY ALQDANTLEA TQSMLESRAV AEFPLNDQEV LCRHLHKVAA DWVGEYQHNG AGA
|
| |
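The relationship between the gene sequence and the protein sequence under translation table 11 can be illustrated with a short sketch. It assumes the NCBI bacterial code (table 11), in which the codon-to-amino-acid mapping matches the standard code and an initiating GTG is read as methionine. The function name `translate_cds` is hypothetical, and only the first 30 and last 12 nucleotides of the gene sequence above are used.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the gene sequence listed above.
    print(translate_cds("GTGGCACGAT GGCCGAAGCC GGCTGAGGGA"))  # MARWPKPAEG
    # Last 12 nucleotides, ending in the TGA stop codon.
    print(translate_cds("GCGGGGGCGTGA"))  # AGA
```

The two fragments reproduce the first ten residues (MARWPKPAEG) and the last three residues (AGA) of the protein sequence listed above, which is consistent with a GTG start codon on the + strand and a TGA stop codon.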