Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3683 |
Symbol | |
ID | 5672049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4361013 |
End bp | 4362272 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641242566 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001507986 |
Protein GI | 158315478 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0804885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCACT TCCCGAAGCC CGCGGAGGGC AGCTGGACCG AGCACTACCC GGACCTCGGT ACCGGGTTCG TCTCGTTCGA GGACTCGATC TCGCCCGAGC ACTATGAGCT GGAGCGCAAG GCGATCTTCG GGCGGACCTG GCTCAACGTC GGCCGGGTCG AGCAGATCCC GCGGACCGGC AACTACTTCA CGAGGGAGAT CCACGCCGCG CGTGCGTCGC TGATCGTCGT CCGCGACGCC GAAGGCGAGG TCCGCGCGTT CCACAACGTC TGCCGGCATC GCGGCAACAA GCTGGTGTGG AACGACTTCC CCCAGGAGGA GACGCACGGC ACGGCACGGC AGTTCACCTG CAAGTACCAC GCCTGGCGTT ACGGGCTCGA CGGCTCCTGC ACGTTCGTCC AGCAGGAGTC GGAGTTCTTC GACCTCGACA GGTCGCAGCT CGGGCTGGTA CCGGTGCGCT GTGAGCTGTG GGAGGGCTTC ATCTTCATCA ACCTCGACAG CGAGGGCACG ACATCGCTGG CCGAGTACCT GGGGCGCTTC GCGAAGGGAC TCGAGGGCTA TCCCTTCGGC GAGATGACCG AGGTCTACAA GTACCGGGCC GAGGTCGGGA GCAACTGGAA GCTCTACATC GACGCGTTCG CGGAGTTCTA CCACGCGCCC GTCCTGCACG CGAAGCAGTA CGTGGGCGAC GAGTCACGCA AGCTGCTGGG CTACGGCTAC GAGTCACTGC ACTACGACAT CGACGGGCCG CACTCGATGC AGTCGGCGTG GGGCGGGATG TCGCCGCCCA AGGACCTCAA CATGGTGAAG CCGATCGAGC GGGTCCTGCG CAGCGGCAAC TTCGGGCCGT GGGACCGCCC CGACATCGAG GGCCTGAACC CGCTGCCGTC AGGGGTCAAC CCGGTCGGCC ACCCGGCGTG GGGCCTGGAC TCCTACGTCT TCTTCCCGAA CTTCATGATC GTGGTGTGGG CGCCGGGCTG GTACCTCACC TACCACTACT GGCCGACGGC GTACAACCAG CACATCTTCG AGGGCACGCT GTACTTCGTG CCGGCCCGGA CCGCGCAGGA TCGCATCCGG CAGGAGCTTG CCGCGGTCAC CTTCAAGGAG TTCGCGCTGC AGGACTGCAA CACGCTCGAG GCCACGCAGA CCATGCTCGA GTCCCGCGCC GTCTCGCGGT TCCCGCTCAA CGACCAGGAG ATCGCCATCC GGCACCTCCA CAAGACCGCG GGTGACTACG TCGCCGGGTA CCAGGGGTAG
|
Protein sequence | MAHFPKPAEG SWTEHYPDLG TGFVSFEDSI SPEHYELERK AIFGRTWLNV GRVEQIPRTG NYFTREIHAA RASLIVVRDA EGEVRAFHNV CRHRGNKLVW NDFPQEETHG TARQFTCKYH AWRYGLDGSC TFVQQESEFF DLDRSQLGLV PVRCELWEGF IFINLDSEGT TSLAEYLGRF AKGLEGYPFG EMTEVYKYRA EVGSNWKLYI DAFAEFYHAP VLHAKQYVGD ESRKLLGYGY ESLHYDIDGP HSMQSAWGGM SPPKDLNMVK PIERVLRSGN FGPWDRPDIE GLNPLPSGVN PVGHPAWGLD SYVFFPNFMI VVWAPGWYLT YHYWPTAYNQ HIFEGTLYFV PARTAQDRIR QELAAVTFKE FALQDCNTLE ATQTMLESRA VSRFPLNDQE IAIRHLHKTA GDYVAGYQG
|
| |