Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4290 |
Symbol | |
ID | 5672645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5128400 |
End bp | 5129671 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641243163 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001508580 |
Protein GI | 158316072 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.502996 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCACT TCAGCAAGCC CCCGGCGGGG AGCTGGACCG AGAACTATCC CGAGCTCGGC ACCGCGCCCG TCGACTACAA CGACTCGATC GACCCCGAGT TCTACGAGCA GGAGCGCGAG GCGATCTTCA AGCGCTCCTG GCTGAACGTG GGCCGGGTCG AGCGCCTCCC CCGCACCGGC AGCTACTTCA CCAAGGAGCT GCCCGCCGCC GGGACGTCCC TGATCATCGT CAAGGGCGGC GACGGCAAGG TGCGCGCGTT CCACAACGTG TGCCGGCACC GCGGCAACAA GCTGGTGTGG AACGACTTCC CGGGTGAGGA GACCGCCGGC AGCTGCCGGC AGTTCATCTG CAAGTACCAC GCCTGGCGTT ACGACCTCAC CGGCGAGCTG ACCTTCGTCC AGCAGGAGGG TGAGTTCTTC GACCTGGACA AGAAGCAGTT CGGCCTCAAG GAGGTCGCCT GTGAGGTCTG GGAAGGCTTC ATCTTCATCA ACCTGAACCC GCAGGAGACC CTCACCGAGT ACCTCGGTGA CATGGCCAAG GGCCTCGAGG GCTACCCGTA CTCGGAGCTG ACCGAGGTCT ACTCCTACCG GGCCGAGGTC GGCGCCAACT GGAAGCTGTT CATCGACGCT TTCGCGGAGT TCTACCACGC GCCCGTCCTG CACCAGAAGC AGGCCGTCAA GGGCGAGTCC GAGAAGCTCA TCGGCTACGG GTTCGAGGCG CTGCACTACC AGTTGTTCAG CCCGCACTCG ATGGTGTCCT CCTGGGGCGG CATGGCCCCG CCGAAGGACC CGTCGATGGT CAAGCCGATC GAGCGCGTGC TGCGCAGCGG CCTTTTCGGC CCCTGGGACA CCCCCGAGGT CGACGGCCTC GAGGTCGAGA AGCTCCCGAC GGGGATCAAC CCGGTGAAGC ACAAGTCCTG GGGCACCGAC TCGTTCGAGA TCTTCCCCAA CTTCACGCTG CTGTTCTGGA AGCCGGGCTG GTACCTGACG TACCACTACT GGCCGACCGC GGTGAACAAG CACACGTTCG AGGCGAGCCT CTACTTCGCC CCGCCGAAGA ACGCCCGCGA GCGACTGGCC CAGGAGCTGG CGGCGGTGAC GTTCAAGGAG TACGCGCTCC AGGACGCCAA CACCCTCGAG GCCACCCAGA CGATGATCGG TACCCGTACC GTCACCGAGT TCCCCCTGTG CGACCAGGAA ATCCTGCTCC GGCACCTGCA CAAGGTCGTC GGCGACCGAG TCAAGGAGTT CAGCGATGCC GCTGCCGTCT GA
|
Protein sequence | MPHFSKPPAG SWTENYPELG TAPVDYNDSI DPEFYEQERE AIFKRSWLNV GRVERLPRTG SYFTKELPAA GTSLIIVKGG DGKVRAFHNV CRHRGNKLVW NDFPGEETAG SCRQFICKYH AWRYDLTGEL TFVQQEGEFF DLDKKQFGLK EVACEVWEGF IFINLNPQET LTEYLGDMAK GLEGYPYSEL TEVYSYRAEV GANWKLFIDA FAEFYHAPVL HQKQAVKGES EKLIGYGFEA LHYQLFSPHS MVSSWGGMAP PKDPSMVKPI ERVLRSGLFG PWDTPEVDGL EVEKLPTGIN PVKHKSWGTD SFEIFPNFTL LFWKPGWYLT YHYWPTAVNK HTFEASLYFA PPKNARERLA QELAAVTFKE YALQDANTLE ATQTMIGTRT VTEFPLCDQE ILLRHLHKVV GDRVKEFSDA AAV
|
| |