Gene Information |
Locus tag | Franean1_4273 |
Symbol | |
ID | 5672628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5106996 |
End bp | 5108066 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243146 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001508563 |
Protein GI | 158316055 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
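The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 5106996
end_bp = 5108066
gene_length_bp = 1071
protein_length_aa = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold for this record, which suggests the listed coordinates, gene length, and protein length are mutually consistent.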
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.56971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.449147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGA CGGAAGGCGG CGTGGCGGCG ACGGCCTATA CCAGCCGGGA ACGTTTCGAA CTGGAACGTG AAGTCGTCCG GCGTTCACCG CAGCTCGTGG GCTACAGCTC CGAACTGCCC GACGCCGGAA CCTACTGCAC AAAGACCGTG ATGGACGTCC CCGTGCTGTT GACTCGCGGC GATGACGGAA CGGTGCGCGC TTTCCAGAAC GTGTGCGCCC ATCGGCAGGC TCAGGTCGCC CAGGGCTGCG GCGTGGCGCA GCGGTTCACC TGCCCCTGGC ATGCGTGGAC GTACGACGCG CGCGGCGAGT TCGTCGGCGG TCCGGGGCGG GAGGGCTTCC CCGCGACACT CAACGGTCAG GCGCGGCTCA ACGAGCTGCC CGCCGCCGAG AACGCGGGGT TCCTGTGGGT CGGGCTGGAC CCGGCCGCCG GGCCGCTCGA CATCGACGCA CACCTGGGCG AGCTGGGCCC GGAGCTGGCG TCCTGGAACA TCGGGTCGTG GGCGCCGGTG GGCGAGAAGG TCATCGACTC ACCGGTCAAC TGGAAGCTCG CGCTCGACAC GTTCGCCGAG AGCTACCACT TCGCGTCGGT GCACCGGGAC ACGTTCGCGC TGATCAACAA GAGCAACTGC GCGCTGTTCG ACTCCTACGG GCCGCACCAC CGCCTGGTCT TCCCGATGAA CCACATCACC GACCTGGCGG ACAAGCCGGA GGAGGAGTGG GAGCCGCTGA ACAACTTCGT GTTGATCTAC GCCCTGTTCC CCAACATCGT CCTGTCCGTG ACGGTCGCGA ACGGCGAGGT GTTCCGGGTG TACCCGGGCG AGCGCCCGGG CCATTCGGTC ACCTACCACC AGAACGCGTC CCCGATGGAC CTCACCGACG AGGCGACCCG GGAGACCGCG GAGACGATCT TCGACTACGC GCACAACGCC GTCCGCGACG AGGACTACGC GCTGGCGGCC CAGGTGCAGG CGAGCATGGC CTCGGGCGCG CGCGCGGACC TCGTCTTCGG GCGCAACGAG CCCGGCCTGC ACCACCGGCA CGAGGTTCTC GAGGACGCGC TCGGCCGCTA G
|
Protein sequence | MTPTEGGVAA TAYTSRERFE LEREVVRRSP QLVGYSSELP DAGTYCTKTV MDVPVLLTRG DDGTVRAFQN VCAHRQAQVA QGCGVAQRFT CPWHAWTYDA RGEFVGGPGR EGFPATLNGQ ARLNELPAAE NAGFLWVGLD PAAGPLDIDA HLGELGPELA SWNIGSWAPV GEKVIDSPVN WKLALDTFAE SYHFASVHRD TFALINKSNC ALFDSYGPHH RLVFPMNHIT DLADKPEEEW EPLNNFVLIY ALFPNIVLSV TVANGEVFRV YPGERPGHSV TYHQNASPMD LTDEATRETA ETIFDYAHNA VRDEDYALAA QVQASMASGA RADLVFGRNE PGLHHRHEVL EDALGR
|
| |
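The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of stripping that layout and computing GC content, which could be used to compare against the listed 69% figure; the helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full CDS.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above (not the full CDS).
example = clean_sequence("ATGACGCCGA CGGAAGGCGG")

print(len(example))
print(round(gc_fraction(example), 2))
print(example.startswith("ATG"))
```

To check the whole record, paste the full Gene sequence field into `clean_sequence` and compare the resulting length and GC fraction with the Gene Length and GC content fields.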