Gene Information |
Locus tag | Franean1_6111 |
Symbol | |
ID | 5674432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7438201 |
End bp | 7439379 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244963 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001510361 |
Protein GI | 158317853 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
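
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, but both agree with the listed values. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the record for Franean1_6111.
length_bp = gene_length(7438201, 7439379)
print(length_bp, protein_length(length_bp))  # 1179 392
```

Both results match the Gene Length (1179 bp) and Protein Length (392 aa) fields above.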
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0611343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.194203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCGGTCC GCGGAGCTAG GCTGGATCGC ATGGATGCCG GGCTCAAGCG CGAGATCGAA GCCAAGGTCC ACGACGGTGC CCGGCTCAGC CGGGCCGACG GTGAGGCCCT CTACGCGAGC GACGACCTCG CCTGGCTGGG CGGTCTCGCG CACGAGGTGC GTACCCGAAA GAACGGCGAC CAGACCTTCT TCAACGTGAA CCGGCACCTC AACCTGACGA ACGTCTGCTC GGCCTCGTGC GCCTACTGCT CGTTCCAACG CAAGCCCGGC GAGTCGGACG CCTACACTAT GCGCATCGAG GAGGCCGTCC GGCTGGCGAA GGATATGGAG CCGGCCGGGA TCACCGAGCT GCACATCGTC AACGGCCTGC ACCCGACGTT GCCGTGGCGT TACTATCCGC GGTCGCTGCG CGAGCTGGGG AAGGCGCTGC CCGGCGTCGC GCTCAAGGCG TTCACCGCCA CCGAGATCCA CTGGTTCGAG AAGATCAGCG GCCTCTCCGC GGACGAGATC CTGGACGAGC TCATCGACGC GGGCCTGGAG TCGTTGACGG GCGGCGGCGC GGAGATCTTC GACTGGGAGG TGCGGCAGAA GATCGTCGGC CACGAGACCC ACTGGGAGGA CTGGTCGCGG ATCCACCGTC TCGCGCACTC CAAGGGCCTG CGCACGCCGT GCACGATGCT GTACGGCCAT GTCGAGGAGC CGCGGCACCG GGTCGACCAC GTGCTGCGCC TGCGTGAGCT GCAGGACGAG ACGGGCGGTT TCGCGGTGTT CATCCCGCTG CGCTTCCAGC ACGACTCGGT CGGCGATCCC CGCAACCGCC TGATGAACCA GCCGATGGCG ACCGGCGCGG AGGCTCTCAA GACGTTCGCG GTGTCGCGGC TGCTGTTCGA CAACGTCGAT CACGTCAAGT GCTTCTGGGT GATGCACGGG CTGACCACCG CCCAGCTGTC CCTGAACTTC GGCGTCGACG ACCTCGACGG CTCGGTCGTC GAGTACAAGA TCACTCACGA CGCGGACGGC TTCGGAACGC CGAACACGAT GACCCGGGAG GATCTTCTAT CCGTGATCCG TGACGCGGGC TTCCGGCCGG TCGAGCGGGA CACCCGCTAC CGGGTCGTGC GCAGGTACGA CGGTCCGGAC ACCACCCGGC GGGACAACCC CGTCTCGATC GACGCCTGA
|
Protein sequence | MSVRGARLDR MDAGLKREIE AKVHDGARLS RADGEALYAS DDLAWLGGLA HEVRTRKNGD QTFFNVNRHL NLTNVCSASC AYCSFQRKPG ESDAYTMRIE EAVRLAKDME PAGITELHIV NGLHPTLPWR YYPRSLRELG KALPGVALKA FTATEIHWFE KISGLSADEI LDELIDAGLE SLTGGGAEIF DWEVRQKIVG HETHWEDWSR IHRLAHSKGL RTPCTMLYGH VEEPRHRVDH VLRLRELQDE TGGFAVFIPL RFQHDSVGDP RNRLMNQPMA TGAEALKTFA VSRLLFDNVD HVKCFWVMHG LTTAQLSLNF GVDDLDGSVV EYKITHDADG FGTPNTMTRE DLLSVIRDAG FRPVERDTRY RVVRRYDGPD TTRRDNPVSI DA
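
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is an illustration only, not part of the record: it strips the grouping whitespace, computes the GC fraction, and builds a reverse complement, which can be useful because the gene lies on the minus strand. All function names are hypothetical, and the example input is only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove grouping whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(raw: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "T": "A", "C": "G", "G": "C"}
    seq = clean_sequence(raw)
    return "".join(complement[base] for base in reversed(seq))


# First two ten-base blocks of the gene sequence listed above.
fragment = "TTGTCGGTCC GCGGAGCTAG"
print(clean_sequence(fragment))
print(gc_content(fragment))
print(reverse_complement(fragment))
```

Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field of the record, assuming that field was computed over the same 1179 bp.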