Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6099 |
Symbol | |
ID | 5674420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7424410 |
End bp | 7425957 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244951 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001510349 |
Protein GI | 158317841 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.61579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0270865 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCG TGTCGGGCCG GACAACATGC ACTCCGACGT GCACCGTTCC GACTTCGGCG GCCAGAACGG CGCCGGGCCT TCCGGCCCGG GAACGACCGG TGCGGGCGGG CCCGGCGCCG GGGCGGGTTC CGGCTCCCGC GACAGCTCGG GATACGACCG GATCTGAGCC GGTCCGCCGC CCGGGCCGGC GCCCGCCGGA GCCGGCGCCA CCCGGGCCGG CGCTTCCCCG GGCTGGTGCA GCCCGGGCCG GGGCTTCTGA GGCGGTGTCT TCCGAGCCGG CTCGTCCGAG TTCGGCCGCC CGCCGCCCGG GGCTCGCCCG GCGCAGGTCG TACGATCCTG GGGTGAGCCT TGTTGATGCC GACGGTGAGT CCCCGGCGGG CCGGGCCCGC GTTCCCGACG CCGAGATCCG CGCGCTGCTC GACCGGGCGG CCGACGGCGG GCGGATCTCG CCGGAGGAGG CGCTCCTCCT CTACACGTCC GCTCCGCTGC ACGGGCTCGG GCGCGCGGCC GACGCCGTCC GTCGCCGCCG GTACCCGGAC GGCATCGCCA CCTACATCAT CGACCGGAAC ATCAACTACA CCAACGTCTG CGTGACGGCC TGCCGGTTCT GCGCCTTCTA CCGGTCGCCG AAGCACGCCG AGGGCTGGGT CCGCGACCTC GACGACATCG TTGCCAAGTG CGGCGAGGCG GTCGAGCTGG GCGCCACCCA GATCATGCTG CAGGGCGGCC ACAACCCCGA GTTCGGGATC GAGTGGTACG AGCGGACGTT CGCCGGCATC AAGGCCGCCT ACCCGCAGCT CGCCCTGCAC TCGCTCGGCG CCAGCGAGGT CGTGCACATC GCGCGGACGT CCGACCTGGA CTTCGCCGAG GTGATCACCC GGCTGCGCGA CGCCGGGCTC GACAGCTTCG CCGGCGCGGG CGCGGAGATC CTCACCGAGC GTCCCCGGCA CGCGATCGCC CCGCTCAAGG AGCCCGGCCA CGTGTGGCTG TCGGTCATGG AGATCGCCCA CGGCCTGGGC CTGGAGTCGA CGGCGACGTT CATGATGGGC ACCGGCGAGA CCAACGCCGA GCGCATCGAG CACATGCGCA TGATCCGGGA CGTCCAGGAC CGCACCGGCG GGTTCCGCTC GTTCATCCCG TGGACGTACC AGCCGGAGAA CAACCACCTC GGCGGCCGGA CGCAGGCGAC GACCCTGGAG TACCTGCGGC TGGTGGCCGT CGCGCGGCTG TTCTTCGACA ACGTGGCCCA CCTGCAGGGC TCGTGGCTGA CCACCGGCAA GGAGGTCGGC CAGCTCACCC TGCACATGGG CGCCGACGAT CTCGGCTCGG TGATGCTCGA GGAGAACGTC GTCTCCTCCG CGGGGGCCCG GCACCGCACC AACCGGTCCG AGCTCATCCA CCTGATCCGG GCCGCCGACC GCATCCCGGC GCAGCGCGAC ACGCTGTACC GGCACCTCGT CGTGCACCGT GACCCGGCGC TCGACCCGGT CGACGACCGG GTGGCCTCGC ACTTCTCCTC GACCGCGCTG CCGCTGGTGT CGACCTGA
|
Protein sequence | MTPVSGRTTC TPTCTVPTSA ARTAPGLPAR ERPVRAGPAP GRVPAPATAR DTTGSEPVRR PGRRPPEPAP PGPALPRAGA ARAGASEAVS SEPARPSSAA RRPGLARRRS YDPGVSLVDA DGESPAGRAR VPDAEIRALL DRAADGGRIS PEEALLLYTS APLHGLGRAA DAVRRRRYPD GIATYIIDRN INYTNVCVTA CRFCAFYRSP KHAEGWVRDL DDIVAKCGEA VELGATQIML QGGHNPEFGI EWYERTFAGI KAAYPQLALH SLGASEVVHI ARTSDLDFAE VITRLRDAGL DSFAGAGAEI LTERPRHAIA PLKEPGHVWL SVMEIAHGLG LESTATFMMG TGETNAERIE HMRMIRDVQD RTGGFRSFIP WTYQPENNHL GGRTQATTLE YLRLVAVARL FFDNVAHLQG SWLTTGKEVG QLTLHMGADD LGSVMLEENV VSSAGARHRT NRSELIHLIR AADRIPAQRD TLYRHLVVHR DPALDPVDDR VASHFSSTAL PLVST
|
| |