Gene Information |
Locus tag | Franean1_3932 |
Symbol | |
ID | 5672293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4701508 |
End bp | 4702368 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242811 |
Product | hypothetical protein |
Protein accession | YP_001508228 |
Protein GI | 158315720 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03620] probable F420-dependent oxidoreductase, MSMEG_4141 family |
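The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the CDS is taken to end in a single stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that check follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length_bp(4701508, 4702368)
print(length, protein_length_aa(length))
```

With the values in this record the sketch reproduces the listed Gene Length (861 bp) and Protein Length (286 aa).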
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0176124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTT CCGGAGTTGG CATCTGGAGC CAGCAGCTGC GGTGGGGGGA TCAGGCCGAG GTGGCCGACG CGGCGGCGGA GCTGGAGGAG CTGGGCTACC GCGCGCTGTG GATTCCCGAC GTCGGGGGCC CGGTGTTCGA GGCCGTCGAG GCACTGCTGG CGGCGACCCG ATCGGTGACC GTGGCCACCG GCATCCTCAA TCTGTGGATG CACAGCCCGG CGGACACGGC CACGGGATAC GCCGCGCTCA CGGCCACCCA CGGAGACCGC TTCCTGATGG GCATCGGGGT GAGCCACGCA CCGCTGGTCG ACGCGTCGGA ACCCGGTCGC TACCGCCGGC CGCTGGCGGC CATGACGGCG TTCCTCGACG GGCTGGACAA GGCCGACCAG CCGGTCCCAC CCGAGCGCCG CGTGCTGGCC GCGCTCGGCC CCAAGATGCT CCAGCTGGCC GGAAGCCGGG CGCGCGGCGT ACATCCGTAC CTCGTGACGC CTGAGCACAC CCGGCAGGCA CGCGACAGCC TGGGAGCCGG GCCCCTCGTG CTCCCCGAGC AGACGGTGAT CCTCACCACC GACGCCGACG AGGCCCACGC GATCGGCCGG GACTGGTTGC GCGGGTACCT GGGGCTGCCG AACTACGCGA ACAACATCCG CAGGCTCGGC TTCACCGACG ACGACGTCGC GGACGCCAGC GACCGGCTCT TCGACGCCCT GATCGCCTGG GGGGACGAGG ACGCGATCCG GCGCCGGGTC GACGAACACC GGGCCGCCGG TGCCGACCAC GTCTGCGTCC AGGTTCTCAC CGCCGACCCG AAGGCCTTTC CACGCGAGCA GTGGCGCCGG CTCGCTCCCG CGCTCCGGTA G
|
Protein sequence | MDISGVGIWS QQLRWGDQAE VADAAAELEE LGYRALWIPD VGGPVFEAVE ALLAATRSVT VATGILNLWM HSPADTATGY AALTATHGDR FLMGIGVSHA PLVDASEPGR YRRPLAAMTA FLDGLDKADQ PVPPERRVLA ALGPKMLQLA GSRARGVHPY LVTPEHTRQA RDSLGAGPLV LPEQTVILTT DADEAHAIGR DWLRGYLGLP NYANNIRRLG FTDDDVADAS DRLFDALIAW GDEDAIRRRV DEHRAAGADH VCVQVLTADP KAFPREQWRR LAPALR
|
| |