Gene Information |
Locus tag | Franean1_4813 |
Symbol | |
ID | 5673154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5746233 |
End bp | 5747327 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243669 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_001509085 |
Protein GI | 158316577 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
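The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


length = gene_length(5746233, 5747327)
print(length, protein_length(length))  # prints "1095 364"
```

Both values agree with the Gene Length and Protein Length rows of this record.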
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0820217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTGA CCTTCCACTG GTTCCTGCCC ACCTCCGGCG ACGCGCGCGA CGTGGTCGGC GGCGGGCACG GCGCACCCGC CACGGCGAGC ACCCGCGGCC GGCCGCCGAC ACTTGAGTAC CTCGGCCAGA TCGCCCGCTC GGCCGAGCAG CTCGGTTTCG CCGGCGCCCT GACCCCCACC GGTTCGTGGT GCGAGGACGC CTGGCTGACC ACGGCGATGC TGGTCGAGGC GACGAAGCGG CTGAAGTTCC TGGTCGCCGT ACGGCCGGGG CTGGTCTCAC CGACGCTGGC CGCGCAGATG GCGTCCACGT TCCAACGGCT CTCCGGTGGG CGGCTGCTGC TCAACGTCGT CACCGGCGGC GAGGCGGACG AACAGCGCGG CTACGGCGAC TTCCTCGACA AGGACGACCG GTACGCCCGC TGCGACGAGT ACCTGTCGGT GCTCTCCGAT CTGTGGCGGG GACGGACCGT GGACGTCGAC GGCCGCTTCG TGCGGCTGAG CGGGGCGCGG CTGTCGCGGC TCCCCGACCC GCCGCCGGAG ATCTACTTCG GGGGCTCGTC GCCGGCCGCG GTCGAGGTGG CCGCGCGCCA CGCCGACGTC TACCTGACGT GGGGCGAACC GCCGGACGCC GTCGCCGCCA AGTTCGCCGA GGTCCGGTCC AGGGCCGAGG CGGCCGGGCG GGCGCCGCGT TTCGGGCTGC GCGCCCACGT CATCACCCGG GACACCGCCG AGGACGCCTG GGCGGCGGCG CACCGCCTGA TCGCCGGCCT GGACGCCCGG ACGGTCGCGG CGGTGCAGGA GCAGCTGGCG CGCAGCGAGT CCGAGGGCCA GCGCCGGATG CTCGCGCTGC ACAACGGGTC GACCGCGAAG CTGGAGATCT TCCCGAACCT GTGGGCGGGG ATCGGGCTGG TCCGCGGCGG CGCCGGGACC GCCTTCGTCG GCAGCCACCA CGAGGTGGCG GACCTCATCG AGACCTACCA CCAGGTCGGC GTCACCGAGT TCGTGCTCTC CGGCTATCCC CACCTGGAGG AGGCGTACTG GTTCGGGGAG GGTGTGCTCC CGATCCTGCG CGCCCGGGGC CGATGGTCCC CGTGA
Protein sequence | MSLTFHWFLP TSGDARDVVG GGHGAPATAS TRGRPPTLEY LGQIARSAEQ LGFAGALTPT GSWCEDAWLT TAMLVEATKR LKFLVAVRPG LVSPTLAAQM ASTFQRLSGG RLLLNVVTGG EADEQRGYGD FLDKDDRYAR CDEYLSVLSD LWRGRTVDVD GRFVRLSGAR LSRLPDPPPE IYFGGSSPAA VEVAARHADV YLTWGEPPDA VAAKFAEVRS RAEAAGRAPR FGLRAHVITR DTAEDAWAAA HRLIAGLDAR TVAAVQEQLA RSESEGQRRM LALHNGSTAK LEIFPNLWAG IGLVRGGAGT AFVGSHHEVA DLIETYHQVG VTEFVLSGYP HLEEAYWFGE GVLPILRARG RWSP
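The GC content and the protein sequence can both be derived from the gene sequence. The sketch below is one way to do that in plain Python, not the database's own pipeline. It assumes the sequence contains only unambiguous A, C, G, T bases, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares. Table 11 also permits alternative start codons, but this gene starts with ATG, so no special handling is included. The names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built from the NCBI-style layout: bases in TCAG order,
# first codon position varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGTCCCTGA CCTTCCACTG GTTCCTGCCC"))  # prints "MSLTFHWFLP"
```

Passing the full Gene sequence value to `translate` and `gc_content` allows a comparison against the Protein sequence and GC content rows of this record.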