Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4104 |
Symbol | |
ID | 5672462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4888115 |
End bp | 4889551 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242980 |
Product | NtaA/SnaA/SoxA family monooxygenase |
Protein accession | YP_001508397 |
Protein GI | 158315889 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.958651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCTC GCACGCTCAA GCTGGGCGCC GTCCTCATCG GGGTCGGCGG CCCGGGGCAG CACAACACCT GGCTCAGCCC CGAGATCCCG GGCAACGCCA GCGTGGACGT CCGCTGGTAC ATCGCCCGCG CGCAGCAGGC GGAGGCCGCG AAGTTCGACC ACGTCTTCAT CGTGGACAGC CAGTTCATCA CCGCGGACTC GCCGAACCAC TACCTCAGCC GGCTCGAGCC GCTGACGCTG CTGTCCGCGA TCGCCGTGCA CACCAGCCAC ATCGGCCTGG TGGGGACGAT CACCACCTCC TACAACGAGC CGTTCAACGT CGCCCGGCGG CTCGCCTCGC TCGACCACAT CAGCGGCGGC CGGGCCGGCT GGAACGTTGT CACCAGCGGT GACGCCGGGA CCGCGGGCAA CTACAGCCGC GACGAGCACT ACGACTACGC CACCCGGTAC GGGCGGGCGC ACGAGCACGT CCGGATCGCG CAGGGGCTGT GGGACTCTTA CGAGGCCGAC GCCTTCCCCC GGGACAAGGA CCGCGGCGTC TTCTTCGACC GCGACCGCCA GCACGCGCTG AACCACAAGG GCGAGTACTT CTCGGTGGTC GGGCCGCTCA ACCTCGAGCG CACCCCCCAG GGCCAGCCGG TGATCTTCCA GGCGGGTGAC TCGGAAGAGG GCCGCGACCT GGGGGCGAGC ATCGCCGACG CGATCTTCAG CCACGCCCAG ACCCTTGAGC AGGCACAGGC GTTCCGCAGC GAGCTGCGGG CGCGGGCGGC GACCAAGGGG CGCGACCCCG AGCAGCTGCT CGTCATCCCG GGGATTTTCC CGATCATCGC CGACACGGAC GAAGAGGCCC GCGCCAAGGA GGACGCGACC CTCGGCGCGA AGAGCTTCGA GCGGGCGCTG GCCGAGCTCG GCCGTCCCTT CGGCTGGCAC GACTTCTCGC AGTACGACCT GGACGCACCG TTCCCCGAGG TCGGCGACGC CGGCGAGCGC AGCTTTCGGA CCCAGGCGGA GGGGATCAAG AAGCTGGCCC GGGACAACAA CCTCACGCTG CGCCAGGTGG TCGAACACCA GCTCACGGCG TTCCGCTCGC CCTTCGTCGG CTCGCCGCGC ACCGTCGCGA ACGAGATCGA GCGCTGGTTC GAAGACGGCG GCTTCGACGG CATCAACATC ATGGTGACGG TCCCCAGCGA GTTCGCCCTG TTCGTCGACC AGGTGCTGCC GATCCTGCGT GAGCGTGGCG TCGCCCGCAG CGAGTACGAG TCCACGACGC TGCGCGGCAA CCTCGGCCTG CCCGTCCCGG AGAACCGGCA CACCGCGGCC CGCCGCCAGC AGCAGCACTC TGCCGCCGCC CCCACCGACG CCGCCGTCGC CGACAGCGCG CCCGCACTCG ACGTCCCGCC GGCGCCCGCG GCCACGGAGT CGGCCCTCAT CTCCTGA
|
Protein sequence | MSARTLKLGA VLIGVGGPGQ HNTWLSPEIP GNASVDVRWY IARAQQAEAA KFDHVFIVDS QFITADSPNH YLSRLEPLTL LSAIAVHTSH IGLVGTITTS YNEPFNVARR LASLDHISGG RAGWNVVTSG DAGTAGNYSR DEHYDYATRY GRAHEHVRIA QGLWDSYEAD AFPRDKDRGV FFDRDRQHAL NHKGEYFSVV GPLNLERTPQ GQPVIFQAGD SEEGRDLGAS IADAIFSHAQ TLEQAQAFRS ELRARAATKG RDPEQLLVIP GIFPIIADTD EEARAKEDAT LGAKSFERAL AELGRPFGWH DFSQYDLDAP FPEVGDAGER SFRTQAEGIK KLARDNNLTL RQVVEHQLTA FRSPFVGSPR TVANEIERWF EDGGFDGINI MVTVPSEFAL FVDQVLPILR ERGVARSEYE STTLRGNLGL PVPENRHTAA RRQQQHSAAA PTDAAVADSA PALDVPPAPA ATESALIS
|
| |