Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3959 |
Symbol | |
ID | 5672320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4739732 |
End bp | 4740967 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242838 |
Product | monooxygenase FAD-binding |
Protein accession | YP_001508255 |
Protein GI | 158315747 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.442261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0366683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGACG TCATCGTGGT CGGCGCGCGC TGCGCCGGCT CACCGCTGGC CATGCTGCTC GCCCGCCAGG GCCACCGGGT CCTGGTCGTG GACAGGTCGA CCTTCCCCAG CGACACCGTG TCCACCCACT ACATGCACCA GACCGGCCTG GCCCGGTTGC GGGACTGGGG TCTGCTGGAC CGCCTGGTCG CCACCGGCGT CCCGCCGATG CGCCACCTCA CCTTCTCCTA CACCGGCCTG CACCTCGAGG GCTTCGCCGA CCCGATCGAC GGGATCACGG AGGTCTACTC ACCGCGTCGG ATCATCCTCG ACAAGCTGCT GGTGGACGCC GCCCGCGAGG CCGGCGCCGA GGTGATCGAG GGCTTCACGG TCAGCGACCT GCTCTTCGAC GATGGCCGGG TCACCGGCAT CCGCGGCCGG ACCGGCGAGG GACCCGAGCA GGAGTTCCGC GCCGCGTTCG TGGTCGGGGC GGACGGGCGG ACGTCCACCG TGGCCGACAA GGTCGGCGCG GACTTCTACC GGGTCGTGCC GGCGGCCGGC TTCATCTACT ACTCGTACTT CGAGGGCCTC GACTGGACGT TCCAGCACCG GACCGGCTTC GGGGAGCAGC AGTTCGGCGC CTGGCCCACG CACGACGGCC GGCACCTGGT GTCGATCATC CGCCCCCGCT CGGCGTTCAG CGAGTTCCGC GCCGACGTCG AGGGCAGCTT CCAGGCCATC TTCGACGCGG TCGTCCCCGA GCTCGGCGAG GACCTGCGGA CCCGCGGCCG CCGCGTCGAG GAGTTCCGTC CGATGCGCTA CCCGGACAAC TACTACCGGC GCTCGCACGG GCCCGGCTGG GCGCTGGTCG GCGACGCCGG CTACCACAAG GACCCGTTCA CCGGCTGGGG TATCACCGAC GCGTTCCTCC AGGCGCAGAC GCTGGCGGAC CGGCTGCATT CCGGCCTCGC CGGCGAGCGG ACGCTGGACG ACGCCGCCGC CGAGTACGTC AAGATCCGCG ACGAGGAGAG CCACGGGACG TTCGAGCTGA CCTGCACGCT CTCCCACCTC GTGCTGCCGC CGTTCCTGCA CTCGGCCTTC GCCGCGACGG CGCAGAGCCC CCGCTACACG AAGAAGTTCT TCGGGTTGAT CGCCGGTGGC GTTCCCGGCC ACGACTTCTT CCACCCCGAC AACCTCGCGG AGCTCTACGA GGAGGTCGGC ATGCCCGCCG AGAAGCGCCT GCTGTCGGCC AGCTGA
|
Protein sequence | MYDVIVVGAR CAGSPLAMLL ARQGHRVLVV DRSTFPSDTV STHYMHQTGL ARLRDWGLLD RLVATGVPPM RHLTFSYTGL HLEGFADPID GITEVYSPRR IILDKLLVDA AREAGAEVIE GFTVSDLLFD DGRVTGIRGR TGEGPEQEFR AAFVVGADGR TSTVADKVGA DFYRVVPAAG FIYYSYFEGL DWTFQHRTGF GEQQFGAWPT HDGRHLVSII RPRSAFSEFR ADVEGSFQAI FDAVVPELGE DLRTRGRRVE EFRPMRYPDN YYRRSHGPGW ALVGDAGYHK DPFTGWGITD AFLQAQTLAD RLHSGLAGER TLDDAAAEYV KIRDEESHGT FELTCTLSHL VLPPFLHSAF AATAQSPRYT KKFFGLIAGG VPGHDFFHPD NLAELYEEVG MPAEKRLLSA S
|
| |