Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0121 |
Symbol | |
ID | 5668546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 145039 |
End bp | 146910 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239049 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_001504494 |
Protein GI | 158311986 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.412208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGTCC GTCACGCTCC GTTCCCGGTG TCCCGGCGCA CCTGCATGCC TGGCGGCATG ATGGGGCACA TGGGAGGCGA CGAGACCGAG CGGTCGGGCG CGGACATCGA GCGTCGGGAC GACTGCGACG TTCTGGTGAT CGGCAGCGGG TTCGGGGGCA GCGTGACCGC GCTCCGCCTG GCCGAGAAGG GCTACCGCGT CACGGTCGTC GAGGCCGGCC GCCGGTTCAC CCCGGCGACA CTGCCGAAGA CGAGCTGGGA CATCCGCAGG TACCTGTGGC TTCCGCGGCT CGGGTGCCAC GGGATCCAGA AGATCACCCT GCTCGGGCGG GTGCTGGTGC TCTCCGGAGC AGCGGTCGGC GGTGGATCGG TCGTCTACGC GAACACGCTG TACAGGCCCC TGGACGGTTT CTACGACGAT CCCCAGTGGG CGGACATCGC CGACTGGCGT GCCGAGCTCG GGCCGTTCTA CGGCCAGGCC GAGCGCATGC TCGGCTCCAC CCGGAACCCG TCGATGACCA TGGCGGACGA GGTCTTCCGG GACGTCGCAC GGGATATGGG TGTCGGGGAC ACCTTCCGCC TGGCCGACGT CGGCGTGTTC TTCGGCCGCG ACGGCCAGCG CGAGCCGGGG GTGACGGTTC CCGATCCCTA CTTCGGCGGC GCCGGCCCCG CCCGGACGGG CTGCGTCGAG TGCGGCGAGT GCATGACGGG ATGCCGGCGC GGCGCGAAGA ACACCCTGGA CCGCAACTAC CTCCACCTGG CCGAAGGGCT GGGCGCCCGC ATCGTGGCGG ACACGACCGT GCGCTCGGTG CGCCCCGACG GGCGCGGTGG CTACGAGGTC GAGACGGTCC GCACCGGCAG CCGGCGGCGG GACGCGCGGC GCTGGACGGC CCGCCAGGTC GTGTTCGCCG CGGGCGCCCT GGGAACCCAG CGCCTCCTGC TCGGCATGCG CGAACAGGGG CACCTGCCCG GCATCTCCGA CCGGGTCGGC CACCTGACCA GGACGAACTC CGAGGCGATC CTGGGCGCGA CCCGGCTGCG GCCGGATGGG CGGATCACCC GGGGGGTGGC CATAACGTCG TCCTTCTACC CGAACGATCA CACGCACGTG GAACCTGTCC GGTACGGCCG GGGCAGCAAC CTCATGGCGT TCCTGTCGAG CGCCATGACC GACGGCGGAG GCTCCCTGCC ACGCTGGGCG AAGCACCTGC TGCTGCTGGC CCGCCGCCCG TACCTCACGC TGCTCGCCCT GCCGTGGCGG TGGTCGGAGC GAACCATGAT CGCTCTCGTC ATGCAGTCCC GGGACAACTC GCTCACCGTC AGCCGGCGCC GCGGGCCGTT CGGGACGTCC TGGCTGACGA GCCACGCTGG CCACGGGGAG TCCAACCCGA CCTGGATACC GGAGGGCAAC CAGGCGGCGC GGCTGGTGGC GGAGCGGCTG GGCGGGCATC CGGGAGGCGC GATCACCGAG CTGGCCGACA TCCCGCTGAC GGCGCACATC CTCGGCGGGG CGGTCCTGGC CGCTGATCCC GCCCGGGGCG TGATCGACCC GTATCACCGG GTTTTCGGCC ATCCCGGCCT GCACGTTGTG GACGGCTCGG CGGTGCCGGC GAACCTCGGG GTGAACCCCT CGCTGACGAT CACGGCGATG GCGGAGCGGG CGATGGCCTT CTGGCCGAAC CGGGGTGACA CCGACCCCCG CCCGCCGGTG GGCGCGGCGT ACCGGCGGAT CGCGCCGGTG GCACCGCGGA ACCCGGCGGT GCCCGCGGAT GCACCGGGAC ATTATGAAAT AAGTGCCCTG ACCAGCACGA ATACTGATCT TCCCCGGGAT GCTGTAGCTG GCGGTGGCCG GGAGACTCCT CACAATATGT GA
|
Protein sequence | MVVRHAPFPV SRRTCMPGGM MGHMGGDETE RSGADIERRD DCDVLVIGSG FGGSVTALRL AEKGYRVTVV EAGRRFTPAT LPKTSWDIRR YLWLPRLGCH GIQKITLLGR VLVLSGAAVG GGSVVYANTL YRPLDGFYDD PQWADIADWR AELGPFYGQA ERMLGSTRNP SMTMADEVFR DVARDMGVGD TFRLADVGVF FGRDGQREPG VTVPDPYFGG AGPARTGCVE CGECMTGCRR GAKNTLDRNY LHLAEGLGAR IVADTTVRSV RPDGRGGYEV ETVRTGSRRR DARRWTARQV VFAAGALGTQ RLLLGMREQG HLPGISDRVG HLTRTNSEAI LGATRLRPDG RITRGVAITS SFYPNDHTHV EPVRYGRGSN LMAFLSSAMT DGGGSLPRWA KHLLLLARRP YLTLLALPWR WSERTMIALV MQSRDNSLTV SRRRGPFGTS WLTSHAGHGE SNPTWIPEGN QAARLVAERL GGHPGGAITE LADIPLTAHI LGGAVLAADP ARGVIDPYHR VFGHPGLHVV DGSAVPANLG VNPSLTITAM AERAMAFWPN RGDTDPRPPV GAAYRRIAPV APRNPAVPAD APGHYEISAL TSTNTDLPRD AVAGGGRETP HNM
|
| |