Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4160 |
Symbol | |
ID | 5672515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4941642 |
End bp | 4943297 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243033 |
Product | cobyrinic acid a,c-diamide synthase |
Protein accession | YP_001508450 |
Protein GI | 158315942 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1797] Cobyrinic acid a,c-diamide synthase |
TIGRFAM ID | [TIGR00379] cobyrinic acid a,c-diamide synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0028663 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.287821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGCG CCGGGCTGAT CCCCCGGCTG GTGCTCGCCG CCCCGGCGTC CGGCGCGGGG AAGACGACGG TGGCGACCGG GCTGATGGCC GCGCTGACCG CCCGCGGCCT GCGGGTGTCG GGTCACAAGG TGGGCCCGGA CTACATCGAC CCGGGTTATC ACGCGCTGGC GACCGGCCGG CCCGGCCGCA ATCTGGACGC GGTGCTGTGC GGGCCGGATC TGATCGGCCC GCTGTTCGCC CACGGGGCGG CCGGCGCGCA GCTGGCGGTC GTCGAAGGGG TGATGGGCCT GTTCGACGGG GTCGCCGCCC CTGCTGCCGG GCAGGAGGCT GATCACGGGT CGACCGCGCA TGTCGCCCGG CTCCTCGACG CGCCGGTGGT GCTGGTCGTC GACGCCGCCG GGGCCGGGCG GTCGGTGGCC GCGCTGGTCT GCGGGTTCGC CGCGTTCGAC CGGCGGGTGC GCCTCGCCGG GGTGATCCTC AACCGGGTCG GGTCGGACCG GCACCGACAG ATCCTCACCG GTGCGCTGGC CGGGATCGGG GTGGCGGTGC TGGGCGCGGT GCCCCGCGAC GGCGCGGTGC ACACCCCGTC GCGGCATCTG GGGCTGGTGC CGGCGGCGGA ACGGGCGGTG GCGGCCGCGC AGGCGGTGCG GCGTCTCGGT GTGCTGGTCG GGGCGGCGGT CGACCTGGAC GCGCTGATCC GCCTCGCCTC CTCGGCGCCA CCCCTGCCGG TCGACCCGTG GGATCCCGCC CGGCAGATCG CGCAGGCCAC CGCCACCGGT GCTGCGGGTC GGGTCGGGCA GCACACCGTC AACGCGGGCG CGGGGGGTGT GGTGCCCGGG GGGCGGCCGG TGCGGATCGC GGTGGCCGGC GGGGCGGCGT TCACCTTCGG CTACACCGAG CACGTCGAGC TCCTCACCGC CGCTGGCGCG CAGGTGCTCA CCGTCGACCC GCTGCGCGAC GAGACCCTCC CGGACGGCAC GGACGCGCTG GTCGTCGGTG GCGGGTTCCC CGAGGAGCAT GCCGGCGCGC TGGCGGCGAA CAGCCGGCTG CGTGGGCAGG TCGCGGCGCT GGCGGCCCGC GGCGCGCCGC TGGTCGCCGA ATGCGCCGGG CTGCTCTACC TGGGCCGTTC GTTGGACGGG ACGGCGATGT GTGGGGTTCT CGACACCGAC GCGGTGATGG GCCCGCGGCT CACCCTGGGC TACCGGCATG CGGTCGCCGC GGCTGACAGC CCGCTGGTGG CGGCGGGGAC GGTCGTCACC GCCCACGAGT TCCACCGCAC CCGGCTGAGC GTCGACCGGG CGGAGCTGCC CGGGACACCG GCCTGGCAGG TGGACATCCC ACCGCCGCGG TTCGGTGACA GCAACCCCGC CGCCGGTGCC GCCGGTGCCG GTGGGCGGGT GGACGGCGGA GTGGACGGTA CGGCGGCGGG TGGGCGGCCT GAGGGGTTCG TCCGCGGCGG GGTGCACGCC TCCTACCTGC ACCTGCACTG GGCGGGCCTA CCCGCCGTGC CGGCGCGGCT CGTCGCCGCC GCCGGTACCG CCCGCACCCG CCCGTCCGGC GGCACCCCAC CCGCCGGCGG CGGGCATGGG CATGGACGGG ACGCCGGTGG CAATCCCCCG GCGTCCCCTG GAACAACGGA GGTGTCATCG CGGTGA
|
Protein sequence | MVSAGLIPRL VLAAPASGAG KTTVATGLMA ALTARGLRVS GHKVGPDYID PGYHALATGR PGRNLDAVLC GPDLIGPLFA HGAAGAQLAV VEGVMGLFDG VAAPAAGQEA DHGSTAHVAR LLDAPVVLVV DAAGAGRSVA ALVCGFAAFD RRVRLAGVIL NRVGSDRHRQ ILTGALAGIG VAVLGAVPRD GAVHTPSRHL GLVPAAERAV AAAQAVRRLG VLVGAAVDLD ALIRLASSAP PLPVDPWDPA RQIAQATATG AAGRVGQHTV NAGAGGVVPG GRPVRIAVAG GAAFTFGYTE HVELLTAAGA QVLTVDPLRD ETLPDGTDAL VVGGGFPEEH AGALAANSRL RGQVAALAAR GAPLVAECAG LLYLGRSLDG TAMCGVLDTD AVMGPRLTLG YRHAVAAADS PLVAAGTVVT AHEFHRTRLS VDRAELPGTP AWQVDIPPPR FGDSNPAAGA AGAGGRVDGG VDGTAAGGRP EGFVRGGVHA SYLHLHWAGL PAVPARLVAA AGTARTRPSG GTPPAGGGHG HGRDAGGNPP ASPGTTEVSS R
|
| |