Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1696 |
Symbol | |
ID | 5670098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2028499 |
End bp | 2029905 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240614 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001506040 |
Protein GI | 158313532 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.255824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.195294 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTCGGG ACGGGGACGG ATTCGACGAG CTGCTCGACG GCGCGGGCTA CGACGACTAC GACGACCGGG ACTACAACGA CCGGGCGTAT GAGGAGCAAC GCCGTGCCGC GGCGGGTGCG CGGCGCTCCC GCCGCGGCCG CGCCGGGAGC GTGGACGGCC GCGACCGGGC CGGCCTGGCC GCGGGTGAGC GCGAGCCCGC CGGCTACGAG GACGGCTTCG CGCCGGACGA GGCCGACGCG TACGGGGCCT ACGGCGGCGC CTATGACGAC GACGATGGCG ACCCCGACGA CGGGGCCGCC CCGCCGAGGC GTGGGCGTGA CGATGACCGG GGGCGGCCCG GTGGACGTCG GCCGGGCTCG GCCCTGCCGA AGCTGATCGG TGTGCTGGTC GTGGTCGCCG TCCTGGTCGG TGCGGGGATC TTCGGCGTCG GCAAGGTCAT CGGCCGGATC GGCGGGGAGC CCGCGGCCGA CTACGCGGGG TCCGGCGAGG GCATCGTCGT CGTCCAGGTT CCCGACGGCG CCACGTCGTC GGAGATCGCG GGCCGGCTGG CCGCCTCCGA CGTGATCGCC AGCCGGCAGG CCTTCGTCAA CCTCGCGTCG CGGGACCAAC GCGCGCTGTC GATCCAGCCG GGCACCTACC GGCTGCGCTC GAAGATGAGC GCGGCGGCGG CGCTCGACGC GCTGCTCGAC GACGCGTCCT CCGCGCTGTT CCGCTACACG ATCAGCCCTG GGGACACGGT CCGGCAGGTG TTCAAGGAGC TGTCCACACG TACGGGCACG CCGGTGGCCG ACCTGGAGGC GATCGCGCGC AAGCCGTCGA CGCTGGGCCT GCCCGACTAC GCCACGGGGC TGGAGGGCTA CCTCTTCCCG TCGACCTACG ACGTCGCGCC CGGCACGGAC CCGGTGGATG TGCTCAAGGA GGCCGTCGCC CGGTTCCGGG CCAACGCCGA GGAGATCGAC CTCGTCGGGC GCGCCGAGGC CGGGCACGTG AAGCCCCAGG ACGTTGTGAT CATCGCCTCC ATCATCGAGA AGGAGGTGGC GAACGAGGGC GAGGGGCCCA AGGTGGCGCG GGTCATCTAC AACCGGCTGA ACGACACCTC GGGCCGCTTC CGGCGGATCG ACATGGACTC GACCACCCGG TACGCCCTCG ACGAGTACGA GGGCCCGCTG ACCCAGGACC AGCTCCGCCA GAACAATCCG TACAACACCC GCGCGGTCGA GGGTCTCCCA CCCGGCGCGA TCTCGAATCC GAGTACCTGG GCGATCGAGT CCGCGCTCAG CCCGGCACAG GGGTCGTGGT TCTACTTCGT CTCCATGCCC CAGACGCACG AGACGGTCTT CGCCACCACC GACGCGGAGT TCCAGGACGC GCTGGACGAA TACCATCGCC AGGGAGGGGC CGAGTAG
|
Protein sequence | MIRDGDGFDE LLDGAGYDDY DDRDYNDRAY EEQRRAAAGA RRSRRGRAGS VDGRDRAGLA AGEREPAGYE DGFAPDEADA YGAYGGAYDD DDGDPDDGAA PPRRGRDDDR GRPGGRRPGS ALPKLIGVLV VVAVLVGAGI FGVGKVIGRI GGEPAADYAG SGEGIVVVQV PDGATSSEIA GRLAASDVIA SRQAFVNLAS RDQRALSIQP GTYRLRSKMS AAAALDALLD DASSALFRYT ISPGDTVRQV FKELSTRTGT PVADLEAIAR KPSTLGLPDY ATGLEGYLFP STYDVAPGTD PVDVLKEAVA RFRANAEEID LVGRAEAGHV KPQDVVIIAS IIEKEVANEG EGPKVARVIY NRLNDTSGRF RRIDMDSTTR YALDEYEGPL TQDQLRQNNP YNTRAVEGLP PGAISNPSTW AIESALSPAQ GSWFYFVSMP QTHETVFATT DAEFQDALDE YHRQGGAE
|
| |