Gene Information |
Locus tag | Franean1_6904 |
Symbol | |
ID | 5675217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8411592 |
End bp | 8413013 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245753 |
Product | glutamate decarboxylase |
Protein accession | YP_001511144 |
Protein GI | 158318636 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0076] Glutamate decarboxylase and related PLP-dependent proteins |
TIGRFAM ID | [TIGR01788] glutamate decarboxylase |
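The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the final codon is a stop codon contributing no residue. The variable names are illustrative, not part of the record.

```python
# Values copied from the record above.
start_bp = 8411592
end_bp = 8413013

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1422
print(protein_length)  # 473
```

Both results match the Gene Length and Protein Length fields, and the gene length is a multiple of three, as expected for an unspliced CDS.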
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCCTAC ACGGCCGGGC GAGAGGCAGG CGCGGCGACC AGGCCGTGGA GGTACGGCCG CACCTGGTCA TCCCGGACGA GGACGGGCCC GTCCCCCGGT ACCGCATGCC GCGCTCCTCG ATGTCCGCCG AGACCGCCTA CCAGATCGTG CGCGACGAGC TGATGCTGGA CGGCAACGCC CGACTGAACC TGGCCACGTT CGTGACCACC TGGATGGACG AGCACGCCGA CCGGCTGATG ACCGAGTGTG CGGCGAAGAA CATGATCGAC AAGGACGAGT ACCCGCAGAC GGCCGAGCTC GAGGCGCGGT GCGTCAACAT GCTCGCCGAC CTGTGGCACG CCCCCGACGC CACCGACGCC GTCGGCTGCT CGACGACCGG GTCCTCCGAG GCGTGCATGC TCGCCGGGCT GGCGATGCTA CGCCGGTGGC GCTCGACCAG GGAGCCCCAC CGCGGGGAAC AGGGCGGCGG GCAACGGGGC ACCGGTGCGC GGCCGAACAT CGTCATGGGT GCCAACGTGC AGGTGTGCTG GGAGAAGTTC GCCCGCTACT GGGACGTCGA GCCCCGGCTG ATGCCACTCG CGCCCGGGCG GACCCACCTC ACCGCCCCGG AGGCCGTCGC CCGCTGCGAC GAGAACACGA TCGGGGTCGT GGCCGTCCTC GGCTCGACGT TCGACGGGAC GTACGAGCCG GTGGCCGAGA TCGTCGCCGC GTTGGACCAG CTGGCGGCCT CGGGCGGCCC GGACGTCCCG GTGCACGTCG ACGGGGCTTC CGGCGGGTTC ATCGCCCCGT TCTGCGACCC CGACCTGGTC TGGGATTTCC GGCTCGAGCG CGTGGTCTCC ATCAACGCCT CCGGGCACAA GTACGGGCTG GTCTATCCCG GGGTCGGCTG GGCGCTCTGG CGCGACGCCC GGCACCTTCC GGCCGAGCTC GTCTTCGACG TGGACTATCT CGGCGGCTCG ATGCCGACCT TCGCGTTGAA CTTCTCCCGG CCGGGAGCAC AGGTCGTCGC GCAGTACTAC TCTCTGCTAC GACTCGGCCG GGCCGGTTAC CGGCATACCG CCCGCACGTG CCGTGACAAT GCGCGTTGGC TGGCGGATGA AATCGCGAAG CTCGGCCCGT TCGAGCTGAT CTCGGACGGT TCGGGGATCC CGGCCTTCGC GTTCACGACG AGGGATGCGG CCGAGTTCAG CGTCTTCGAG GTCTCCGAGG CGCTGCGCGC CCGTGGCTGG CTGGTGCCCG CATACCGATT CCCGCCTGAT CTGGCCGAGC TGGCGGTACT GCGGATCGTC GTGCGGGCCG AGTTCAGCCG GGATCTGGCG CATCTGCTGG TCGAGGACCT GCACCGGGTG GTCGGACGAC TCTCCGGCCC CCGGTGGCGG ACCGCGGCCG GCGGCGCCGA TCTCGCGTCG TTCCACCATT GA
Protein sequence | MALHGRARGR RGDQAVEVRP HLVIPDEDGP VPRYRMPRSS MSAETAYQIV RDELMLDGNA RLNLATFVTT WMDEHADRLM TECAAKNMID KDEYPQTAEL EARCVNMLAD LWHAPDATDA VGCSTTGSSE ACMLAGLAML RRWRSTREPH RGEQGGGQRG TGARPNIVMG ANVQVCWEKF ARYWDVEPRL MPLAPGRTHL TAPEAVARCD ENTIGVVAVL GSTFDGTYEP VAEIVAALDQ LAASGGPDVP VHVDGASGGF IAPFCDPDLV WDFRLERVVS INASGHKYGL VYPGVGWALW RDARHLPAEL VFDVDYLGGS MPTFALNFSR PGAQVVAQYY SLLRLGRAGY RHTARTCRDN ARWLADEIAK LGPFELISDG SGIPAFAFTT RDAAEFSVFE VSEALRARGW LVPAYRFPPD LAELAVLRIV VRAEFSRDLA HLLVEDLHRV VGRLSGPRWR TAAGGADLAS FHH
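The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before use. The following is a minimal sketch of how the gene sequence field could be cleaned, its GC fraction computed, and its translation checked against the protein field. The function names (`strip_blocks`, `gc_fraction`, `translate`) are hypothetical helpers written for this illustration. The codon table built here is the standard one. Translation table 11 uses the same codon-to-amino-acid assignments and differs only in permitted start codons, which this sketch does not model.

```python
# Standard codon table in NCBI "TCAG" order (same amino acid assignments as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_blocks(field: str) -> str:
    """Join the space-separated blocks of a sequence field into one string."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence field, copied from the record.
prefix = strip_blocks("ATGGCCCTAC ACGGCCGGGC GAGAGGCAGG")
print(translate(prefix))  # MALHGRARGR
```

The printed residues match the first block of the protein sequence field. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.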