Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2523 |
Symbol | |
ID | 5675695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3002768 |
End bp | 3003874 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641241439 |
Product | poly-gamma-glutamate synthesis protein |
Protein accession | YP_001506860 |
Protein GI | 158314352 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGCGAGG GTGCGGTGAG GTTGTTCCTC TGCGGCGACG TGATGCTCGG CCGCGGTATC GACCAGATCC TGCCGCATCC CGGTGACCCG ACACTTTCCG AGAGATACGT CTGGGATGCC CGCAGCTACG TCGAGTTGGC AGAGGCGGTG AACGGCCCGA TCCCTCACCC GGTCGACTTC GCCTGGCCCT GGGGAGATGT GCTGCCAGCG CTGGACGAGG CTGCGCCCGA TGTCCGGGTG CTGAATCTGG AAACCACCAT CACCCGGTGC GATGCCTTCG CGACGGGGAA GGAAGTCCAC TATCGGATGA GCCCGGACAA CCTGCCCTGC CTGACTGCCG CTCGACCTGA CGTGTGTGTG CTGGCAAACA ACCATTTGCT TGATTTCGGT CACCGGGGGC TCATCGAGAC GCTCGATGTG CTGTCCGGTG CCGGGCTGAT GGGGGCGGGA GCCGGACACG ACGCAGAAGA GGCCCGCCGG CCCGCGGTTG TGCCGATCGA TGGCAGCCGG CGGGTCCTGA TCTTCTCGAT CGGGCTGCCG TCCAGCGGCA TTCCAGCGAC GTGGGCCGCG ACCGAGGGCA GGGCCGGCGT TGACCTCGTC CCGGAGCTGT CGGATGGCTG GACCGACGAG GTCGCGGGCC GTGTCCGGCA GGTGAAGCGG CCCGGTGACC TCGCCGTCGC GTCCATCCAC TGGGGCTCCA ACTGGGGCTA CGACGTCGAT GACGACCAGA TCCGCTTCGC GCATCGGCTG GTGGACGGCG GTATCGACAT CGTGCACGGA CACTCGTCGC ACCATCCACG GCCTGTCGAG ATATATCGGG ACAGGCTCAT CCTGCACGGG TGCGGCGATT TCATCGACGA CTATGAGGGG ATCGCCGGCT ACGAAAGCTA TCGGGACGAC CTGCGGCTGG CGTACTTCGT GTCGGTCGAT CCGGGCAGCG GGAATCTGAT CGACCTGCGT GTGATGCCCC TGCAGGCACG GCAGATGCGA CTTCGGTACG CCGGCTCCGA AGACTCCGCC TGGCTACAGG AGATTCTTGA CAGCATCAGC CGCGGCTTCG GGACACGGTT CGGCCTTCAG GCGGACGGCA TGCTCTCGCT CCGCTGA
|
Protein sequence | MCEGAVRLFL CGDVMLGRGI DQILPHPGDP TLSERYVWDA RSYVELAEAV NGPIPHPVDF AWPWGDVLPA LDEAAPDVRV LNLETTITRC DAFATGKEVH YRMSPDNLPC LTAARPDVCV LANNHLLDFG HRGLIETLDV LSGAGLMGAG AGHDAEEARR PAVVPIDGSR RVLIFSIGLP SSGIPATWAA TEGRAGVDLV PELSDGWTDE VAGRVRQVKR PGDLAVASIH WGSNWGYDVD DDQIRFAHRL VDGGIDIVHG HSSHHPRPVE IYRDRLILHG CGDFIDDYEG IAGYESYRDD LRLAYFVSVD PGSGNLIDLR VMPLQARQMR LRYAGSEDSA WLQEILDSIS RGFGTRFGLQ ADGMLSLR
|
| |