Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3693 |
Symbol | |
ID | 5672059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4371224 |
End bp | 4372819 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242576 |
Product | SMP-30/gluconolaconase/LRE domain-containing protein |
Protein accession | YP_001507996 |
Protein GI | 158315488 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0345166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCGC AAGGGGCGCG CTACAGCGCC GCATCCATCG GCCTCGCCGA AGGATGGAGC CTGGACCGGC TCACCTCGCC GAGCAGACTT TTCGGCGCGA ACGGATTACG CACCGGCCCG GACGGCCGCG TCTACGTGGC GCAGGTGACC GGCAGCCAGA TCAGCGCACT CGACGTCGAG ACCGGCCAGC TGGAGACCAT CAGCGCGAAG GGCGGCGCGA TCATCGCCCC CGACGACGTG GCCTTCGGCC CGCAGGGCGA CCTCTACGCC ACCGAGGTCA TGGACGGCCG GGTGAGCGTG CGCGGCGCCG ACGGCACCAC CCGGGTGCTG CGCGACGACC TCCCGAGCGT GAACGGCATC ACCGTCCACC AGGGCCGGCT GTTCGTCAAC GAGTGCCGGG TCGGCGGGCG CCTGATGGAG CTCGACCTCA ACGGCGGCGC GCCCCGCGTG CTGCTCGAGG ACCTGGCGAT GCCCAACGCC ATGGAGGTCG GCCCCGACGG TCTCCTGTAC TACCCGGTGA TGGGGACGAA CGACATCTGG CGGATCGACC CGGAGGGCGG CGAGCCGCAG CGGGTCGTGG GGGATCTCGG CGTCCCGGAC TCGCTGAAGT TCGACCCCGA GGGCTTCATC GTCTCCACCC AGGTCGCCTC CGGCGAGGTG CTGCGGATCA ACCCGCGCAA CGGCGAGCGC ACCGTGCTGG CCGCCCTCAC GCCTGGCCTG GACAACTGCA CCTTCGTCGG CGACCGGCTG TTCGTCTCGA ACTTCACCGG GCAGATCACC GAGATCCTCG GCGGCGGGGC GACCCGGGAG ACGCTGCCGG GCGGGCTGAA CTGGCCGCTG GACCTGACCG TCGGCGAGGA CGGCAACCTC TACATCGCCG ACGGGACGTA CTTCTACCGC CTGCGCCCCG GGGGCCGGCC GGAGACGCTC GGCATGCTCT TCAGCCCGGG CTACCCGGGC TTCGTCCGCG GGCTCACCGC CGTCGGCGCG GGCGAGTTCA TCGTCACCAC CTCGGGCGGG CAGGTCACGC GCTACCGGCC GGCGAGCGCG GAGAGCGAGG TGCTCGCCGA CGAGCTCGAC CAGCTCTACG GCGTGGCGGT CGCCCCGGGC GGCGACATCG TGGCCGCGGA GCTCGGCAAG GGCCGCGTGC TCTCCATCCG TTCCGGACGG GTCGAGGAGC TGGCGTCCGG GCTGAACGAT CCGCTGGGTG TCGCGTTCGG CGCGGACGGC ACGCTCTTCG TCTCCGAGTC GGGCGCCGGG CGGGTCGTCA CGATCTCCGG CTTCCGCGTC GACACGGTGG TCGACGGCCT GGAACAGCCG CAGGGAATCC TCGTGCGCGA CGGCCAGCTC CTCATCGTCG ACGCCGGCAC GAAGCAACTG ATCAGCGTCG ACCTCGGCAC CGGGGCCCGG CAGACCATCG CCCGTGACCT TCCGGTGGGC GCCCCGCCCG GCGTGACACC CAAGCCCCTG CGCGGGCTGC CGCCCTTCTC CGGGCCGCAG GGCCCGTTCG CGGGCATCGC CGCCGGACCG GACGGCACGC TCTACCTATC TGCTGACGCC GAGGGCAGCG TGCTCGCGCT GCGGCGGGAG GGCTGA
|
Protein sequence | MLSQGARYSA ASIGLAEGWS LDRLTSPSRL FGANGLRTGP DGRVYVAQVT GSQISALDVE TGQLETISAK GGAIIAPDDV AFGPQGDLYA TEVMDGRVSV RGADGTTRVL RDDLPSVNGI TVHQGRLFVN ECRVGGRLME LDLNGGAPRV LLEDLAMPNA MEVGPDGLLY YPVMGTNDIW RIDPEGGEPQ RVVGDLGVPD SLKFDPEGFI VSTQVASGEV LRINPRNGER TVLAALTPGL DNCTFVGDRL FVSNFTGQIT EILGGGATRE TLPGGLNWPL DLTVGEDGNL YIADGTYFYR LRPGGRPETL GMLFSPGYPG FVRGLTAVGA GEFIVTTSGG QVTRYRPASA ESEVLADELD QLYGVAVAPG GDIVAAELGK GRVLSIRSGR VEELASGLND PLGVAFGADG TLFVSESGAG RVVTISGFRV DTVVDGLEQP QGILVRDGQL LIVDAGTKQL ISVDLGTGAR QTIARDLPVG APPGVTPKPL RGLPPFSGPQ GPFAGIAAGP DGTLYLSADA EGSVLALRRE G
|
| |