Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0853 |
Symbol | |
ID | 5669269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1000496 |
End bp | 1001452 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239782 |
Product | polysaccharide deacetylase |
Protein accession | YP_001505217 |
Protein GI | 158312709 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000296204 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.358313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTTCCC CGAAATCCGC CACCACCGCG ACGTGCAAGG AAAACAACGA AGGAGCCGGA GCCACGGCTC GTTCGGCGGC GGCCGGCCGG GCACGCGGTC GCAACGGCGC GGCCAACCGG CCGCGGCCGG TCCCGATCCT GCGCTACGAC GTTGCCGCCG ATCCCACGGG CCACGCGCCA CGGAGATCGA CGGTGACGCC CGCCGTCTTC GCGCGCCACC GTCGCATGAT CCAGAGATCC CGTCGCCACT TCATGACCGT CGTGGACTAC TGCGCGGCGC TGCGCTCGGC CGACGTGGCG CCCGACGCGA TCGTCATCAC CTTCGACGGC GATCACGCGG AGACCTTCGT CGCGGCCCGT GAGCTGGCCG AACGAGGCCT CGCCGCCACC GTCTTCGTCA CCACCTCCCG GCTCGGGACA CCCGGCATGC TCAGCGAGGG GGACGTGCGC CGCCTGCACG AGATGGGCGT CGAGATCGGC GCGGCCGGGC ACAGCGGGCG CCGGCTCGAC GGTCTGCAGC GCTCCGACGT CACCGCGGAG ATCACGCTGA GCCGGCGGCG GCTGGCCGCG ATCACCGGCG ACGAGCCACG CTCGTTCGCC TACCCCCGCG GAGGCTGGGA CCCGACCGCG CGCCAGCTGG TGATCGCGGC CGGATGCGGC GGCGCGTGCG CCGTCGGGCA GGCGCTGTCC CACCAGGGTG ACGACCCGTT CGCCATTACC AGGCTGACCG TGGACGGCGG TCTCGCCGAC CACCGCGTGC GCGCCTGGCT GGACGGGATC GGCCGGACGC TGCCGCGCAC GGTCCCGGCC AGGCCGCGGA TCAGGGTCGT CCCGTCGGTG CTGGGGGCAC GGGCGCCGGC ACGGCTGGCC GGCGCGTACC GCCCCCGGTT CGCGCCGCAC ATCGCCGAGG GGACCCGTCT CCCCGCGCCC GGCCAACCCA CCCTCCCACC GCTGTGA
|
Protein sequence | MSSPKSATTA TCKENNEGAG ATARSAAAGR ARGRNGAANR PRPVPILRYD VAADPTGHAP RRSTVTPAVF ARHRRMIQRS RRHFMTVVDY CAALRSADVA PDAIVITFDG DHAETFVAAR ELAERGLAAT VFVTTSRLGT PGMLSEGDVR RLHEMGVEIG AAGHSGRRLD GLQRSDVTAE ITLSRRRLAA ITGDEPRSFA YPRGGWDPTA RQLVIAAGCG GACAVGQALS HQGDDPFAIT RLTVDGGLAD HRVRAWLDGI GRTLPRTVPA RPRIRVVPSV LGARAPARLA GAYRPRFAPH IAEGTRLPAP GQPTLPPL
|
| |