Gene Information |
Locus tag | Francci3_1587 |
Symbol | |
ID | 3903722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1903403 |
End bp | 1905283 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637878924 |
Product | polysaccharide deacetylase |
Protein accession | YP_480692 |
Protein GI | 86740292 |
COG category | [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase; [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
| |
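The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below shows one way to cross-check them. The function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both assumptions agree with the values in this record but are not stated by it.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(1903403, 1905283)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the two results match the listed Gene Length (1881 bp) and Protein Length (626 aa).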
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0605374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.689393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAATTCG CGATGAGCTA CCGTGAGAAA CCGAGCGTGG ACATCCCGGA CCATTCGGCG GGGCAGCCCG CCCGTAATGG GCGTTCGGTC CGGTTCAGCA TTGTGATGCC GACATACAAC CGCCGAGACG TGGTATTGCA GAGTATTGCG GAACTCAGGT ATCTCGATAT GCCGTGGCCC TGCGAATTGA TCGTCGTCGT CGACGGTTCC ACGGATGGCA CTGAGAAGGC TCTCCAGGCT GTCTCTCTGC CGTTCTCCAA GCGAATCCTG TGGCAGGAGA ACGCGGGCGC GGCCGCAGCG AGAAACCGTG GCGCCGCGGG CGCGCTCGGC GAGTTCATCC TCTTTCTCGA CGACGACATG GCGGCGGACC GGCGGCTCCT CGTCGAGCAT GAGCGGCTTC TCGCCGCCGG CGCCGACGCC GTGGTCGGCC ACATCCCGAT GCACCCCGCC TCGCCTCCTA CGGTGCTCAC CTACGGAGTC GACCGGTGGG CCCGGGCACG CCGCCGGAGG CTGCTGCGGT CGGACGGGCG GCTGACCGTC GCGGACCTGC TCTCCGGGCA GGTGTCGATG CGGGCGTCGA CCTTCGCCAG GCTCGGCGGT TTCGACACGG GTTTTACCGC CGGCGGCAGC TTCGGCGGTG AGGACACCGA CTTCCTCTAC CGGCTGCTGC AGAGCGGTGC GCGGGTTCGA TTCGCCGCCG ACGCCGTCTC CTACCAGCGT TACGTCGTTT CGCCCGAGGC GCACCTGAGG CAGTGGACGC AGGCCGGGCA GGCCGATGCC GCGCTTTCCC GAAAACATCC GGGTCTCGGT GACCTGCTCT TTGCCCAGCA TGGCGGGAAG ACGATCAGAG GAAGCCTCAC CAGGGCCGCG GCGGCGGCAC CGCCCTGGGC GTCCCGCGTC CCGGCGGGTC TCGTGCTCGG AAGAGTCAGC GCCGGGAAGG TGGACCGGCC CACCAGATGG ATGTTCCTGG GACTACGCGA CTGCTCCTAC TGGCGGGGAA CCCACCAGCG CGGCGGTCTC CTGCACGCAC CGGACGTTGG CCTGCGGATC CTTGCCTATC ATGCGATAGA GGACGTGTCG GATCCCCTGC TGAGTAGGTA TGCGGTTCCC CCGCCACAGT TCCGGGCCCA GCTGACAGCC CTGCTGGGCG CCGGCTTCAC CTTCGTCGGG GTCGATGAAC TCCTGCATCA TCTCGACGGC CGGCCCGCCC GGCATCAGTC GCTGGTCCTC ACCTTCGACG ACGCCTATTC GAGTCTGTTC GAACACGCCG TGCCCGTCCT TCGGGAACTT GGTATTCCCG CGACGGTTTT CGTGGTCACG AAAGAGATCG GCGGTTGGAA CAGGTGGGAT GCGGTGAACG GCGCCGCCAG GCTGCCGCTC CTCGACGCGA GCCGGCTGCG TGCCCTTCAC CAGGAAGGCT GGGAAGTGGC GGCGCACTCC CGCACTCACG GGCAGTTGAC GAGGATGAGT GGGGCCGGCC TGTGGGACGA TCTTTCCGCG GCGCGCGGCG ACCTCGCGGC CATCGGGCTG CCCGTGCCAC GCCTGTTCGC CTATCCCTAC GGGGAACATG ACGCGCGCGT GCGCATGATG GTGAAGAAGG CGGGATACGA CGCGGCGTTC GCGCTGCAAA CTCGGCGCGC CTTCCCCACG GCTCAGGACC GCTATGCCCT GCCCAGGATC GAGGTGGAGC GCCATACCCG CGTTGATGCG CTCGTCGAGA CGGTGCGCAG ATCGCAGGTT TACCGACGTC CCGACATCCG TCCCGACATC GAGCGGGAAC TGGGTGGGGC ATTGCGTCGA 
GTTCTCCCGG TCCAACGGAC GATCCGCGGA CACACCAAAG ATCACGAGGC CGAAGATCAC GAGGCGAGGA GCGTCGGATG A
|
Protein sequence | MEFAMSYREK PSVDIPDHSA GQPARNGRSV RFSIVMPTYN RRDVVLQSIA ELRYLDMPWP CELIVVVDGS TDGTEKALQA VSLPFSKRIL WQENAGAAAA RNRGAAGALG EFILFLDDDM AADRRLLVEH ERLLAAGADA VVGHIPMHPA SPPTVLTYGV DRWARARRRR LLRSDGRLTV ADLLSGQVSM RASTFARLGG FDTGFTAGGS FGGEDTDFLY RLLQSGARVR FAADAVSYQR YVVSPEAHLR QWTQAGQADA ALSRKHPGLG DLLFAQHGGK TIRGSLTRAA AAAPPWASRV PAGLVLGRVS AGKVDRPTRW MFLGLRDCSY WRGTHQRGGL LHAPDVGLRI LAYHAIEDVS DPLLSRYAVP PPQFRAQLTA LLGAGFTFVG VDELLHHLDG RPARHQSLVL TFDDAYSSLF EHAVPVLREL GIPATVFVVT KEIGGWNRWD AVNGAARLPL LDASRLRALH QEGWEVAAHS RTHGQLTRMS GAGLWDDLSA ARGDLAAIGL PVPRLFAYPY GEHDARVRMM VKKAGYDAAF ALQTRRAFPT AQDRYALPRI EVERHTRVDA LVETVRRSQV YRRPDIRPDI ERELGGALRR VLPVQRTIRG HTKDHEAEDH EARSVG
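The sequence fields are stored in space-separated blocks of ten characters, and the gene begins with GTG although the protein begins with M. Under translation table 11, GTG can act as an alternative start codon that is read as methionine. The sketch below shows a few sanity checks one might run on such a field. The helper names are hypothetical, and the start codon set is limited to the common bacterial starts (ATG, GTG, TTG) as a simplifying assumption; table 11 lists further, rarer ones.

```python
# Common bacterial start codons (simplifying assumption; table 11 allows more).
START_CODONS = {"ATG", "GTG", "TTG"}
# Standard stop codons, which table 11 shares with the standard code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(field: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_codon(seq: str) -> bool:
    """True if the first codon is one of the assumed start codons."""
    return seq[:3] in START_CODONS


def ends_with_stop(seq: str) -> bool:
    """True if the final codon is a stop codon."""
    return seq[-3:] in STOP_CODONS


# First and last blocks of the gene sequence above, used as a small example.
head = clean_sequence("GTGGAATTCG CGATGAGCTA")
tail = clean_sequence("GCGTCGGATG A")
print(has_start_codon(head), ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field listed in the table.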