Gene Francci3_1587 details


Gene Information

Locus tag: Francci3_1587
Symbol:
ID: 3903722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1903403
End bp: 1905283
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 67%
IMG OID: 637878924
Product: polysaccharide deacetylase
Protein accession: YP_480692
Protein GI: 86740292
COG category: [G] Carbohydrate transport and metabolism; [R] General function prediction only
COG ID: [COG0726] Predicted xylanase/chitin deacetylase; [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
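
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either point, and the function names are illustrative only.

```python
# Cross-check of the coordinate and length fields in this record.
# Assumption: Start bp / End bp are 1-based, inclusive positions.
# Assumption: the final codon is a stop codon and is not counted
# in the protein length.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1903403, 1905283)
print(length)                     # 1881
print(protein_length_aa(length))  # 626
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.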


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0605374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.689393
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAATTCG CGATGAGCTA CCGTGAGAAA CCGAGCGTGG ACATCCCGGA CCATTCGGCG 
GGGCAGCCCG CCCGTAATGG GCGTTCGGTC CGGTTCAGCA TTGTGATGCC GACATACAAC
CGCCGAGACG TGGTATTGCA GAGTATTGCG GAACTCAGGT ATCTCGATAT GCCGTGGCCC
TGCGAATTGA TCGTCGTCGT CGACGGTTCC ACGGATGGCA CTGAGAAGGC TCTCCAGGCT
GTCTCTCTGC CGTTCTCCAA GCGAATCCTG TGGCAGGAGA ACGCGGGCGC GGCCGCAGCG
AGAAACCGTG GCGCCGCGGG CGCGCTCGGC GAGTTCATCC TCTTTCTCGA CGACGACATG
GCGGCGGACC GGCGGCTCCT CGTCGAGCAT GAGCGGCTTC TCGCCGCCGG CGCCGACGCC
GTGGTCGGCC ACATCCCGAT GCACCCCGCC TCGCCTCCTA CGGTGCTCAC CTACGGAGTC
GACCGGTGGG CCCGGGCACG CCGCCGGAGG CTGCTGCGGT CGGACGGGCG GCTGACCGTC
GCGGACCTGC TCTCCGGGCA GGTGTCGATG CGGGCGTCGA CCTTCGCCAG GCTCGGCGGT
TTCGACACGG GTTTTACCGC CGGCGGCAGC TTCGGCGGTG AGGACACCGA CTTCCTCTAC
CGGCTGCTGC AGAGCGGTGC GCGGGTTCGA TTCGCCGCCG ACGCCGTCTC CTACCAGCGT
TACGTCGTTT CGCCCGAGGC GCACCTGAGG CAGTGGACGC AGGCCGGGCA GGCCGATGCC
GCGCTTTCCC GAAAACATCC GGGTCTCGGT GACCTGCTCT TTGCCCAGCA TGGCGGGAAG
ACGATCAGAG GAAGCCTCAC CAGGGCCGCG GCGGCGGCAC CGCCCTGGGC GTCCCGCGTC
CCGGCGGGTC TCGTGCTCGG AAGAGTCAGC GCCGGGAAGG TGGACCGGCC CACCAGATGG
ATGTTCCTGG GACTACGCGA CTGCTCCTAC TGGCGGGGAA CCCACCAGCG CGGCGGTCTC
CTGCACGCAC CGGACGTTGG CCTGCGGATC CTTGCCTATC ATGCGATAGA GGACGTGTCG
GATCCCCTGC TGAGTAGGTA TGCGGTTCCC CCGCCACAGT TCCGGGCCCA GCTGACAGCC
CTGCTGGGCG CCGGCTTCAC CTTCGTCGGG GTCGATGAAC TCCTGCATCA TCTCGACGGC
CGGCCCGCCC GGCATCAGTC GCTGGTCCTC ACCTTCGACG ACGCCTATTC GAGTCTGTTC
GAACACGCCG TGCCCGTCCT TCGGGAACTT GGTATTCCCG CGACGGTTTT CGTGGTCACG
AAAGAGATCG GCGGTTGGAA CAGGTGGGAT GCGGTGAACG GCGCCGCCAG GCTGCCGCTC
CTCGACGCGA GCCGGCTGCG TGCCCTTCAC CAGGAAGGCT GGGAAGTGGC GGCGCACTCC
CGCACTCACG GGCAGTTGAC GAGGATGAGT GGGGCCGGCC TGTGGGACGA TCTTTCCGCG
GCGCGCGGCG ACCTCGCGGC CATCGGGCTG CCCGTGCCAC GCCTGTTCGC CTATCCCTAC
GGGGAACATG ACGCGCGCGT GCGCATGATG GTGAAGAAGG CGGGATACGA CGCGGCGTTC
GCGCTGCAAA CTCGGCGCGC CTTCCCCACG GCTCAGGACC GCTATGCCCT GCCCAGGATC
GAGGTGGAGC GCCATACCCG CGTTGATGCG CTCGTCGAGA CGGTGCGCAG ATCGCAGGTT
TACCGACGTC CCGACATCCG TCCCGACATC GAGCGGGAAC TGGGTGGGGC ATTGCGTCGA
GTTCTCCCGG TCCAACGGAC GATCCGCGGA CACACCAAAG ATCACGAGGC CGAAGATCAC
GAGGCGAGGA GCGTCGGATG A
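
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation such as the GC content reported in the Gene Information section. The following is a rough sketch of one way to do that; the helper names are hypothetical, and how the database itself computes or rounds the 67% figure is not stated in the record.

```python
# Sketch: clean a blocked nucleotide sequence and compute its GC fraction.
# clean_sequence and gc_fraction are illustrative helper names, not part
# of any database tool.

def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from this record.
first_row = ("GTGGAATTCG CGATGAGCTA CCGTGAGAAA "
             "CCGAGCGTGG ACATCCCGGA CCATTCGGCG")
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence instead of the first row gives the value to compare against the GC content field.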
 
Protein sequence
MEFAMSYREK PSVDIPDHSA GQPARNGRSV RFSIVMPTYN RRDVVLQSIA ELRYLDMPWP 
CELIVVVDGS TDGTEKALQA VSLPFSKRIL WQENAGAAAA RNRGAAGALG EFILFLDDDM
AADRRLLVEH ERLLAAGADA VVGHIPMHPA SPPTVLTYGV DRWARARRRR LLRSDGRLTV
ADLLSGQVSM RASTFARLGG FDTGFTAGGS FGGEDTDFLY RLLQSGARVR FAADAVSYQR
YVVSPEAHLR QWTQAGQADA ALSRKHPGLG DLLFAQHGGK TIRGSLTRAA AAAPPWASRV
PAGLVLGRVS AGKVDRPTRW MFLGLRDCSY WRGTHQRGGL LHAPDVGLRI LAYHAIEDVS
DPLLSRYAVP PPQFRAQLTA LLGAGFTFVG VDELLHHLDG RPARHQSLVL TFDDAYSSLF
EHAVPVLREL GIPATVFVVT KEIGGWNRWD AVNGAARLPL LDASRLRALH QEGWEVAAHS
RTHGQLTRMS GAGLWDDLSA ARGDLAAIGL PVPRLFAYPY GEHDARVRMM VKKAGYDAAF
ALQTRRAFPT AQDRYALPRI EVERHTRVDA LVETVRRSQV YRRPDIRPDI ERELGGALRR
VLPVQRTIRG HTKDHEAEDH EARSVG
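
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming translation table 11 (as listed in this record) with the standard codon assignments and with an alternative start codon such as GTG read as methionine. The set of start codons used here is a simplified subset, and the names are illustrative only.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G
# (third base varying fastest). Translation table 11 uses these same
# assignments and differs mainly in which codons may initiate.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the initiator codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(blocked: str) -> str:
    """Translate a (possibly space-blocked) CDS, stopping at a stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = ("GTGGAATTCG CGATGAGCTA CCGTGAGAAA "
             "CCGAGCGTGG ACATCCCGGA CCATTCGGCG")
print(translate(first_row))  # MEFAMSYREKPSVDIPDHSA
```

The output matches the first 20 residues of the protein sequence above, including the leading M encoded by the GTG start codon.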