Gene Francci3_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1350 
Symbol 
ID3906563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1622543 
End bp1624873 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content67% 
IMG OID637878683 
Productglycogen debranching protein GlgX 
Protein accessionYP_480456 
Protein GI86740056 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02100] glycogen debranching enzyme GlgX 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC CCCGGCCCGC GAATACCGTT CGGCCCGTAG GCTTCGGTCG TATGTCACAG 
GTATGGCCCG GCAGCCCCTA TCCCCTGGGC GCCACATACG ACGGATCGGG CACAAATTTC
GCGATCTTCT CCGAGGTCGC CGACCGGGTC CAGCTGTGCC TGTTCGACGA CGCGGGCAAC
GAGGAGCGGA TCGACCTGCG GGAACGGGAC GCCTTCGTCT GGCACGCCTA CCTTCCCACC
GTGGGACCGG GCCAGCGCTA CGGCTATCGG GTCCACGGGT CCTACGACCC CGCCCGAGGG
CTGCGCTGCA ACTCCACCAA GCTCCTCCTC GACCCGTACG CGAAGGCGGT CGACGGGGAG
GTCGCCTGGG ACCAGGCGGT GTTCCCCTAC ACCTTCGGCG ACCCCGACAG CGTCAACGAC
GCCGACTCCG GCCCCCACAT GATGAAGTCG GTGGTAATCA GCCCGTTCTT CGACTGGAAC
GGGGACCGCC CGCCACGCCG GCCCTACAAC GAGTCGGTGA TCTACGAGGC GCATGTCCGG
GGGCTGACCA AGAACCATCC CGGCCTGCCG GAGGAGTACC GCGGGACCTA CGCGGGAGTC
GCCCACCCGG TGATGATCGA CCACTATCGC AAGCTCGGCG TGACCGCGAT CGAGCTGCTG
CCGGTGCACC AGTTCGTCCA TGACGAGCAT CTGGTCAGCC GCGGGCTGCG CAACTACTGG
GGCTACAACT CCATCGCGTT CCTGGCGCCG CACAGCGGCT ACTCCGCCTC CGGCGGCCAC
GGCCGCCAGG TGCAGGAGTT CAAGGGAATG GTCAAGAACC TGCACGAGGC CGGAATCGAG
GTGATCCTCG ACGTCGTCTA CAACCACACC GCCGAGGGCA ACCACATGGG TCCGATGCTG
TGCTTCCGCG GCATCGACAA CAGCGCCTAC TACCGGCTCG TGGACAACCG TCCCCAGTAC
TACATGGACT ACACGGGCAC CGGCAACAGC CTGCGGGTAC GACACCCGCA CGTGCTACAA
CTGATCATGG ACTCGCTGCG TTACTGGGTC ACCGACATGC ACGTGGACGG GTTCCGGTTC
GATCTCGCGG CGACCCTGGC CCGGGAGTTC TACGACGTCG ACCGGCTGTC ATCGTTCTTC
GACCTCGTCC AGCAGGATCC GGTGGTCTCC CAGGTCAAGC TGATCGCGGA ACCGTGGGAC
CTCGGCGAGG GCGGTTACCA GGTGGGGAAC TTCCCCCCGC TGTGGACCGA GTGGAACGGT
AAATACCGCG ACACCGTCCG CGGCTTCTGG CGCGGGCAGG ACCACGGGAT CGCCGAGTTC
GCCTCCCGGC TGACCGGATC GAGCGACCTG TACGAGAACA GCGGGCGCCG GCCGTGGGCA
TCGATCAACT TCATCACCGC GCACGACGGG TTCACCCTGC ACGACCTGGT CTCCTACAAC
GAAAAACACA ACGAAGCCAA CGGCGAGGAC AATCGGGATG GCTCCGACGA CAACCGGTCG
TGGAACTGCG GAGTCGAAGG GCCCACCGAC GACCCGACCG TCCTGTCCCT GCGCGCCGCG
CAGACCCGTA ACCTGCTGAC GACGCTGTTG CTGTCCCAGG GGGTGCCGAT GCTGGTCGCC
GGAGACGAGA TGGGACGCAC CCAGCAGGGC AACAACAACG CGTACTGCCA GGACAACCCG
ATCTCCTGGC TGGACTGGTC CGACGCCGAG CGCAACGCCG ACCTGATCGA GTTCACCGGC
ATGCTGTCAA GGCTGCGCCA TGATCATCCG ATCTTCCGCC GCCGCCGGTT CTTCCAGGGC
AAGTCGCTGC GCGGGCAGGG CGGCACGACG CACGGCGCGG CCGAGGCGGT CACCGGCGGG
CAGGCGGGCG GCCAGGACGC GGACGGCAGT GGCGGGCGCT CCGGCGACGA CCCCGCCGTC
AAGGACGACC CCGCCGTCAA GGACATCGCC TGGCTCCGCC CGGACGGCAC GGAGATGTCC
GACAGCGACT GGGAGTCCGG GACGGCGCGA TCACTCGGCG TCTACCTCAA CGGTGAGGGG
ATACCGGACC CCGACGCGCT GGGCCAGGCC ATCATCGACG ACTCGTTCCT GCTGTTCTTC
AATGCCCACC ATGAACCGGT CGGGTTCCAG GCGCCGGCGA CCAGTTTCGG CAGTTCGTGG
GAGATCGTCG TCGACACCCG GGCGTCGACG GCCGAGGTCG ACGCCCGGCT CAGCGACACC
GCGCTGCTCC ACTCGGCCCT CGATCCCGCG AGCATGGGAA GGGCCGTGAA GGCCGGCGAT
CCCATCGAAC TCGACGCCCG CTCGACCGTC GTGCTGCGCC GTGTCAGCTG A
 
Protein sequence
MKRPRPANTV RPVGFGRMSQ VWPGSPYPLG ATYDGSGTNF AIFSEVADRV QLCLFDDAGN 
EERIDLRERD AFVWHAYLPT VGPGQRYGYR VHGSYDPARG LRCNSTKLLL DPYAKAVDGE
VAWDQAVFPY TFGDPDSVND ADSGPHMMKS VVISPFFDWN GDRPPRRPYN ESVIYEAHVR
GLTKNHPGLP EEYRGTYAGV AHPVMIDHYR KLGVTAIELL PVHQFVHDEH LVSRGLRNYW
GYNSIAFLAP HSGYSASGGH GRQVQEFKGM VKNLHEAGIE VILDVVYNHT AEGNHMGPML
CFRGIDNSAY YRLVDNRPQY YMDYTGTGNS LRVRHPHVLQ LIMDSLRYWV TDMHVDGFRF
DLAATLAREF YDVDRLSSFF DLVQQDPVVS QVKLIAEPWD LGEGGYQVGN FPPLWTEWNG
KYRDTVRGFW RGQDHGIAEF ASRLTGSSDL YENSGRRPWA SINFITAHDG FTLHDLVSYN
EKHNEANGED NRDGSDDNRS WNCGVEGPTD DPTVLSLRAA QTRNLLTTLL LSQGVPMLVA
GDEMGRTQQG NNNAYCQDNP ISWLDWSDAE RNADLIEFTG MLSRLRHDHP IFRRRRFFQG
KSLRGQGGTT HGAAEAVTGG QAGGQDADGS GGRSGDDPAV KDDPAVKDIA WLRPDGTEMS
DSDWESGTAR SLGVYLNGEG IPDPDALGQA IIDDSFLLFF NAHHEPVGFQ APATSFGSSW
EIVVDTRAST AEVDARLSDT ALLHSALDPA SMGRAVKAGD PIELDARSTV VLRRVS