Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5164 |
Symbol | |
ID | 5673498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6188175 |
End bp | 6192788 |
Gene Length | 4614 bp |
Protein Length | 1537 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641244018 |
Product | glycogen debranching enzyme GlgX |
Protein accession | YP_001509428 |
Protein GI | 158316920 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases |
TIGRFAM ID | [TIGR02100] glycogen debranching enzyme GlgX |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0685376 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCCCCA CGGCGCAGCG GCTCGAGGAC GTCGTCCCAG GCGGTTCCCA GCAGCCGGTG CAGGCCCCGC GAGAGGGTCT CCCGCGCCTC GCCGCGCTCC ATCAACGCCA GCAGCGCCCC CTCGCCGACG ACGACGTCCC CGTTGGCGGC GATCGCGGCC CGGTGCAGGC CGAGATCGGG AGTGAAGGAG TAGCGCTCCG CGTCGCAGCC GGGCGAGGGC CCCTCGACGG CCTCGAACGT GACATGACCC AGCCGGCGCA GGACGGCGGC GAGCCGCCCG GCGGTTCCCG GGCTCGCCCG CCATTCCAGA CCGGCATACA GGAAACCGGG CCGGCCTGGC TGCGCCGTCC AGGTCAGGGA GACCGGGGCG GACAGGGCAC CGGACACGGC GAACTCCAGG TGCGGGCACA GTGCCGGCGG ACATGCGAGC ACCGCCAACT TGCCCGGCAC GGCGAAGGAC GACGAGCCGG AAGACAGCTC ACGAGCTGTC GAGGACATTG CCACCACGCG GGACCTCCGT CGACGAGGAA CGCCTTCCCC TGCGTCTCGT CACGTCGGAC CCCTCGAATA CCAGTCGATC GTGCCATGAC CGGCGTTCGA GGACTAGACC CACCCCCTCC CCTGGTGCCA CATGTGGTGA CCCCCACCCC GGCGGAAGGG CTGGTCCGCG GGCCGCGCGG GCCCAACGTC CCGGCCGGCC ACGGGCGATG GTCGGGTCCG CGCCGGGCCT GGTCCGCCCG TCCCGCCCCG TCAGTCGCGG GCAGGCCCCG GACCGGCCAC GTGCCCCGGA TCCCGGCTGT ACGACCCCGC TGGTCCTGCC GTGCTGACTG GTCCTGCCGT GCTGACTGGT CCCGCCGTGC TGACTGGTCC CGCCGTGCTG ACTGGTCCGA CCGAGCCGGT CGGACCAGCC GGACCAGATG GGCCGGCTGC CCGGCCACCG ACGTCGAACA GGCCGCCTGG CCCGGCCAGG CCACCGAGGG CGGCGGTGAG CTCGCTGGGG ACGATCCACA GCTTGCTGGC CTGGCCCTCT GCCAGCCGCG GGAGCATCTG CAGGTACTGG TAGGCGAGGA GCTTCTGGTC CGCGTCGCCC TCGTGGATGG CGCCGAAGAC GGTGCCGATG GCCTTCGCCT CGCCCTCGGC GGTGAGGATC TGGGCCTCCC GGTGGCCTTC CGCCCGCAGG ATCGCGGCCT GCTTCTCGCC CTCGGCCCGC AGGATCTCGC TCGCCTTGAC ACCCTCGGCG GTCAGGATCG CGGCCCGCCG GTCGCGTTCG GCCCGCATCT GCTTCTCCAT CGAGTCCTGG ATGGAGCGGG GCGGGTCGAT GGCCTTGAGC TCCACCCGGT TGACCCGGAT GCCCCACCGG CCGGTCGCGT CGTCGAGCAC CCCGCGCAGC TGCCCGTTGA TCTGGTCGCG CGAGGTGAGC GTGCCCTCGA GGTTCATCCC GCCGATCACG TTGCGCAGCG TGGTCACGGT GAGCTGCTCG ATGGCGCGGA TGAAGTCGGC GATCTCGTAG GTGGCCGCGC GCGGGTCGGT GACCTGGAAG TAGATGACGG TGTCGATCCC GACGACGAGG TTGTCCTCGG TGATGACCGG CTGGGGCGGG AAGCTGACCA CCTGTTCCCG CAGGTCGATC CGCTCCCGGA TCCGGTCGAC GATCGGCAGC ACGATCGCCA GGCCCGGCGT CAGCGTGCGG TGGTAGCGCC CGAGACGCTC GACCACCATC GCCCGGGCCT GCGGCACGAT CCGCACGGTC CGCGCCAGGA AGATCAGGAC CACCAGTGCC GCGACCGCGG CGGCGATGAC TCCGCCCATG TTCACTCCTC AAGCTCCCGA CGCGCGGTGT CGCCGCGCTC GACGTCCTCG TGCACGATCA CGTGGGCACC GTCGACGCGC AGCACCCGCA CGGTGGCCCC GATCTCGAAC TCGTCCGTCG CCGGGTACGA CCGCGCGGAC CAGACCTCGC CGGCGAGCCG GACTCGGCCG GTCGACCCGT CGACCCGTTC CAGGACGACG GCCGGCGTTC CGACCAGGGC GTCGGTCCCG GTACGCAGTA CGGGGCCGCT GGTGAGGTGG CGGCGGGCGA CCGGCCGCAG GCCGAACGTG AGCCCGCCCG CGGCGACCGC GAACCCCACG AGCTGCACGA TCAGGTCGGC CCCCAGGACT GCGAGGCCCG CCCCCACCAG CGCTGCGAGC GCGAACATGA CCAGGACCAG GTCGAGTGTG AGCAACTCGC CGACGACGAG CGCGCCGGCA ACGATGATCC ACACAACCTC GTCCGCCATG ACCATCACCC TAATAAGATC AACTATCCTG CGATCAACGA AACCTCAATG GCGGCAGTGG ACGAACCGGT CGGGGTTCGC GCGCCCCCAA GTCACCGACG GGGCGCGAGC CGTAGGCTTC GTGCGTATGT CGCAGGTCTG GCCTGGCAGC CCTTATCCCC TCGGTGCCAC CTTTGACGGA TCGGGGACGA ACTTCGCGAT CTTCTCGGAG GTCGCCGAGC GTGTCGAGCT CTGTCTGTTC GACGCCGTGG GCGCCGAGCG GAGGATCGAG CTGCGGGAGC GCGACGCGTT CGTGTGGCAC GCCTACCTGC CGACGGTCCT GCCGGGACAG CGGTACGGAT ACCGGGTCCA CGGTCCCCAT GATCCCGCGC GTGGCCACCG ATGCAACCCC AACAAGCTGC TGCTCGACCC GTACGCCAAG GCGGTGGACG GCGAGGTCGA CTGGAACCAG GCCGTCTTCG GCTACGACTT CGGCGATCCC GACAGCGTGA ACACGACCGA CTCCGCGCCG CACATGATGA AGTCCGTGGT GATCAGCCCG TTCTTCGACT GGAACGGCGA CCGGCCACCG CGCCGCCCCT ACAGCGAGAC CGTGATCTAC GAGGCGCACG TGCGCGGCCT GACCATGCGG CACGAGGGCC TGCCCGAGGA GTACCGGGGA ACCTACGCGG GCGTGGCCCA CCCGGTGATG ATCGAGCACT ACCGCCGCCT CGGCGTGACG GCGATCGAGC TCCTGCCGGT GCACCAGTTC GTCCACGACG AGCACCTGGT CAGCCGTGGG CTGCGCAACT ACTGGGGCTA CAACTCCATC GCCTTCCTGG CGCCGCACAA CGGCTACTCC GCCGCGGGCG GCGACGGCCG GCAGGTGCAG GAGTTCAAGG GCATGGTCCG CAACCTGCAC GAGGCGGGCA TCGAGGTGAT CCTCGACGTG GTCTACAACC ACACCGCCGA GGGCAACCAC ATGGGCCCCA TGCTGTGCTT CCGGGGCATC GACAACGCGG CCTACTACCG GCTGGTCGAC GACGACCCGC AGTACTACAT GGACTACACC GGCACCGGCA ACAGCATGCG GGTCCGCCAT CCACACGTGC TTCAGCTGAT CATGGACTCG CTGCGCTACT GGGTCACCGA GATGCACGTG GACGGCTTCC GCTTCGACCT GGCGGCGACC CTGGCCCGGG AGTTCTACGA CGTCGACCGG CTGTCGTCGT TCTTCGACCT CGTCCAGCAG GACCCGGTCG TCTCCCAGGT CAAGCTGATA GCCGAGCCGT GGGACCTGGG TGAGGGTGGC TACCAGGTGG GCAACTTCCC CCCGCTGTGG ACGGAGTGGA ACGGGAAGTA CCGCGACACC GTCCGCGACT TCTGGCGCGG GCAGGACCAC GGCATCGCCG AGTTCGCCTC CCGGCTCACC GGCTCGTCCG ACCTCTACGA GGACAGCGGC CGGCGCCCCT GGGCCTCGAT CAACTTCGTC ACCGCCCACG ACGGCTTCAC CCTGCACGAC CTGGTCTCCT ACAACGACAA GCACAACGAG GCCAACGGCG AGGAGAACCG GGACGGCTCC GACGACAACC GGTCCTGGAA CTGCGGCGTG GAGGGTCCGA CCGACGACGT CGCGGTCAAC CGGCTGCGCG ACGCGCAGAC CCGCAACCTG CTCGCCACCC TGCTGCTCTC GCAGGGGGTG CCCATGCTGG TAGCCGGCGA CGAGATGGGC CGAAGCCAAC AGGGGAACAA CAACGCCTAC TGCCAGGACA GCCCGATCTC CTGGCTGGAC TGGTCCGACG CCGAGCGCAA CGCCGGTCTC ATCGATTTCA CCGCCCAGCT CTCACATCTT CGGCGGACCC ATCCGGTCTT CCGGCGGCGG CGGTTCTTCC AGGGCGAGTC GATCCGCGGC TCGGCCGGCG GCGAGGACAA CACCGGCAGC CGACCCGACG GCGCGGCCGT CGTCGACCTG GCCGCCAAGG ACATAGTCTG GCTTCGTCCC GACGGTAGCG AGATGTCCGA CACCGACTGG GAGTCCGGTA CGGCGCGCTC GCTCGGCTGT TTCCTGAACG GGCACGGCAT CCCGGATCCC AACACGCTCG GCGAGGCGAT CGTCGACGAC TCGTTCCTGC TGTTCTTCAA CGCCCACCAC GAGCCGATCC AGTTCCGGGT ACCACCGGTC GACTTCGGCA CCACCTGGGA GATCATCGTG GACACGCGCT CGTCCAGCGC CGAGATCTCG GCAGTGCTCG GTGGCACCGA TCTCGATCCG GTCGACGTCG ACCGGGTTGT CAAGGCCGAG GACCCGCTCG AGGTGGACGC CCGCGCCACG CTTGTGCTCC GCCGTGTCAA CTGA
|
Protein sequence | MRPTAQRLED VVPGGSQQPV QAPREGLPRL AALHQRQQRP LADDDVPVGG DRGPVQAEIG SEGVALRVAA GRGPLDGLER DMTQPAQDGG EPPGGSRARP PFQTGIQETG PAWLRRPGQG DRGGQGTGHG ELQVRAQCRR TCEHRQLARH GEGRRAGRQL TSCRGHCHHA GPPSTRNAFP CVSSRRTPRI PVDRAMTGVR GLDPPPPLVP HVVTPTPAEG LVRGPRGPNV PAGHGRWSGP RRAWSARPAP SVAGRPRTGH VPRIPAVRPR WSCRADWSCR ADWSRRADWS RRADWSDRAG RTSRTRWAGC PATDVEQAAW PGQATEGGGE LAGDDPQLAG LALCQPREHL QVLVGEELLV RVALVDGAED GADGLRLALG GEDLGLPVAF RPQDRGLLLA LGPQDLARLD TLGGQDRGPP VAFGPHLLLH RVLDGAGRVD GLELHPVDPD APPAGRVVEH PAQLPVDLVA RGERALEVHP ADHVAQRGHG ELLDGADEVG DLVGGRARVG DLEVDDGVDP DDEVVLGDDR LGREADHLFP QVDPLPDPVD DRQHDRQARR QRAVVAPETL DHHRPGLRHD PHGPRQEDQD HQCRDRGGDD SAHVHSSSSR RAVSPRSTSS CTITWAPSTR STRTVAPISN SSVAGYDRAD QTSPASRTRP VDPSTRSRTT AGVPTRASVP VRSTGPLVRW RRATGRRPNV SPPAATANPT SCTIRSAPRT ARPAPTSAAS ANMTRTRSSV SNSPTTSAPA TMIHTTSSAM TITLIRSTIL RSTKPQWRQW TNRSGFARPQ VTDGARAVGF VRMSQVWPGS PYPLGATFDG SGTNFAIFSE VAERVELCLF DAVGAERRIE LRERDAFVWH AYLPTVLPGQ RYGYRVHGPH DPARGHRCNP NKLLLDPYAK AVDGEVDWNQ AVFGYDFGDP DSVNTTDSAP HMMKSVVISP FFDWNGDRPP RRPYSETVIY EAHVRGLTMR HEGLPEEYRG TYAGVAHPVM IEHYRRLGVT AIELLPVHQF VHDEHLVSRG LRNYWGYNSI AFLAPHNGYS AAGGDGRQVQ EFKGMVRNLH EAGIEVILDV VYNHTAEGNH MGPMLCFRGI DNAAYYRLVD DDPQYYMDYT GTGNSMRVRH PHVLQLIMDS LRYWVTEMHV DGFRFDLAAT LAREFYDVDR LSSFFDLVQQ DPVVSQVKLI AEPWDLGEGG YQVGNFPPLW TEWNGKYRDT VRDFWRGQDH GIAEFASRLT GSSDLYEDSG RRPWASINFV TAHDGFTLHD LVSYNDKHNE ANGEENRDGS DDNRSWNCGV EGPTDDVAVN RLRDAQTRNL LATLLLSQGV PMLVAGDEMG RSQQGNNNAY CQDSPISWLD WSDAERNAGL IDFTAQLSHL RRTHPVFRRR RFFQGESIRG SAGGEDNTGS RPDGAAVVDL AAKDIVWLRP DGSEMSDTDW ESGTARSLGC FLNGHGIPDP NTLGEAIVDD SFLLFFNAHH EPIQFRVPPV DFGTTWEIIV DTRSSSAEIS AVLGGTDLDP VDVDRVVKAE DPLEVDARAT LVLRRVN
|
| |