Gene Franean1_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1049 
Symbol 
ID5669463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1229420 
End bp1231687 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content72% 
IMG OID641239978 
Productglycogen branching enzyme 
Protein accessionYP_001505411 
Protein GI158312903 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.145385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.795016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGACC GCCTCGCCGG CGGCGCGCAC CACGACCCGC ATGGCGTGCT CGGCGCGCAC 
CCCACCCGGG GCTGGACAGT CATCCGCGCG CTGCGCCCCG ACGCCACCGC CGTCACCGTC
CTCGTCGACG GGACGCGCCA CCCGATGACG AGGCTGCACA GCGGTGGGAT CTTCGCCGTC
GCCGTGCCCG GCGGCATCCC CGACTACCGG TTCGAGGTCA CCTACCCCGA CTCGACCCAC
GTGGTCGACG ACCCGTACCA GCACCTGCCG ACGCTCGGCG AGATCGACCT GCACCTGATC
GGCGAGGGAC GTCACGAGCG GCTCTGGCAC GTCCTGGGCG CGCACGTGCG CCGCCTCGAC
CGCGGGCCGG GCGGCGGCGG CGGCGGCGGG ACAGTCGCGG GGGTCAGCTT CGCGGTCTGG
GCGCCGAACG CCCGCGGCGT GCGGGTAGTC GGCGACTTCG ACTTCTGGGA CGGCCGCGGC
TTCCCGATGC GCTCGCTGGG CTCGTCGGGC GTCTGGGAGC TGTTCGTCCC CGGTATCGGG
GCCGGCACCC GCTACAAGTA CGAGATCCTC GGCTCTGACG GCGTGTGGCG GCAGAAGGCG
GACCCGCTGG CCTTCGCCAC CGAGCAGCCC CCGGCCACCG CCAGCGTGGT CCACGAGTCG
AGCTACACCT GGGACGACGC GGCCTGGCTC GAGCGGCGCG CGAGCATCCC CTGGCACGCC
GCGCCGGTCA GCGTCTACGA GGTGCACCTG GCCTCGTGGC GGCGCGGCCT GAGCTACCTC
GAGCTGGCCG AGCAGCTCGT CGAGTACGTG CGCGAGAACG GCTTCACCCA CGTCGAGATG
CTGCCGGTGG CCGAGCACCC GTTCGGTGGC TCCTGGGGCT ACCAGGTCAC CTCGTACTAC
GCGCCGACGT CCCGCTTCGG CAGCCCGGAC GAGTTCCGGC ACCTCGTCGA CGCGCTGCAC
GCCGCCGGGA TCGGCGTGAT CGTCGACTGG GTGCCCGCCC ACTTCCCCCG GGACGCCTTC
GCGCTGGGCC GCTTCGACGG CACCCCGCTC TACGAGCACC CCGACCCGCG CCGCGGCGAG
CAGCCCGACT GGGGCACCTA CGTCTTCGAC TTCGGCCGGT CCGAGGTGCG CAACTTCCTC
GTCGCCAACG CGCTGTACTG GCTGGAGGAG TTCCACGTCG ACGGGCTGCG GGTCGACGCC
GTCGCCTCGA TGCTCTACCT CGACTACTCC CGGCCCTCGG GCGAGTGGGT GCCCAACGCG
CACGGCGGCC GGGAGAACCT GGAGGCCGTC TCGCTGCTGC AGGAGGTCAA CGCGACCGTC
TACCGGCGGG TGCCGGGAGC GATGATGATC GCCGAGGAGT CGACGGCCTG GCCCGGCGTG
ACCCGCCCGA CCCACCTCGG GGGCCTCGGC TTCGGCTTCA AGTGGAACAT GGGCTGGATG
CACGACACCC TCGACTACTC GTCGCGGGCG CCGCTGTTCC GCACCTACCA CCACCACCAG
ATGACGTTCT CGCTGATGTA CGCGTTCTCC GAGAACTTCG TCCTGCCGTT CAGCCACGAC
GAGGTCGTGC ACGGCAAGGG CTCGCTGCTG CGCAAGATGC CGGGCGACCG CCGGGCCCAG
CTGGCCAACC TGCGGGCGCT CTACGGCTAC ATGTGGGCGC ACCCCGGCAA GCAGCTGCTG
TTCATGGGCT GCGAGTTCGC CCAGGACAAC GAGTGGAGCG AGGCCACCTC GCTGGACTGG
CACCTGCTTG GCGAGGCCGG GCACGGCGGG GTGGCCCGCC TGGTCCGCGA CCTCAACCAC
ACCTACCGGG AGATCCCGGC CCTGTGGGAG CGCGACTCCG ATCCGAGCGG CTTCTCGTGG
ATCGACGCCA GCGACGCCGA GAACAACGTC TTCGCGTTCG TACGCTGGGG TGAGGACGGC
GAGCGGGCGC TGGCGTGCGT GACGAACTTC GCCGGCGTCG GCCAGGTCGG CTACCGCCTC
GGCTTCCCGT TCCCGGGCCG CTGGCGCGAG GTGCTCAACA CCGACGCGCT CGACTACGGC
GGCGGCGGCA TGGGCAACCT CGGCGGGATC ACCGCGGTGG CGGACCCGTC GCACGGCCTG
CCCGCCTCGG CGGCCCTGTC ACTTCCCCCG CTGGGCACCG TCTGGTTCGT GCACGAGCCG
GCCGACGGCG GACCTGCCGG CGACGGACCT TCTGCCGACG GGCAGACCGA CAACGAGCTG
ACCGCCAGCG AGCCGGCAGA CGGCGTGCGG TCCGACCCCG AGGCCTGA
 
Protein sequence
MLDRLAGGAH HDPHGVLGAH PTRGWTVIRA LRPDATAVTV LVDGTRHPMT RLHSGGIFAV 
AVPGGIPDYR FEVTYPDSTH VVDDPYQHLP TLGEIDLHLI GEGRHERLWH VLGAHVRRLD
RGPGGGGGGG TVAGVSFAVW APNARGVRVV GDFDFWDGRG FPMRSLGSSG VWELFVPGIG
AGTRYKYEIL GSDGVWRQKA DPLAFATEQP PATASVVHES SYTWDDAAWL ERRASIPWHA
APVSVYEVHL ASWRRGLSYL ELAEQLVEYV RENGFTHVEM LPVAEHPFGG SWGYQVTSYY
APTSRFGSPD EFRHLVDALH AAGIGVIVDW VPAHFPRDAF ALGRFDGTPL YEHPDPRRGE
QPDWGTYVFD FGRSEVRNFL VANALYWLEE FHVDGLRVDA VASMLYLDYS RPSGEWVPNA
HGGRENLEAV SLLQEVNATV YRRVPGAMMI AEESTAWPGV TRPTHLGGLG FGFKWNMGWM
HDTLDYSSRA PLFRTYHHHQ MTFSLMYAFS ENFVLPFSHD EVVHGKGSLL RKMPGDRRAQ
LANLRALYGY MWAHPGKQLL FMGCEFAQDN EWSEATSLDW HLLGEAGHGG VARLVRDLNH
TYREIPALWE RDSDPSGFSW IDASDAENNV FAFVRWGEDG ERALACVTNF AGVGQVGYRL
GFPFPGRWRE VLNTDALDYG GGGMGNLGGI TAVADPSHGL PASAALSLPP LGTVWFVHEP
ADGGPAGDGP SADGQTDNEL TASEPADGVR SDPEA