Gene Francci3_3681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3681 
Symbol 
ID3905365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4416474 
End bp4418912 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content70% 
IMG OID637881007 
Productglycogen branching enzyme 
Protein accessionYP_482762 
Protein GI86742362 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.024212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAACG GCGACGTCAA CAACGGCACG GCGCGGACGG GACGCCTGGC CACCGGGCTC 
GCCCCCGACC GACCGAACCC CCGAGGGGCG GCCACCGCCG TCCGCAACGG CGGTGGCCGG
GTGGCGCTGG CGGACCGCGA CGCATCGTCC GCCAGCGCCC AGCCTGGACA GACCGCCGAC
GATCCCGCCG TGCCATCGGC CCCACCATCG GCCCCACCAT CGGCCCTGCC CGCAGACCCG
CCCGCGGTTG CGCCCGCGGT CTCGGTCGAC CTGCCCTACG ACGACCTGGA GCGGTTGGTC
AGCGGGGCGC ATCACGACCC GCACGCGCTG CTCGGGGCGC ACCCGCATCC CGGGAGCGAC
GCCACGGTCG TCCGGGTGCT GCGCCCCGAT GCGAGGGCCG TGACCGTCCT CGTCGGGCCG
GCCCGCTACC CGGCCACCCG GCTGCACAGC GGCGGGGTGT TCGGAGTGGC CGTGCCGGGA
ATGCTGCCCG ACTACCGCAT CGAGGTGACC TACCCGGACG GCCCGTACCT CATCGACGAC
CCCTACCGGC ACCTGCCCAC CCTCGGTGAG ATGGACCTCC ATCTGATCAT CGAGGGCCGG
CACGAACAGC TCTGGAAGGT GCTCGGGGCG CATCCGCGCG CGCTGACGAC GCCGGGCGGC
GCCACAGTCA CCGGGGTCAG CTTCGCCGTG TGGGCCCCCT CCGCGCGGGG AGTGCGGCTC
GTCGGCGATT TCGACTTCTG GGACGGGCGT GCCTTCCCGA TGCGCTCGCT CGGCCGCTCC
GGGATCTGGG AGCTGTTCGT CCCCGGCGCC GGCACCGGCG CCCGGTACAA GTACGAGATC
CTCGGCTTCG ACGGGATCTG GCGGCAGAAG GCGGATCCGC TGGCGTTCCA CACCGAGGTG
CCGCCGGCCA CCGCGAGCGT GGTGTTCGCC TCCGACTTCA CCTGGAACGA CGGCGCGTGG
CTCGACCGGC GGGCCCGGAC GGCCTGGCGC ACCGAGCCGG TGAGCGTCTA CGAGGTCCAC
CTGGGCTCCT GGCGGCGCGG GTTGTCCTAT CGGGAGCTCG CTGAGGAACT GACCGCGTAC
GTCGTCGAGA ACGGCTTCAC CCACGTGGAG ATGCTGCCCG TGGCCGAGCA TCCCTTCGGC
GGCTCCTGGG GCTACCAGGT GTCGGCCTAC TACGCCCCGA CGGCCCGCTT CGGCAGCCCC
GACGAGTTCC GCCACCTTGT GGACACGCTG CACCGCGCCG GCATCGGGGT GATCGTCGAC
TGGGTGCCGG CGCACTTCCC CCGGGACACC TGGGCGCTGG GCCGCTTCGA CGGCACCCCC
CTGTACGAAC ATCCCGATCC ACGCCGCGGT GAACAGCCCG ACTGGGGTAC CTACGTCTTC
GATCTGGGCC GCCCCGAGGT ACGCAACTTC CTCGTGGCAA ACGCCCTGTA CTGGTTCGAG
GAATTCCACA TCGACGGGCT ACGCGTCGAC GCGGTCGCCT CGATGCTGTA CCTCGACTAC
TCCCGGCCGG AGGGCGGCTG GCTGCCCAAC ATCCACGGCG GCCGGGAAAA CCTCGACGCC
GTCTCGTTCC TGCAGGAGAC CAACGCCACC GTCTACCGCC GTTTCCCGGG CGCAATGATG
ATCGCGGAGG AGTCGACCGC GTGGCCCGGG GTCACCCGCC CCACCCACCT CGGCGGCCTG
GGCTTCGGCT TCAAATGGAA CATGGGGTGG ATGCACGACA CCCTCGACTA CAACTCCCGG
CTGCCGATCC ACCGCATGTA CCACCACCAT CAGATGACCT TTTCGATGGT GTACGCCTAC
TCGGAGAACT TCATCCTGCC GTTCAGCCAC GACGAGGTGG TCCACGGCAA AGGTTCACTG
CTACGCAAGA TGCCCGGCGA CCGCTGGGCA CAGCTCGCGA ACCTGCGGGC GCTGCTCGCC
TACATGTGGG CGCATCCCGG CAAGAAGCTG CTCTTCATGG GCTGCGAGTT CGCCCAGGAC
AACGAGTGGA ACGAGTCCGC GTCCCTCGAA TGGCCCCTGC TGGACGATCC CGCCCACGCC
GGTGTCGCGG ACCTCGTGCG CGACCTGAAC GGTCTCTACC GGACGGTCCC GGCCCTGTAC
CAACACGACG CCGACCCGGC GGGCTTCTCC TGGATCGACG CGAACGACGC GGAGAACAAC
GTCTTCTCGT TCCTGCGGTG GTCGGGCGAG GACCCGGCGG GCGGCGTGCT GGCCTGCGTC
ACCAACTTCG CCGGCATCGG GCACGAGGGC TACCGCATCG GGCTGCCGTT CCCCGGCCGC
TGGCGCGAGA TACTCAACAC CGACGGGTAC CGCTACGGCG GCGGCAACAT CGGCAATCTC
GGCTCGGTCC AGGCTGTCGA GGAGCCGCAT CACGGTCTCG ACGCCTCAGC AACCCTGACG
CTGCCGCCTC TCGGGGCCAT CTGGCTCTCC CCCGCCTAG
 
Protein sequence
MNNGDVNNGT ARTGRLATGL APDRPNPRGA ATAVRNGGGR VALADRDASS ASAQPGQTAD 
DPAVPSAPPS APPSALPADP PAVAPAVSVD LPYDDLERLV SGAHHDPHAL LGAHPHPGSD
ATVVRVLRPD ARAVTVLVGP ARYPATRLHS GGVFGVAVPG MLPDYRIEVT YPDGPYLIDD
PYRHLPTLGE MDLHLIIEGR HEQLWKVLGA HPRALTTPGG ATVTGVSFAV WAPSARGVRL
VGDFDFWDGR AFPMRSLGRS GIWELFVPGA GTGARYKYEI LGFDGIWRQK ADPLAFHTEV
PPATASVVFA SDFTWNDGAW LDRRARTAWR TEPVSVYEVH LGSWRRGLSY RELAEELTAY
VVENGFTHVE MLPVAEHPFG GSWGYQVSAY YAPTARFGSP DEFRHLVDTL HRAGIGVIVD
WVPAHFPRDT WALGRFDGTP LYEHPDPRRG EQPDWGTYVF DLGRPEVRNF LVANALYWFE
EFHIDGLRVD AVASMLYLDY SRPEGGWLPN IHGGRENLDA VSFLQETNAT VYRRFPGAMM
IAEESTAWPG VTRPTHLGGL GFGFKWNMGW MHDTLDYNSR LPIHRMYHHH QMTFSMVYAY
SENFILPFSH DEVVHGKGSL LRKMPGDRWA QLANLRALLA YMWAHPGKKL LFMGCEFAQD
NEWNESASLE WPLLDDPAHA GVADLVRDLN GLYRTVPALY QHDADPAGFS WIDANDAENN
VFSFLRWSGE DPAGGVLACV TNFAGIGHEG YRIGLPFPGR WREILNTDGY RYGGGNIGNL
GSVQAVEEPH HGLDASATLT LPPLGAIWLS PA