Gene Franean1_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2027 
Symbol 
ID5670428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2435613 
End bp2438243 
Gene Length2631 bp 
Protein Length876 aa 
Translation table11 
GC content74% 
IMG OID641240948 
Productalpha amylase catalytic region 
Protein accessionYP_001506370 
Protein GI158313862 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.748072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.857591 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCAGAC ATGGCAGGCC GCGAGCGCCG GCCGAGGTCT ACCGATGGGA CGACGTGACG 
ACAGAGCATG GCGCCGACCG GCATGTGGGC CCGTCACCCG CCACCGGCGC CGTCCGCGTG
GACGTCCTCT ACCACACCGG CGTCGGCCGG CGGATCGCGA ACGGGGCCCG GCTGGTCGGG
AGCTGGGACG AGCACGGGCT GCAGGCGGGG AGCTGGTCGT CCACGCCGAT GGAGGAGTTC
ACCGCCGAGG ACGGCTGCGC CGCCTACCGG GCCACCGTCA CGGTGGACGC GAGCGTCTCC
CGGACCTTCG ACTGGGGCGT CTGGCTGACC CGTCCCGACG GCTCGCAGGT GTGGGGGATC
CCCGCCGAGG TCCCGGATCC CGGGGCGACG GAGCAGGTCC GGCGGTTCGA GGTCGGCCCG
GGGTGCGGGC CGGTGGTGCG CGCGGAGTTC CGCCTGGCCG CCCACCGGCA CAACGGCGCC
CGGCGCGTCC TGCCCGTCGG CTCGAGAGAC CTGGGCAGCC CGGTCAGCAC GGTCAGCACG
GGCGGCGGCT CAGGCGGCCC CAGCGGCCCC AGCGGCTCAG GCGCCGCCGT CAGCTCGGAC
GCCCGGGAAC GGATCCGGTT CCGGGTCTGG GCGCCGCACG CGCTCGCCGT CGAGGTGGCC
TTCGCCGGGC CCGGCGGCTA CATCGCCGAC GACGGCCACG GTGAGGACGA GCTGATCACC
AGGCTGCCGA TGCGCCAGGT CGGCGACGGC TGGTGGGAGG CGGCCGCACC CGGCTTCGCC
GACTGGGTCG GACGGCGCTA CCTCTACCGC GTCACCCGCG ACGACGGGTC CGTCGCCTGG
CGCTCGGACA TGTACTCGGC GCAGCAGTGC GGCACCGGGG ACATCGACCC CTGCGGCGCC
CCCCACGACG GCCCGGCCGA GGACCTCGAC GGCTCGGTGA GCTGCTCGGT CGTCGTCGAC
ACCCGGGACG ACGAGCGGTT CTGGGCCGAC GAGTTCGACC CGGCCCGGCC CGTCCCCCGG
CGGGTCGAGG ACCTGGTCAT CTACGAGCTG CACGTGGGCG CGCTGGGCTT CGGGCACACC
GGGGCCGGCA CGTTCGCCGA CGCGCTCGCG TTCGTCGACC ACCTCACCGA CCTCGGGGTG
AACGCCGTCG AGCTGCTGCC CATGTTCGAG TTCGCCGGGA CGAGGTCCTG GGGTTACGGC
AGCTCGCACT TCCTGGCGGT GAAGCAGAGC GCGGGCGGGC GGGCGGCGCT GCGCCGGTTC
GTGCGGGCCT GCCACCAGCG GGGCGTGGCC GTCCTGATGG ACGTCGTCTA CAACCACTAC
ACGCCGAACG CGCAGCGTTC CGCCTGGCAG TACGACAGCA CCGCGCCGAG CCGCAACATC
TACTACTGGT ACGAGGGCGC CGAGGACGAC CACCCGCACC CGGACGGCGG ATACCTCGAC
AACGTCTCCT CGGGCTGGGC GCCGCGCTAC TCCGACGAGA ACGTACGTGC GCTGTTCGTC
GCGAGCGCGG TGGCGCTGCT CGACGAGTTC CACATCGACG GGCTGCGGGT GGACCAGACG
ACGTCCATCC ACGCCTACAA CAGCCTGCAC GCTGACGGGC GGCCCGTTGC GGCGGCGAAC
ATCGCCGGCC GCAAGTTCCT GCGGGAGCTG TGCCAGACGC TGCGCCTGGT CGACCCGGAC
GTGATCCTCA TCGCCGAGGA CCACTCCGGC TGGGCGGAGG TCACCCGCCC CGCCGAGTCC
GGCGGGCTCG GCTTCGACGC CCACTGGTAC GTCGACTTCT ACCACCACCT CGTCGGCGAC
AAGGGCGAGG GACCGGAGTA CGCCAAGCTG CTGCACACCG CGGGGCGCGA CCCCGCCGGC
CCGCTGGCCA TGAGCCTGTT CGCCAAGGCG TTCACCGCCG CCGCGGACCG CACGGTCGTC
TACACCGAAA GCCACGACGA GGCCGGCAAC TCCGAGCACT CGGCCCGCAA CATCCTCGTC
GCCGTCGACC ACGCGCCGCT GCACGGCGAC ACGGCCTGGT TCGCGTTCGC GCGGCTGCGC
TGCGCCGCGG CGCTGACCCT GCTTTCGCCC GGCACGCCGA TGTTCCTCAT GGGCGACGAG
GTGGGAGCCC GGCGGGCCTA CACCCACGAC GGGTTCGCCG AGGCCAAGGA GGACCTCGCC
GGGCTGCGCG CGGGCGAGGG CGCCGAGCTG TTCGCCTGCT ATCGGGCGCT CGTCACGCTG
CGGCTGGGCA GCCCGGCGCT GCGCTCGCGC GCGGTCGAAC TGGTCGGCGC CGACGACACC
GCGCGGGTGC TGGCGTTCCG CCGCTGGGAC CGCGGCGAGG AGATCCTGGT CGTGGTAAGC
CTGAACAACG ATCCGCTGCC CAGGTTCGGG CTATCGCATC CGTCGCTGGC CGGCCGGCGG
TGGAAGCCGG TGCTGGACAC CGACGCGCCA CGGTTCGGCG GACGGGCGGG CGGCTCGCGC
CGGTCGCTGT CCCCACGGGG CGACAGCGTG CGGGTCGATC TGCCGGCCGC CGGTGCCGTG
GTGTTCCGCC GCCGCCGGCG CGGCGCCGGC CTGACCGACG GGCCGTCGGA CGTCCCGGCC
CGCCCCCGCC GCCTGCGCCT GCCCGGCGTG CGCCGACGAG GCGGGCGCTG A
 
Protein sequence
MGRHGRPRAP AEVYRWDDVT TEHGADRHVG PSPATGAVRV DVLYHTGVGR RIANGARLVG 
SWDEHGLQAG SWSSTPMEEF TAEDGCAAYR ATVTVDASVS RTFDWGVWLT RPDGSQVWGI
PAEVPDPGAT EQVRRFEVGP GCGPVVRAEF RLAAHRHNGA RRVLPVGSRD LGSPVSTVST
GGGSGGPSGP SGSGAAVSSD ARERIRFRVW APHALAVEVA FAGPGGYIAD DGHGEDELIT
RLPMRQVGDG WWEAAAPGFA DWVGRRYLYR VTRDDGSVAW RSDMYSAQQC GTGDIDPCGA
PHDGPAEDLD GSVSCSVVVD TRDDERFWAD EFDPARPVPR RVEDLVIYEL HVGALGFGHT
GAGTFADALA FVDHLTDLGV NAVELLPMFE FAGTRSWGYG SSHFLAVKQS AGGRAALRRF
VRACHQRGVA VLMDVVYNHY TPNAQRSAWQ YDSTAPSRNI YYWYEGAEDD HPHPDGGYLD
NVSSGWAPRY SDENVRALFV ASAVALLDEF HIDGLRVDQT TSIHAYNSLH ADGRPVAAAN
IAGRKFLREL CQTLRLVDPD VILIAEDHSG WAEVTRPAES GGLGFDAHWY VDFYHHLVGD
KGEGPEYAKL LHTAGRDPAG PLAMSLFAKA FTAAADRTVV YTESHDEAGN SEHSARNILV
AVDHAPLHGD TAWFAFARLR CAAALTLLSP GTPMFLMGDE VGARRAYTHD GFAEAKEDLA
GLRAGEGAEL FACYRALVTL RLGSPALRSR AVELVGADDT ARVLAFRRWD RGEEILVVVS
LNNDPLPRFG LSHPSLAGRR WKPVLDTDAP RFGGRAGGSR RSLSPRGDSV RVDLPAAGAV
VFRRRRRGAG LTDGPSDVPA RPRRLRLPGV RRRGGR