Gene Franean1_1236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1236 
Symbol 
ID5669649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1481138 
End bp1482574 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content72% 
IMG OID641240168 
Producthypothetical protein 
Protein accessionYP_001505596 
Protein GI158313088 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3408] Glycogen debranching enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0457319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00331425 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGACGCCC GCGCGGGCGC CGCGTCGGGC GGGGCGATCT CGCCGCCGGC CGCGGAAATC 
CCCCTGTCGG CGGACATCGC CACGGTGCGC GAGACGGCCG TGGCGGTGCT GCGCGCCAAC
GACCTTGGTG ACATCACCCG GCCGTCGCCG ACGCTCTACC CGCACCAGTG GCTGTGGGAC
AGCTGCTTCA TCGCCATCGG CCTGCGCCAC CTCGACCCGG CGCGTGCCGC CGCCGAGCTG
CTCTCGCTGC TGCGCGGCCA GTGGCCCGGC GGCATGATCC CGCACGTGAT CTTCGCCGAG
ACGACCGACT ACTACCACGC GGGTCCGCAG CGCTGGCGCT GCGACCGAGT CTGCACGACC
GCCGGCGGGG TCCAGTCGAC CGGCGTCACC CAGCCGCCGA TGATCGCCGA GGCGGCCGTG
CGGGTGGGCC AGGCGATGAG CAGGGAGGAC CGGGCAGACT TCTACCGGCG GCTCGCACCG
GGCCTGGTGC GCTTCCACGA GTGGCTCTAC CGCGAGCGCG ACCCCGACGA CACCGGCCTG
GTGACGCTGG TCCACTCCTG GGAATCCGGA ATGGACAACA CCCCCACCTG GATGGAGATG
ACCAAGCCGG TCGCCCCCCG GGCGGTGCGC GCGCTGCGCC GGATCAACGG CGACGACACG
CTCGACGCCC TGCGCCGCGA CTCCAAGGAG GTGCCGCCGG ACGAGCGGCT GACCTCGGCG
GACCTGTTCA CGCTCTACCG GATCGTCCGC GAGCTGCGCC GCAGCCGGTA CGACTTCCGG
ACGATCCGCT CCACCTTCGT CCCGCTCGTG CAGGACGTGG CCTTCAACGC GATCCTGGCC
CGGGCCAACG AGCATCTCGA GACGATCGCG GCCGAGGCCG GGGTGCGCAT CCCGCGCTGG
CTGTCCCGTT CGATGCGCCG GACCCGCGTG GCCCTCAACG AGCTGTGCTC GGACGGTGTC
TACTACAGCC GGAACTTCCG CACCGGCGAG CTGCTGCGCC ACCACACCAT CGCCGGCTTC
CTGCCGCTGT ACGCGGGCGT GGTGCCGGAG TCCCAGGTGG ACGACATGGT GGACACGCTC
ATGTCACCGC GGTACTGGGC CCAGTACGGC ATCGCCAGCG TGCCGACCGA CGACCCCGCC
TTCCTGCCTC GCTGCTACTG GCAGGGCCCG GTATGGGTCA ACATGAACTG GCTGATCGCG
GACGGCCTGG ACCGCTACGG CCGGACCGAG GCCGCCGCCG CGATCCGCCG CAACACCCTC
GACGTGATCG TGTCGTCGGG ATCGATGTAC GAGTACTACT CACCGCTCGA CGGCTCCGGC
GCCGGCAGCA ACCGCTTCTC CTGGACGGCC GCCCTGCTGG TGGACATGCT CGACAACGGC
GCCCGCCCGA CCTCCCGGGC CCAGGCGGCC ATCCCCACCA CCCGCACCCC CGCCTGA
 
Protein sequence
MDARAGAASG GAISPPAAEI PLSADIATVR ETAVAVLRAN DLGDITRPSP TLYPHQWLWD 
SCFIAIGLRH LDPARAAAEL LSLLRGQWPG GMIPHVIFAE TTDYYHAGPQ RWRCDRVCTT
AGGVQSTGVT QPPMIAEAAV RVGQAMSRED RADFYRRLAP GLVRFHEWLY RERDPDDTGL
VTLVHSWESG MDNTPTWMEM TKPVAPRAVR ALRRINGDDT LDALRRDSKE VPPDERLTSA
DLFTLYRIVR ELRRSRYDFR TIRSTFVPLV QDVAFNAILA RANEHLETIA AEAGVRIPRW
LSRSMRRTRV ALNELCSDGV YYSRNFRTGE LLRHHTIAGF LPLYAGVVPE SQVDDMVDTL
MSPRYWAQYG IASVPTDDPA FLPRCYWQGP VWVNMNWLIA DGLDRYGRTE AAAAIRRNTL
DVIVSSGSMY EYYSPLDGSG AGSNRFSWTA ALLVDMLDNG ARPTSRAQAA IPTTRTPA