Gene Franean1_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2221 
Symbol 
ID5670620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2657317 
End bp2659206 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content72% 
IMG OID641241141 
Productglycoside hydrolase 15-related 
Protein accessionYP_001506562 
Protein GI158314054 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.47464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0451069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCATCCC TGATCGAGGA TTACGCGCTG ATAGGAGACA CTCACTCGGC GGCGCTGGTC 
TCCCGCGACG GCTCGATCGA CTGGCTGTGC CTGCCTCGGT TCGACTCCCC GGCCTGCTTC
GCCGCGCTGC TCGGGGACAA CGAGTCCGGC CACTGGAAGA TCGCGCCGGT CGAGCCGGTG
ATCGGGGTGA GCCGCCGCTA CCGCGGCGAC ACCCTGGTGC TCGAGACCGA CATGACCACC
GCGTCGGGCA TCGTCCGGAT CGTCGACGCG ATGTTCCCCC GGGCGGGGGC GCACGTCGTC
CTGCGGCTGG TGGAGTGCCT GCAGGGCACC GTCCGGCTCC GGTCGGAGAC CCGCTTCCGG
TTCGACTACG GCTCGATCGT GCCGTGGGTG CGCCGGCTCG ACGAGCACTC GGTCGCCGCG
ATCGCCGGCC CCGACTCGGT GACGCTGTGG ACCACCGCCC CGATGGAGGG GCACGACATG
GCCACCTACG CGGAGTTCAG CGTCTCGGCC GGGCAGTCCG TCCCGTTCTC GCTCACCTGG
CGGCCCTCGC ACGAACCGGC GCCAGCCCCG CAGGATGTCC GGCGGATGAT CTCGCAGACC
GAGGCATGGT GGTCGGACTG GATGGCCGGC TGCACCTACG ACGGCCAGTG GCAGCCGGCG
GTCCGCCGGT CGCTGATCAC CCTGAAGGCG CTCACCTACG CCCCGACGGG CGGCATCGTC
GCCGCGGTCA CCACCTCGCT GCCCGAGCAC ATCGGCGGCG TCCGCAACTG GGACTACCGC
TACTGCTGGC TGCGCGACGC CACGATCACC CTGCTGGCGC TGCTCGACGC CGGCTTCACC
AGCGAGGCGA CGGCCTGGCG GGAATGGCTG CTGCGCGCCG TGGCCGGCGA CCCGTCCCGG
GTGCAGATCA TGTACGGCGT GGCCGGGGAG CGGCGGCTGC CCGAGTACGA GGTCCCGTGG
CTGCCGGGCT ACGAGAACTC CTCCCCCGTC CGGGTGGGAA ACGCCGCCGT CGACCAGTTC
CAGCTCGACG TCTACGGGGA GGTCCTCGAC GCCCTGCACG TGGCCCGCGT CGCGGTGGCG
AACCGGCGTC CCGGCTCGGC CCGTGACGGT TTCCTCGCGG GGATCCACTC CTCCGAGGAC
GACGGCCGGG ACGGCTCCTG GCAGCTGCAG ACCAAGCTCA TGGACTTCCT CGAGACCGGG
TGGCGCAAGG CCGACGAGGG CATCTGGGAG GTCCGCGGGC CGCGCCGGCA TTTCGTCCAC
TCGAAGGTCA TGGCCTGGGT GGCCGCCGAC CGGGCCGTCC GGGGGATCGT GGAGTCCGGG
CTACCCGGCC CGGTCGAGCG GTGGTCGGCG CTGCGGGACG AGATCCACCA CGAGGTGTGC
GCCCGCGGGT TCGACTCCGA CCGCAACACG TTCACCCAGT TCTACGGCTC GAAGGAGCTC
GACGCGGCCC TGCTCTACAT GTCACTCGTC GGCTTCCTGC CGGCGACCGA CCCACGGGTC
GTCGGCACCG TCGCCGCCAT CGAGCGCGAG CTGATGGAGG ACGGCTTCGT CATGCGCTAC
CCGACGGCCG AGGACGGCGC CGTCGACGGC CTGCCGGCCG GGGAGGGTGC CTTCCTCGCC
TGCACCTTCT GGCTGGCGGA CAACTACGCG CTGTCCGGGC GGGTGCACGA GGCGCAGGAG
CTGTTCGAGC GGCTGCTGGC GCTGCGCAAC GACGTCGGGC TGCTGGCCGA GGAGTACGAC
CCGCGGCTGG GCCGGATGAC CGGCAACTTC CCGCAGGCGT TCAGCCACGT CCCGCTGGTC
AACACCGCGC GCACGCTCAC CGATGCGCTG CGCGGCCGGC CGCGCTCGCG CACCGACCGC
GCGCACCCGC CGGGCCACTT CTTCGGCTGA
 
Protein sequence
MPSLIEDYAL IGDTHSAALV SRDGSIDWLC LPRFDSPACF AALLGDNESG HWKIAPVEPV 
IGVSRRYRGD TLVLETDMTT ASGIVRIVDA MFPRAGAHVV LRLVECLQGT VRLRSETRFR
FDYGSIVPWV RRLDEHSVAA IAGPDSVTLW TTAPMEGHDM ATYAEFSVSA GQSVPFSLTW
RPSHEPAPAP QDVRRMISQT EAWWSDWMAG CTYDGQWQPA VRRSLITLKA LTYAPTGGIV
AAVTTSLPEH IGGVRNWDYR YCWLRDATIT LLALLDAGFT SEATAWREWL LRAVAGDPSR
VQIMYGVAGE RRLPEYEVPW LPGYENSSPV RVGNAAVDQF QLDVYGEVLD ALHVARVAVA
NRRPGSARDG FLAGIHSSED DGRDGSWQLQ TKLMDFLETG WRKADEGIWE VRGPRRHFVH
SKVMAWVAAD RAVRGIVESG LPGPVERWSA LRDEIHHEVC ARGFDSDRNT FTQFYGSKEL
DAALLYMSLV GFLPATDPRV VGTVAAIERE LMEDGFVMRY PTAEDGAVDG LPAGEGAFLA
CTFWLADNYA LSGRVHEAQE LFERLLALRN DVGLLAEEYD PRLGRMTGNF PQAFSHVPLV
NTARTLTDAL RGRPRSRTDR AHPPGHFFG