Gene Franean1_0217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0217 
Symbol 
ID5668642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp264473 
End bp265507 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content66% 
IMG OID641239146 
Productfructose-bisphosphate aldolase 
Protein accessionYP_001504590 
Protein GI158312082 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0191] Fructose/tagatose bisphosphate aldolase 
TIGRFAM ID[TIGR00167] ketose-bisphosphate aldolases
[TIGR01520] fructose-bisphosphate aldolase, class II, yeast/E. coli subtype 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.113705 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATCG CCAGCCCAGA CGTCTACGCC GAGATGCTCA GCCGGGCGAA GTCGAACGCC 
TACGCCTACC CCGCCATCAA CGTGACCTCG TCGCAGACCC TCAATGCCGC GCTCCGGGGT
TTCGCGGAAG CCGGCAGCGA CGGAATCGTC CAGGTGTCGA CCGGCGGCGC CGAGTTCCTC
TCGGGAACGA CCATCAAGAA CATGGTGCTG GGCGCGGAAG CGCTCGCCGA ATACGCGCAC
CACGTCGCCA AGGCATACCC GGTGAACATC GCGCTGCACA CGGACCACTG CCCCGCCGAC
AAGCTGGACA CCTACATCCG CCCGCTGATC GCCATCTCGA AGGAGCGTGT GGCGCAGGGC
CGCGACCCGC TTTTCCAGTC CCACATGTGG GACGGTTCGG CGGTCGAGCT CGAGGAGAAC
CTCAAGATTG CCGACGAGCT GCTCGCCGAC TGTCGCGCGG CGCGCATCGT GCTGGAAGTC
GAAATCGGTG TCGTCGGTGG TGAGGAGGAC GGCGTCGTAG GCGCCATCGA CGAGAAGCTC
TACACCACCC CCGGTGACAT GTTCCGCACC GCCGAGGTTC TCGGCACCGG AGAAAAGGGC
AGCTACATGC TGGCCGCGAC GTTCGGCAAC GTGCACGGCG TCTACAAGCC GGGGAACGTC
AAGCTCCGGC CCTCGATCCT GCGCGAGGGT CAGCAGCACG TGGCCGAGAA GCTCGGCCTG
GCCGCCGACG CGAAGCCGTT CAACCTGGTC TTCCATGGCG GCAGTGGGTC GGACCTCTCC
GAGATCCGCG AAACGCTCGA CTACGGCGTC ATCAAGATGA ACGTGGACAC CGACACCCAG
TACGCGTTCA CCCGGCCGAT CGTCGACCAC ATGCTCCGCA ACTACGACGG TGTCCTCAAG
GTGGACGGTG AGGTCGGGGT CAAGAAGGCC TACGACCCGC GCACCTACGG CAAGGCCGCC
GAAACCGCCA TGGCCGCCCG GGTCGCCCAG GCCTGTGACG ACCTGCGGTC CGCCGGCCGG
TCGATCGGGG TCTGA
 
Protein sequence
MPIASPDVYA EMLSRAKSNA YAYPAINVTS SQTLNAALRG FAEAGSDGIV QVSTGGAEFL 
SGTTIKNMVL GAEALAEYAH HVAKAYPVNI ALHTDHCPAD KLDTYIRPLI AISKERVAQG
RDPLFQSHMW DGSAVELEEN LKIADELLAD CRAARIVLEV EIGVVGGEED GVVGAIDEKL
YTTPGDMFRT AEVLGTGEKG SYMLAATFGN VHGVYKPGNV KLRPSILREG QQHVAEKLGL
AADAKPFNLV FHGGSGSDLS EIRETLDYGV IKMNVDTDTQ YAFTRPIVDH MLRNYDGVLK
VDGEVGVKKA YDPRTYGKAA ETAMAARVAQ ACDDLRSAGR SIGV