Gene Francci3_4364 details


Gene Information

Locus tag: Francci3_4364
Symbol:
ID: 3907336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5211977
End bp: 5213014
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 66%
IMG OID: 637881695
Product: fructose-bisphosphate aldolase
Protein accession: YP_483439
Protein GI: 86743039
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0191] Fructose/tagatose bisphosphate aldolase
TIGRFAM ID: [TIGR00167] ketose-bisphosphate aldolases
            [TIGR01520] fructose-bisphosphate aldolase, class II, yeast/E. coli subtype
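
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record, although both are consistent with the reported values. The function and constant names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5211977
END_BP = 5213014
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```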


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00568724
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATCG CCACGCCAGA CGTATATGCC GAGATGCTCG ACCGGGCGAA GGCCCAGTCC 
TTCGCCTACC CCGCGATCAA CGTGACCTCG TCGCAGTCCC TGAACGCCGC GCTGCGGGGC
TTCACCGAAG CCGGCAGTGA CGGCATCGTG CAGGTGTCCA CCGGCGGTGC CGAGTACCTG
TCCGGCTCGA CGGTGAAGAA CATGGTCCTC GGCGCGGAGG CACTGGCCGA GTTCGCCCAC
CACGTCGCGA AGGCGTATCC GGTCAACATC GCCCTGCACA CCGACCACTG CCCGGCCGAC
AAGCTTGACA CCTACATCCG CCCCTTGATC GCGATCTCGA AGAATCGGGT GGCCCAGGGA
CGCGAGCCGC TCTTCCAGTC GCACATGTGG GACGGCTCGG CGGTCCCCCT CGAGGAGAAC
CTCAAGATCG CCGAAGAACT ACTCGCGGAC GCCGCCGCCG CGAAGATCGT TCTCGAGGTC
GAGATCGGGG TCGTCGGCGG CGAGGAGGAC GGCGTCGTCG GCGCGATCGA CGAGAAGCTC
TACACGACGC CGGAGGACAT GTGGCGGACG GCCGAGGTGC TCGGCACCGG CGCGAAGGGG
CGCTACCTGC TCGCCGCCAC CTTCGGCAAC GTGCACGGTG TGTACAAGCC CGGAAACGTC
AAGCTACGCC CGACGATTCT GCACGAAGGA CAGGAGTACG TCGCCAGGAA GCTCGGGCTG
CCAGCCGGCG CGAAGCCGTT CAACCTCGTC TTTCACGGCG GTAGCGGGTC GGCTCTCACC
GAAATCCGCG AGACCCTCGA CTACGGGGTG GTCAAGATGA ACGTGGACAC GGACACCCAG
TACGCATTCA CCCGCCCCAT CGTGGACCAC GTGTTCAAAA ACTATGACGG CGTTCTCAAG
GTGGACGGTG AGGTCGGCGT GAAGAAGGCG TACGACCCGC GTACCTACGG AAAGCTCGCG
GAGAGCAGCA TGGCGGCCCG CGTCGCCCAG GCGTGTGAGG ACCTCCGTTC CGCCGGCACC
AGCCTCGGGC GGGCATAG
 
Protein sequence
MPIATPDVYA EMLDRAKAQS FAYPAINVTS SQSLNAALRG FTEAGSDGIV QVSTGGAEYL 
SGSTVKNMVL GAEALAEFAH HVAKAYPVNI ALHTDHCPAD KLDTYIRPLI AISKNRVAQG
REPLFQSHMW DGSAVPLEEN LKIAEELLAD AAAAKIVLEV EIGVVGGEED GVVGAIDEKL
YTTPEDMWRT AEVLGTGAKG RYLLAATFGN VHGVYKPGNV KLRPTILHEG QEYVARKLGL
PAGAKPFNLV FHGGSGSALT EIRETLDYGV VKMNVDTDTQ YAFTRPIVDH VFKNYDGVLK
VDGEVGVKKA YDPRTYGKLA ESSMAARVAQ ACEDLRSAGT SLGRA
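
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the tool that produced this record: it uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not model table 11's alternative start codons. The helper names (clean, gc_fraction, translate) are hypothetical. The whitespace-separated 10-base blocks used in the record are accepted as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases cycled as T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGCCCATCG CCACGCCAGA CGTATATGCC"))  # MPIATPDVYA
    print(gc_fraction("ATGCCCATCG"))  # 0.6
```

Passing the full gene sequence to translate and gc_fraction would allow a comparison against the protein sequence and the GC content field listed in this record.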