Gene Franean1_2123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2123 
Symbol 
ID5670523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2550384 
End bp2552267 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content72% 
IMG OID641241044 
Producttransketolase 
Protein accessionYP_001506465 
Protein GI158313957 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAACTC CCGTGTCCGA AGTGGTCGGT GTCGCCGACC TGGAGACGGT TCGGGAACTC 
GCACAGCAGC TGCGGGTCGA CTCGGTGCGC AGCAGCACCT CGGCCGGATC CGGTCACCCG
ACCTCCTCCA TGTCCGCCGC GGACGTCATG GCGGTGCTGC TGACGCGCCA CCTGCGCTAC
GACTGGGACA CCCCGGACGA CCCGGCGAAC GACCACCTGA TCTTCTCCAA GGGGCACGCG
TCGCCCCTGA TCTACTCGAT GTTCAAGGCG GTCGGCGTCG TCACCGACGA CGAGCTCATG
ACCGGCTACC GCCGGTTCGG GCACCGCCTG CAGGGCCACC CGACCCCGGT CCTGCCCTGG
GTCGACGTGG CGACCGGATC ACTCGGCCAG GGCCTCCCCG ACGCCGTCGG GGTCGCGCTG
GCCGGCAAGT TCCTGGACAA GGTCCCGTAC CGGGTCTGGG TGGTCTGCGG CGACAGCGAG
ATGGCTGAGG GCTCGATGTG GGAGGCGCTG GACAAGGCCG CCTTCTACGG CCTGTCCAAC
CTGATCGCGA TCGTGGACGT CAACCGTCTC GGCCAGCGCG GCCCGACCGA GTTCGGCTGG
GACATGGACG CCTACGCCCG CCGGGTCGAG GCGTTCGGCG CCCGCGCGGT CGTGATCGAC
GGCCACGACC TGGGCGCGAT CGACGCCGCG CTCGCCGACG CCGAGGACGC GACGCGGCCC
ACGGTGATCC TGGCCCGCAC CCGCAAGGGC GAAGGCTTCT CAGAGGTCGC CGATGCCGAG
GGCTGGCACG GCAAGCCGCT GCCCGAGGAC ATGGCGACCC GAGCGATCAT CGAGCTGGGC
GGCGAGCGGA ACCTGCGCGT GCGCGGCCCG GTTCCCCCGG CGGACATGGC CCGCCCGGCC
CCGGCCCAGA AGCCGGCGAT CAAGCTGCCG ACCTACGAGC TGGGCGCCAA GGTCGCCACC
AGGCGGGCCT ACGGCCAGGC GCTGGCCGCG GTCGGCGCGC ACACCGACGT CGTCGCCCTC
GACGCCGAGG TGAGCAACTC GACCTACGCC GAGGACTTCG CCAAGGCGCA CCCTGACCGG
TTCTTCGAGA TCTTCATCGC CGAGCAGCAG CTCGTCGCGG CGGCGGTCGG CCTGTCGGTG
CGCGGCTACG TTCCGTTCGC CTCCACGTTC GCCGCGTTCT TCTCCCGCGC CTACGACTTC
ATCCGGATGG CGGCCATCTC GCAGGTCGAC ATCCGGCTGG CCGGGTCGCA CGCCGGGGTG
GAGATCGGTG CGGACGGCCC GTCCCAGATG GCGCTCGAGG ATCTCGCGAT GATGCGGGCC
GTGGGCGGCT CGACGGTGCT CTACCCGAGC GACGCGACCT CCGCGGCCAA GCTGATCGAG
GCGATGGTCG GGATCTCCGG CGTCAGCTAC ATGCGGACGA CCCGCGGTGC CTACCCGGTG
CTCTACGGCC CGGACGAGAC GTTCGCGCCG GGCGGTTCCA AGGTGCTGCG CAGCTCCGAC
GCCGACACCG TGACGCTGAT CGGCGCCGGG GTCACCCTGC ACCACTGCCT GGCCGCGGCC
GACGTGCTCG CGGCCGAGGG AGTCTCGGCA CGGGTGATCG ACCTGTACTC GGTCAAGCCG
GTGGACACCG CGACCCTGGC GGAGGCGGCG CGGGCGACCG GTGGCCGCCT CGTCGTCGCG
GAGGACCACT ACGCCGAGGG CGGCCTCGGC GGCGCGGTGC TCGAGGGCCT CGCGGGCGTC
CTGCGCGACG GCAGCGTCTC GCTGCGGCTG GCGCATCTTG CCGTCGGTGA GCTTCCCGGT
TCCGGTACCC CGGACGAGCT GCTGGACTTC GCCGGCATCT CCACGCCGCA CATCGTCGAG
GCGGCGCGCG GCCTGCTGCG CTGA
 
Protein sequence
MATPVSEVVG VADLETVREL AQQLRVDSVR SSTSAGSGHP TSSMSAADVM AVLLTRHLRY 
DWDTPDDPAN DHLIFSKGHA SPLIYSMFKA VGVVTDDELM TGYRRFGHRL QGHPTPVLPW
VDVATGSLGQ GLPDAVGVAL AGKFLDKVPY RVWVVCGDSE MAEGSMWEAL DKAAFYGLSN
LIAIVDVNRL GQRGPTEFGW DMDAYARRVE AFGARAVVID GHDLGAIDAA LADAEDATRP
TVILARTRKG EGFSEVADAE GWHGKPLPED MATRAIIELG GERNLRVRGP VPPADMARPA
PAQKPAIKLP TYELGAKVAT RRAYGQALAA VGAHTDVVAL DAEVSNSTYA EDFAKAHPDR
FFEIFIAEQQ LVAAAVGLSV RGYVPFASTF AAFFSRAYDF IRMAAISQVD IRLAGSHAGV
EIGADGPSQM ALEDLAMMRA VGGSTVLYPS DATSAAKLIE AMVGISGVSY MRTTRGAYPV
LYGPDETFAP GGSKVLRSSD ADTVTLIGAG VTLHHCLAAA DVLAAEGVSA RVIDLYSVKP
VDTATLAEAA RATGGRLVVA EDHYAEGGLG GAVLEGLAGV LRDGSVSLRL AHLAVGELPG
SGTPDELLDF AGISTPHIVE AARGLLR