Gene Francci3_1278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1278 
Symbol 
ID3905083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1527621 
End bp1529495 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content71% 
IMG OID637878612 
Producttransketolase 
Protein accessionYP_480385 
Protein GI86739985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAACTC CCGTATCCGA GAGCACGGTC GGGACGGCCG ACCTGGAGAC GGTTCGGGAA 
CTCGCGCAGC AACTGCGGGT CGACTCGGTG CGCAGCAGCA CCTCGGCGGG TTCCGGGCAT
CCCACCTCGT CGATGTCGGC CGCGGACGTG CTGGCGGTTC TCGTCGCGCG CCACCTTCGG
TACGACTGGG ACAACCCGGC GGACCCGGCG AACGATCATC TGATCTTTTC GAAGGGGCAC
GCATCCCCGC TGATCTACTC CATCTTCAAG GCGGTCGGGG TCGTCGGTGA CGACGAGTTG
ATCAATGGTT ACCGCCGTTT CGGTGAGCGG CTTCAGGGCC ACCCGACGCC TGTCCTGCCG
TGGGTCGACG TCGCGACCGG GTCGCTCGGG CAGGGCCTGC CCGACGGCGT GGGCGTCGCG
CTCGCCGGTA AGTACCTGGA CAAGGTGCCC TACCGGGTCT GGGTGATCTG CGGTGACAGC
GAGATGGCGG AGGGCTCGGT CTGGGAGGCA CTGGACAAGG CCTCCTACTA CAACCTCTCC
AACCTCATCG CGATCGTCGA CGTCAACCGG CTCGGCCAGC GCGGCCCGAC CGAGCTCGGC
TGGGACCTCG ACACCTACGC CAGGCGGGTG GAGTCCTTCG GCGCCCGCGC GGTCGTCGTT
GACGGCCACG ACATCGCCGC GATCGACGCG GTGCTCGCCG ACGCCGAGGA CGTCACCCGG
CCGACCGTCA TCCTCGCCCG CACCCGCAAG GGCGAGGGCT TCTCCGAGAC CGCGGACGTC
GAGGGCTGGC ACGGCAAGCC GTTCCCCGCC GACATGGCCG ACCGCGCCAT CGCCGAGCTC
GGCGGCCTGC GCGACCTGCG GGTTCGCGGT CCGGTGCCGC CCGCGGACCT CCCCCGACCG
GCGCCGGTCG AGCGGCCCGT CATCACACTG CCCACCTACG ATCTCGGCGC CAAGGTGGCG
ACTCGGCGCG CCTACGGCCA GGCGCTCGCC GCGCTCGGAG CCCGCACCGA CGTGGTGGCG
CTCGACGCGG AGGTCAGCAA CTCGACCTAC TCCGAGGACT TCGCCAAGAT CTACCCGGAC
CGCTTCTTCG AGATCTTCAT CGCCGAGCAG CAGCTCGTCG CGGCGGCGGT CGGGCTGTCG
GTGCGTGGCT ACGTGCCCTT CGCCTCCACG TTCGCGGCGT TCTTCTCCCG GGCCTACGAC
TTCATCCGGA TGGCGGCGAT CTCCCAGGCC GACATCCGGC TGTCCGGCAG CCACGCGGGA
GTCGAGATCG GCGCCGACGG GCCGTCGCAG ATGGCGCTCG AGGACATCGC CATGATGCGG
GCGGTCGGCG GCTCGACGGT GCTCTACCCG AGCGATGCGA CGTCCGCCGC GAAGCTCGTC
GAGGCGATGG CGGATCTGCC CGGGGTCAGC TACCTGCGCA CCACCCGTGG AGCCTACCCC
GTGCTCTATG GCCCGGACGA GCCGTTTGCC CCCGGTGGTT CCAAGGTGCT CCGCCGATCG
AATGCCGACA CCGTCACGCT CATCGGTGCC GGTGTCACCC TGCACGAGTC GCTGGCCGCG
GCGGACACCC TCGTGGCCGA GGGGATCGCG GCCCGCGTGA TCGACCTGTA TTCCGTGAAG
CCGGTCGACG CGGCGACGCT CGCCGACGCC GCCCAGGCCA CCGGCGGGCG CATCGTCGTC
ACCGAGGATC ACTACCCGGA GGGTGGACTG GCCAGCGCGG TGCTGGAAGC GCTCAGCGAC
GGCGGCGTGC CGCTGTCGGT CCGTCATCTC GCGGTGCGGT CGCTGCCGGG TTCCGGGACG
TCTCAGGAGC TCCTGGACGC GGCCGGCATC TCCGCGAGGC ACATCGTCGA GGCCGCCCGC
GACCTGCTGC GCTGA
 
Protein sequence
MATPVSESTV GTADLETVRE LAQQLRVDSV RSSTSAGSGH PTSSMSAADV LAVLVARHLR 
YDWDNPADPA NDHLIFSKGH ASPLIYSIFK AVGVVGDDEL INGYRRFGER LQGHPTPVLP
WVDVATGSLG QGLPDGVGVA LAGKYLDKVP YRVWVICGDS EMAEGSVWEA LDKASYYNLS
NLIAIVDVNR LGQRGPTELG WDLDTYARRV ESFGARAVVV DGHDIAAIDA VLADAEDVTR
PTVILARTRK GEGFSETADV EGWHGKPFPA DMADRAIAEL GGLRDLRVRG PVPPADLPRP
APVERPVITL PTYDLGAKVA TRRAYGQALA ALGARTDVVA LDAEVSNSTY SEDFAKIYPD
RFFEIFIAEQ QLVAAAVGLS VRGYVPFAST FAAFFSRAYD FIRMAAISQA DIRLSGSHAG
VEIGADGPSQ MALEDIAMMR AVGGSTVLYP SDATSAAKLV EAMADLPGVS YLRTTRGAYP
VLYGPDEPFA PGGSKVLRRS NADTVTLIGA GVTLHESLAA ADTLVAEGIA ARVIDLYSVK
PVDAATLADA AQATGGRIVV TEDHYPEGGL ASAVLEALSD GGVPLSVRHL AVRSLPGSGT
SQELLDAAGI SARHIVEAAR DLLR