Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1278 |
Symbol | |
ID | 3905083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1527621 |
End bp | 1529495 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878612 |
Product | transketolase |
Protein accession | YP_480385 |
Protein GI | 86739985 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAACTC CCGTATCCGA GAGCACGGTC GGGACGGCCG ACCTGGAGAC GGTTCGGGAA CTCGCGCAGC AACTGCGGGT CGACTCGGTG CGCAGCAGCA CCTCGGCGGG TTCCGGGCAT CCCACCTCGT CGATGTCGGC CGCGGACGTG CTGGCGGTTC TCGTCGCGCG CCACCTTCGG TACGACTGGG ACAACCCGGC GGACCCGGCG AACGATCATC TGATCTTTTC GAAGGGGCAC GCATCCCCGC TGATCTACTC CATCTTCAAG GCGGTCGGGG TCGTCGGTGA CGACGAGTTG ATCAATGGTT ACCGCCGTTT CGGTGAGCGG CTTCAGGGCC ACCCGACGCC TGTCCTGCCG TGGGTCGACG TCGCGACCGG GTCGCTCGGG CAGGGCCTGC CCGACGGCGT GGGCGTCGCG CTCGCCGGTA AGTACCTGGA CAAGGTGCCC TACCGGGTCT GGGTGATCTG CGGTGACAGC GAGATGGCGG AGGGCTCGGT CTGGGAGGCA CTGGACAAGG CCTCCTACTA CAACCTCTCC AACCTCATCG CGATCGTCGA CGTCAACCGG CTCGGCCAGC GCGGCCCGAC CGAGCTCGGC TGGGACCTCG ACACCTACGC CAGGCGGGTG GAGTCCTTCG GCGCCCGCGC GGTCGTCGTT GACGGCCACG ACATCGCCGC GATCGACGCG GTGCTCGCCG ACGCCGAGGA CGTCACCCGG CCGACCGTCA TCCTCGCCCG CACCCGCAAG GGCGAGGGCT TCTCCGAGAC CGCGGACGTC GAGGGCTGGC ACGGCAAGCC GTTCCCCGCC GACATGGCCG ACCGCGCCAT CGCCGAGCTC GGCGGCCTGC GCGACCTGCG GGTTCGCGGT CCGGTGCCGC CCGCGGACCT CCCCCGACCG GCGCCGGTCG AGCGGCCCGT CATCACACTG CCCACCTACG ATCTCGGCGC CAAGGTGGCG ACTCGGCGCG CCTACGGCCA GGCGCTCGCC GCGCTCGGAG CCCGCACCGA CGTGGTGGCG CTCGACGCGG AGGTCAGCAA CTCGACCTAC TCCGAGGACT TCGCCAAGAT CTACCCGGAC CGCTTCTTCG AGATCTTCAT CGCCGAGCAG CAGCTCGTCG CGGCGGCGGT CGGGCTGTCG GTGCGTGGCT ACGTGCCCTT CGCCTCCACG TTCGCGGCGT TCTTCTCCCG GGCCTACGAC TTCATCCGGA TGGCGGCGAT CTCCCAGGCC GACATCCGGC TGTCCGGCAG CCACGCGGGA GTCGAGATCG GCGCCGACGG GCCGTCGCAG ATGGCGCTCG AGGACATCGC CATGATGCGG GCGGTCGGCG GCTCGACGGT GCTCTACCCG AGCGATGCGA CGTCCGCCGC GAAGCTCGTC GAGGCGATGG CGGATCTGCC CGGGGTCAGC TACCTGCGCA CCACCCGTGG AGCCTACCCC GTGCTCTATG GCCCGGACGA GCCGTTTGCC CCCGGTGGTT CCAAGGTGCT CCGCCGATCG AATGCCGACA CCGTCACGCT CATCGGTGCC GGTGTCACCC TGCACGAGTC GCTGGCCGCG GCGGACACCC TCGTGGCCGA GGGGATCGCG GCCCGCGTGA TCGACCTGTA TTCCGTGAAG CCGGTCGACG CGGCGACGCT CGCCGACGCC GCCCAGGCCA CCGGCGGGCG CATCGTCGTC ACCGAGGATC ACTACCCGGA GGGTGGACTG GCCAGCGCGG TGCTGGAAGC GCTCAGCGAC GGCGGCGTGC CGCTGTCGGT CCGTCATCTC GCGGTGCGGT CGCTGCCGGG TTCCGGGACG TCTCAGGAGC TCCTGGACGC GGCCGGCATC TCCGCGAGGC ACATCGTCGA GGCCGCCCGC GACCTGCTGC GCTGA
|
Protein sequence | MATPVSESTV GTADLETVRE LAQQLRVDSV RSSTSAGSGH PTSSMSAADV LAVLVARHLR YDWDNPADPA NDHLIFSKGH ASPLIYSIFK AVGVVGDDEL INGYRRFGER LQGHPTPVLP WVDVATGSLG QGLPDGVGVA LAGKYLDKVP YRVWVICGDS EMAEGSVWEA LDKASYYNLS NLIAIVDVNR LGQRGPTELG WDLDTYARRV ESFGARAVVV DGHDIAAIDA VLADAEDVTR PTVILARTRK GEGFSETADV EGWHGKPFPA DMADRAIAEL GGLRDLRVRG PVPPADLPRP APVERPVITL PTYDLGAKVA TRRAYGQALA ALGARTDVVA LDAEVSNSTY SEDFAKIYPD RFFEIFIAEQ QLVAAAVGLS VRGYVPFAST FAAFFSRAYD FIRMAAISQA DIRLSGSHAG VEIGADGPSQ MALEDIAMMR AVGGSTVLYP SDATSAAKLV EAMADLPGVS YLRTTRGAYP VLYGPDEPFA PGGSKVLRRS NADTVTLIGA GVTLHESLAA ADTLVAEGIA ARVIDLYSVK PVDAATLADA AQATGGRIVV TEDHYPEGGL ASAVLEALSD GGVPLSVRHL AVRSLPGSGT SQELLDAAGI SARHIVEAAR DLLR
|
| |