Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4431 |
Symbol | |
ID | 5672783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5292752 |
End bp | 5295556 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641243300 |
Product | hypothetical protein |
Protein accession | YP_001508716 |
Protein GI | 158316208 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4354] Predicted bile acid beta-glucosidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.402809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGAGA GAGAATTATC CGGCGCACAC CGTGGCGGTG GCGTCTCGCG CAGGACCTTC GTGCGCGGTA CAGCCGCGGC GGCCGGCTTG GCGGCGTTCG GCGGTGCGCT TGCTGCCTGC ACCGGAAGCG AACCCGCGGC TGCGGTGAAG CCGGCGGTTC ACCCTGTGCC CAAGGCGGCA TATGTCCGCA AGCTAGGCGC GGTGCCAGCG GGGTCCTGCA ACGCGCTCGG GGATCAGCAG TGCAGGACGG GAGTCCCGAA CCCCGTGTTC CGCAGTGTGG GGGCGGGCCT GTCCGTACCC GGCCTGGGCC TCCCGCTCGG GGGCGTCGGC GGCGGCTCTT TCATGCTCAA CCAGTGTGGG ACGTTCGGAC CGTGGAACAT GGGCGGGCAG CCGACCACGG AATTCTGGGA GATGCGCACC CTCGCGCAGG CGGCGTTCCA CGCACGTGAG GAGGTCGTCG GGGGCGGCGG GGGTGTGTCG GTCAGGACAC TGGCCGTACC GCACACCAAC ACCGCTCCTG ATCGCACTTT TGGCGACGTC CTGCCTGCCT GGAACACGCT GAAGCCCGGG GACGGCAGCT ACGCTGTACT CTTCCCGTTC GGGTACATGA CCTACAGCGG CTTCCAGTCA AAGGTCTCCA CCAAGATCTG GTCGCCGATC GTGGCCAACG AGGACGAGCG CACGTCGATG CCGGTGGCGT TCTTCGACAT GCTCATGAAC AATCCCACCG CCAAGCCGAT CAAGATTTCT GTCATGCTGA CGTTCCCGAA CGCCCCGGCG TTCGCCACGG GTTCGGTGCG GACTGGTCTT TACAGCAGGT TCGATCGCGA TTCCGCGTCG GGTATAGGCG GGGTGACCCT CGGCTCGGAC TCCCCGGAGA ACACGCCGGA CACTGTGACG TCCGAGTGGA CCATTGCCGC GCATCCATTC GCCGGGCAGA CACTCACCTA CTGCACCTCG TGGGACGGAT CGGGCGACGG GAGCGACATC TACGCCCCGT TCTCCGCGGC TGGCGCGGAC GGGAAGCTGC CGAACGGCGA CATCGACCAG TCGGCATCGG CCGGTGCGGT GGCCGTGGCG CTCACCCTGG AGCCCGACCA GACACAGACT GTTCGCTTCG CCCTTTCCTG GGACTTCCCG CAGGTCTATT ACGACGGCGA GGACGCGACG ACGAGGGCCG TCTGGATGCG TCGGTACACG GCGTTCCTCG GCGGAAAGAC ATCGCGGACC AACGACTACG TCCAGGATTC GTACCCCTTC AGGCAGGGTT TCACCATCGC CCGGAAGGAG CTGGCCCGGT ACGATGACTC TCTCGCCGCC GTCGAGTCGT GGTGGAAGCC GATCGCCGAG AATCCACAGG TTCCGCCGTG GCTACGCAAG GCTTCTCTGA ACGAGCTGTA TCACATGATC TTCAACGGTT CGTTCTGGGA GTCCGGGCTC GTCAGCAACA CGATGCCGAT GAGTGTCGAA GAGGGAACCT CGCCTCGTCT GGGATCCGCG ATCCCGGAAA CCCACATCTA TTTCCACGCC GATGGCGGGG ACGGTGGAGC GCAGACGAAC GAAGTCGACA TGGACAGCTC CGGCTACCTC GTTTTCGCGA AGCTGTTCCG CAGCTTGGAG CTGGGTCGTG TTCGCCCGCT GCTTCAGATG GTCAGGCAGA ATCCGCTGGG AATCGGGCGC GTGATTCAGC AGACCTTCAG AAGTTCGGGA CCCTACATCA CCCAGACGGC GTCATTCCAG AATCTCCCGT TCTCCAAGCC GCCCACAGCG GGAAACCCTC CCGCTCCGCC CACCAGAGAT CTCGGTGATC TGTTCGCGGA CGAAGCCGGA GATCCCTTTC GTGACTGCCC GCACAAGCTC ATCTACCGAA CGTACGCGCT GATCAAGTTC TACGACGACG ACGATCTGCT GGAATACGGA TACGCGCCGA TGCTGAAGGC GCTGACATAC TCGCAGTTCT TCCGTCCGAC CGGCTCCCAC CTGCCGGCAG ACCCGGCATC CAACAACCCG CCGAACACTA TGGATCAGGC TGTCGTGAAC GGTCACGGAA TCTACAACTG CGGGCTGTAT CTGCTGTCGC TTCAAATCCT CTCGACGCTG ACGCCCCAGG CTGCCCGACT CGGTGTTGAC GAGGCCACAC CTGAGATACA GAAGGAACTC GACGAGGAAC TGGCGGCAGC GAAGGAGGAA TTCGAGAGGA TCTTCTGGAA CCCGGCCACC GGTCGATACC GCTACTGCGA CGGCACCGGC GGGATCGGAG ATCGTACCGG TACTATCAGG GGTCGTTTCA AGCCGGTGCC GCCGCCGGAC GCCATCTGGC TCGAGTCCTT CGCCGGTCAG CTCGTCGCGA TGGAGCTCGG CCTGCCTGAC GTCGTCGATC TGGACCATGC CCGTACTCAC CTGAAGAACA CTCTGGATTC ATTCGTCCGG TTCAGGGATC CCGAAGGGAA CCTGATGGGT GGCCCGATTA TCCTCAAGCC GGACTTCAGT ATCTACCCTA GTTCGCTGAG GACCACAGAA ATCAATGAAG TGATTCCGGG TATCGCCTTC CTGGCCGCCG CGGGAGCATT CCGAATCGGC GCCAAGGTCA AGGACAAGGA CATCACGGAA AAGGCGTTGA AGCTCGGAGA GGGGTGTGCG CTCCAGATCT ACGACATCGA GAGCAACGGT TACGCCTTCG CAACCCCCGA GAGCTGGTTC GTGGACGACC ACCATATCTC CAGGTTTCCT GGATACACGC GAACCCGCTC TGTCTGGTCG CTCTACGACG CGGTCAGCGA AATCTCGGTG AAGAAACCGT CCTGA
|
Protein sequence | MGERELSGAH RGGGVSRRTF VRGTAAAAGL AAFGGALAAC TGSEPAAAVK PAVHPVPKAA YVRKLGAVPA GSCNALGDQQ CRTGVPNPVF RSVGAGLSVP GLGLPLGGVG GGSFMLNQCG TFGPWNMGGQ PTTEFWEMRT LAQAAFHARE EVVGGGGGVS VRTLAVPHTN TAPDRTFGDV LPAWNTLKPG DGSYAVLFPF GYMTYSGFQS KVSTKIWSPI VANEDERTSM PVAFFDMLMN NPTAKPIKIS VMLTFPNAPA FATGSVRTGL YSRFDRDSAS GIGGVTLGSD SPENTPDTVT SEWTIAAHPF AGQTLTYCTS WDGSGDGSDI YAPFSAAGAD GKLPNGDIDQ SASAGAVAVA LTLEPDQTQT VRFALSWDFP QVYYDGEDAT TRAVWMRRYT AFLGGKTSRT NDYVQDSYPF RQGFTIARKE LARYDDSLAA VESWWKPIAE NPQVPPWLRK ASLNELYHMI FNGSFWESGL VSNTMPMSVE EGTSPRLGSA IPETHIYFHA DGGDGGAQTN EVDMDSSGYL VFAKLFRSLE LGRVRPLLQM VRQNPLGIGR VIQQTFRSSG PYITQTASFQ NLPFSKPPTA GNPPAPPTRD LGDLFADEAG DPFRDCPHKL IYRTYALIKF YDDDDLLEYG YAPMLKALTY SQFFRPTGSH LPADPASNNP PNTMDQAVVN GHGIYNCGLY LLSLQILSTL TPQAARLGVD EATPEIQKEL DEELAAAKEE FERIFWNPAT GRYRYCDGTG GIGDRTGTIR GRFKPVPPPD AIWLESFAGQ LVAMELGLPD VVDLDHARTH LKNTLDSFVR FRDPEGNLMG GPIILKPDFS IYPSSLRTTE INEVIPGIAF LAAAGAFRIG AKVKDKDITE KALKLGEGCA LQIYDIESNG YAFATPESWF VDDHHISRFP GYTRTRSVWS LYDAVSEISV KKPS
|
| |