Gene Information |
Locus tag | Francci3_4347 |
Symbol | |
ID | 3907319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5188732 |
End bp | 5191146 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637881678 |
Product | glycoside hydrolase family protein |
Protein accession | YP_483422 |
Protein GI | 86743022 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
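The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the record's own numbers fit both assumptions. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Francci3_4347.
start_bp, end_bp = 5188732, 5191146

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2415
print(protein_length(length_bp))  # 804
```

The strand field does not affect this arithmetic: for a minus-strand gene the start and end coordinates are still reported in ascending order on the replicon.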
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.138652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCGACA AAGCGTCCTA TCTGATCGAG TCGTGGTCCA TCCAGGAGTC CGGTCTCGAC ATCTCCGATC TGGGCCGCTC CGAGTCATTG TTCGCCCTGT CCAACGGCCA TATCGGGCTC CGGGGAAACC TCGACGAGGG CGATCCGCAC GGCCTGCCGG GCACCTACCT GAACTCGGTC CACGAGCTGC GGCCCCTGCC CTATGCCGAG GCCGGCTACG GATATCCAGA GTCCGGCCAG ACGGTCATCA ACGTCACCAA CGGCAAGGTG ATCCGTCTAC TTGTCGACGA CGAGCCGTTT GACATCCGCT ATGGCGATCT GCGGTCGCAT CAGCGGGCGA TCGACTTCCG GGAGGGCGTG CTCCGGCGGG ACGTGGAATG GGTCTCGCCG GCGGGGCAGA CTATCCGGGT GCACTCCGAG CGGCTGGTGT CGTTCTCGCA GCGTTCCATC GCGGCCGTGT ACTACGAGAT CGAGCCGGTG GGCGACCCGG CGCGGGTCGT CATCCAGTCG GAGCTGGTGG CCAACGAGCA GCTGCCGGCC TGGCACGGTG ACCCGCGGGC CGCGTCCGCG CTGGAGTCGC CGCTGCGATC CGAGCGCCAT CGGGCGAGCG GCACGATGGT TGAGCTCGTC CACATCACCC ACCGCAGCGA GATCCGTGTC GCCGCGGCCA TGGACCACGA GTTTGCCGGG CCGGACTCGC TGGGTGTCTC CTCGGAGAGC GAGCCGGACG CCGGCCGGGT GACGGCCACG GCGGTACTCG CCCCGGGCGA GAAGCTGCGG ATGGTCAAGT TCATCGCCTA CGGCTGGTCG GAGCAACGGT CCCTGCCGGC CCTCAGAGAC CAAGCGGCCG CGGCGCTGGT CGCAGCGCGG CAGACCGGGT GGCCCGGTCT CGTCGCCGAG CAGCGGCAGT ACCTGGATGC CTTCTGGAAG CGGGCCGACG TCGAGGTCGA CGGCGATCCG GAGGTGCAGC AGGCCGTTCG CTTCGCGCTG TTCCACGTCC TGCAGGCCGG ATCGCGGGCC GAGCGGCGGG CGATCCCGGC CAAGGGCCTG ACCGGCCCCG GCTACGACGG GCACGCCTTC TGGGACTCCG AGTCGTACGT GCTCCCCGTC CTGACGTACA CCGCGCCCGA CGCGGCCGCC GACGCGCTGC GCTGGCGGTA CGCGACTCTG CCGCTGGCCA GGGAACGAGC CGATCTGCTC AATCTGAAGG GCGCCGTCTT CCCCTGGCGC ACCATCCACG GAGAGGAGTG CTCCGGATAC TGGCCAGCCG GGACGGCCGC CTTCCACGTC AACGCCGACA TCGCCGACGC GGTCGCCCGC TACGTCACCA TCACCGGGGA CGAGCGGTTC GAGCGCACGG TGGGGCTGGA GATCCTGATC GAGACCGCCC GGCTGTGGCG GTCGCTGGGA CATCACGACC TCGACGGCCG CTTCCGGATC GACGGTGTCA CGGGCCCGGA CGAGTACTCG GCCATCGCCG ACAACAACGT CTACACGAAC CTGATGGCGC AGCGGAACCT CGTGGCCGCG GCGGACGCCG CCCGCCGCCA TCCCGACCGG GCCGCCGAGC TGGGCGTGGA CGCCGAGGTC ACGGCGTCCT GGCGGGACGC GGCCGAGAAC ATGTTCATCC CCTACGACCC CCATCTCGGC GTCCATCCGC AGTCCGAGGG ATTCACCGAA CACCAGGTCT GGGACTTCGC GAACACCGCG CCCGAGCAGT ACCCTCTGCT GCTGCACTTC CCGTACTTCG ACCTCTACCG CAAGCAGGTC ATCAAGCAGG CGGACCTGGT CCTGGCGATG 
CAGCGCCGGG GTGACGCCTT CACGATGGAC GAGAAGATCC GCAACTTCGC CTACTACGAA GCTCTGACGG TTCGGGACTC GTCACTGTCG GCATGCTCCC AGGCGGTGCT GGCCGCTGAG TGTGGGCACC TGTCGCTCGC CCATGACTAC CTGCGCGAAG CCGCCCTGAT GGATCTGCAT GACATCGAGC ACAACACCGG AGATGGGCTG CACATGGCCT CCCTCGCGGG AAGCTGGATC GCCCTCGTCG AGGGCTTCGG TGGGCTGCGT GACGGCGGTG AGGTGATCTC GTTCGCGCCG CGGTTGCCCG AGGGACTGAC CAGGCTGGCC TTCGGGCTCT GTGTCCGCGG GCGGCACCTG CGGGTGGAGG TCTCCGACTC GACGGCCGTC TACACCGTCC CCGCGGGGCC GGCGATGACG TTGCTGCACC ACGGCAAGAC CGTCCGGGTG GTTCCCGGCC AGCCGGTCAG CATGGAGGTC CCGCCGACGC CCGTCCTGGA ACGGCCGAAG CAGCCCTCGG GCCGGGAGCC GCTGCGCTTC CTGCGCCTGC CCGGGACCGG CGGCCGTATC GGTCCTACCG GCCCTAGCGG TTCCGTCAAC AGCATCGACG GCTGA
Protein sequence | MIDKASYLIE SWSIQESGLD ISDLGRSESL FALSNGHIGL RGNLDEGDPH GLPGTYLNSV HELRPLPYAE AGYGYPESGQ TVINVTNGKV IRLLVDDEPF DIRYGDLRSH QRAIDFREGV LRRDVEWVSP AGQTIRVHSE RLVSFSQRSI AAVYYEIEPV GDPARVVIQS ELVANEQLPA WHGDPRAASA LESPLRSERH RASGTMVELV HITHRSEIRV AAAMDHEFAG PDSLGVSSES EPDAGRVTAT AVLAPGEKLR MVKFIAYGWS EQRSLPALRD QAAAALVAAR QTGWPGLVAE QRQYLDAFWK RADVEVDGDP EVQQAVRFAL FHVLQAGSRA ERRAIPAKGL TGPGYDGHAF WDSESYVLPV LTYTAPDAAA DALRWRYATL PLARERADLL NLKGAVFPWR TIHGEECSGY WPAGTAAFHV NADIADAVAR YVTITGDERF ERTVGLEILI ETARLWRSLG HHDLDGRFRI DGVTGPDEYS AIADNNVYTN LMAQRNLVAA ADAARRHPDR AAELGVDAEV TASWRDAAEN MFIPYDPHLG VHPQSEGFTE HQVWDFANTA PEQYPLLLHF PYFDLYRKQV IKQADLVLAM QRRGDAFTMD EKIRNFAYYE ALTVRDSSLS ACSQAVLAAE CGHLSLAHDY LREAALMDLH DIEHNTGDGL HMASLAGSWI ALVEGFGGLR DGGEVISFAP RLPEGLTRLA FGLCVRGRHL RVEVSDSTAV YTVPAGPAMT LLHHGKTVRV VPGQPVSMEV PPTPVLERPK QPSGREPLRF LRLPGTGGRI GPTGPSGSVN SIDG
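The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied), and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs mainly in which codons may act as start codons. The helper names are illustrative only, and the snippet uses just the first 30 bases of the record rather than the full sequence.

```python
# Codon table built from the NCBI-style ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in 10-base blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and drop trailing stop codons.

    Simplification: an alternative start codon (e.g. GTG) is translated
    literally rather than as M; ambiguous bases raise KeyError.
    """
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First three 10-base blocks of the gene sequence from the record.
first_blocks = clean_sequence("ATGATCGACA AAGCGTCCTA TCTGATCGAG")
print(translate(first_blocks))  # MIDKASYLIE
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field, and `translate` should give the listed protein once its display spaces are removed.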