Gene Francci3_4347 details


Gene Information

Locus tag: Francci3_4347
Symbol:
ID: 3907319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5188732
End bp: 5191146
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 69%
IMG OID: 637881678
Product: glycoside hydrolase family protein
Protein accession: YP_483422
Protein GI: 86743022
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
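
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the Start bp and End bp values are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information fields.
start_bp = 5188732
end_bp = 5191146

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 2415 bp
print(protein_length, "aa")   # 804 aa
```

Under these assumptions the results agree with the reported Gene Length (2415 bp) and Protein Length (804 aa).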


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.138652
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACA AAGCGTCCTA TCTGATCGAG TCGTGGTCCA TCCAGGAGTC CGGTCTCGAC 
ATCTCCGATC TGGGCCGCTC CGAGTCATTG TTCGCCCTGT CCAACGGCCA TATCGGGCTC
CGGGGAAACC TCGACGAGGG CGATCCGCAC GGCCTGCCGG GCACCTACCT GAACTCGGTC
CACGAGCTGC GGCCCCTGCC CTATGCCGAG GCCGGCTACG GATATCCAGA GTCCGGCCAG
ACGGTCATCA ACGTCACCAA CGGCAAGGTG ATCCGTCTAC TTGTCGACGA CGAGCCGTTT
GACATCCGCT ATGGCGATCT GCGGTCGCAT CAGCGGGCGA TCGACTTCCG GGAGGGCGTG
CTCCGGCGGG ACGTGGAATG GGTCTCGCCG GCGGGGCAGA CTATCCGGGT GCACTCCGAG
CGGCTGGTGT CGTTCTCGCA GCGTTCCATC GCGGCCGTGT ACTACGAGAT CGAGCCGGTG
GGCGACCCGG CGCGGGTCGT CATCCAGTCG GAGCTGGTGG CCAACGAGCA GCTGCCGGCC
TGGCACGGTG ACCCGCGGGC CGCGTCCGCG CTGGAGTCGC CGCTGCGATC CGAGCGCCAT
CGGGCGAGCG GCACGATGGT TGAGCTCGTC CACATCACCC ACCGCAGCGA GATCCGTGTC
GCCGCGGCCA TGGACCACGA GTTTGCCGGG CCGGACTCGC TGGGTGTCTC CTCGGAGAGC
GAGCCGGACG CCGGCCGGGT GACGGCCACG GCGGTACTCG CCCCGGGCGA GAAGCTGCGG
ATGGTCAAGT TCATCGCCTA CGGCTGGTCG GAGCAACGGT CCCTGCCGGC CCTCAGAGAC
CAAGCGGCCG CGGCGCTGGT CGCAGCGCGG CAGACCGGGT GGCCCGGTCT CGTCGCCGAG
CAGCGGCAGT ACCTGGATGC CTTCTGGAAG CGGGCCGACG TCGAGGTCGA CGGCGATCCG
GAGGTGCAGC AGGCCGTTCG CTTCGCGCTG TTCCACGTCC TGCAGGCCGG ATCGCGGGCC
GAGCGGCGGG CGATCCCGGC CAAGGGCCTG ACCGGCCCCG GCTACGACGG GCACGCCTTC
TGGGACTCCG AGTCGTACGT GCTCCCCGTC CTGACGTACA CCGCGCCCGA CGCGGCCGCC
GACGCGCTGC GCTGGCGGTA CGCGACTCTG CCGCTGGCCA GGGAACGAGC CGATCTGCTC
AATCTGAAGG GCGCCGTCTT CCCCTGGCGC ACCATCCACG GAGAGGAGTG CTCCGGATAC
TGGCCAGCCG GGACGGCCGC CTTCCACGTC AACGCCGACA TCGCCGACGC GGTCGCCCGC
TACGTCACCA TCACCGGGGA CGAGCGGTTC GAGCGCACGG TGGGGCTGGA GATCCTGATC
GAGACCGCCC GGCTGTGGCG GTCGCTGGGA CATCACGACC TCGACGGCCG CTTCCGGATC
GACGGTGTCA CGGGCCCGGA CGAGTACTCG GCCATCGCCG ACAACAACGT CTACACGAAC
CTGATGGCGC AGCGGAACCT CGTGGCCGCG GCGGACGCCG CCCGCCGCCA TCCCGACCGG
GCCGCCGAGC TGGGCGTGGA CGCCGAGGTC ACGGCGTCCT GGCGGGACGC GGCCGAGAAC
ATGTTCATCC CCTACGACCC CCATCTCGGC GTCCATCCGC AGTCCGAGGG ATTCACCGAA
CACCAGGTCT GGGACTTCGC GAACACCGCG CCCGAGCAGT ACCCTCTGCT GCTGCACTTC
CCGTACTTCG ACCTCTACCG CAAGCAGGTC ATCAAGCAGG CGGACCTGGT CCTGGCGATG
CAGCGCCGGG GTGACGCCTT CACGATGGAC GAGAAGATCC GCAACTTCGC CTACTACGAA
GCTCTGACGG TTCGGGACTC GTCACTGTCG GCATGCTCCC AGGCGGTGCT GGCCGCTGAG
TGTGGGCACC TGTCGCTCGC CCATGACTAC CTGCGCGAAG CCGCCCTGAT GGATCTGCAT
GACATCGAGC ACAACACCGG AGATGGGCTG CACATGGCCT CCCTCGCGGG AAGCTGGATC
GCCCTCGTCG AGGGCTTCGG TGGGCTGCGT GACGGCGGTG AGGTGATCTC GTTCGCGCCG
CGGTTGCCCG AGGGACTGAC CAGGCTGGCC TTCGGGCTCT GTGTCCGCGG GCGGCACCTG
CGGGTGGAGG TCTCCGACTC GACGGCCGTC TACACCGTCC CCGCGGGGCC GGCGATGACG
TTGCTGCACC ACGGCAAGAC CGTCCGGGTG GTTCCCGGCC AGCCGGTCAG CATGGAGGTC
CCGCCGACGC CCGTCCTGGA ACGGCCGAAG CAGCCCTCGG GCCGGGAGCC GCTGCGCTTC
CTGCGCCTGC CCGGGACCGG CGGCCGTATC GGTCCTACCG GCCCTAGCGG TTCCGTCAAC
AGCATCGACG GCTGA
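
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how a GC percentage such as the one in the GC content field could be computed from text in this layout. The helper names `clean_sequence` and `gc_percent` are illustrative, and the database's own rounding method is not known.

```python
def clean_sequence(text: str) -> str:
    """Join grouped sequence text into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in the sequence text."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence block between the triple quotes.
# Only the first printed line is used here, for illustration.
first_line = """
ATGATCGACA AAGCGTCCTA TCTGATCGAG TCGTGGTCCA TCCAGGAGTC CGGTCTCGAC
"""
print(len(clean_sequence(first_line)), "bases")
print(round(gc_percent(first_line), 1), "% GC in this fragment")
```

A single line is not representative of the whole gene; the complete sequence block is needed to compare against the reported value.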
 
Protein sequence
MIDKASYLIE SWSIQESGLD ISDLGRSESL FALSNGHIGL RGNLDEGDPH GLPGTYLNSV 
HELRPLPYAE AGYGYPESGQ TVINVTNGKV IRLLVDDEPF DIRYGDLRSH QRAIDFREGV
LRRDVEWVSP AGQTIRVHSE RLVSFSQRSI AAVYYEIEPV GDPARVVIQS ELVANEQLPA
WHGDPRAASA LESPLRSERH RASGTMVELV HITHRSEIRV AAAMDHEFAG PDSLGVSSES
EPDAGRVTAT AVLAPGEKLR MVKFIAYGWS EQRSLPALRD QAAAALVAAR QTGWPGLVAE
QRQYLDAFWK RADVEVDGDP EVQQAVRFAL FHVLQAGSRA ERRAIPAKGL TGPGYDGHAF
WDSESYVLPV LTYTAPDAAA DALRWRYATL PLARERADLL NLKGAVFPWR TIHGEECSGY
WPAGTAAFHV NADIADAVAR YVTITGDERF ERTVGLEILI ETARLWRSLG HHDLDGRFRI
DGVTGPDEYS AIADNNVYTN LMAQRNLVAA ADAARRHPDR AAELGVDAEV TASWRDAAEN
MFIPYDPHLG VHPQSEGFTE HQVWDFANTA PEQYPLLLHF PYFDLYRKQV IKQADLVLAM
QRRGDAFTMD EKIRNFAYYE ALTVRDSSLS ACSQAVLAAE CGHLSLAHDY LREAALMDLH
DIEHNTGDGL HMASLAGSWI ALVEGFGGLR DGGEVISFAP RLPEGLTRLA FGLCVRGRHL
RVEVSDSTAV YTVPAGPAMT LLHHGKTVRV VPGQPVSMEV PPTPVLERPK QPSGREPLRF
LRLPGTGGRI GPTGPSGSVN SIDG
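
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below builds the codon table in the usual NCBI ordering (bases T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so this table applies to internal codons. The function names are illustrative, and a first codon other than ATG would need special handling that is not included here.

```python
from itertools import product

# Codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the printed layout."""
    return "".join(text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = clean_sequence(dna_text)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First printed line of the gene sequence.
first_line = "ATGATCGACA AAGCGTCCTA TCTGATCGAG TCGTGGTCCA TCCAGGAGTC CGGTCTCGAC"
print(translate(first_line))  # MIDKASYLIESWSIQESGLD
```

The output matches the first twenty residues of the protein sequence above (MIDKASYLIE SWSIQESGLD). Passing the full gene sequence block to `translate` allows the same comparison for the entire protein.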