Gene Franean1_0241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0241 
Symbol 
ID5668666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp293991 
End bp296393 
Gene Length2403 bp 
Protein Length800 aa 
Translation table11 
GC content71% 
IMG OID641239170 
ProductKojibiose phosphorylase 
Protein accessionYP_001504614 
Protein GI158312106 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.532549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGTCA GGCCCTCCTA CCCCATCGAG TCCTGGTCGC TGACCGAGCA CGGGCTCGAC 
ATCGACGACC TGGCCCGCTC CGAGTCGCTG TTCTCGCTGT CCAACGGGCA CGTGGGCATG
CGCGGGAACC TCGACGAGGG CGATCCGCAC GGGCTGCCCG GTACCTACCT GAACTCCGTC
CACGAGCTGC GGCCGCTTCC GTACGCCGAA GCGGGCTACG GGTACCCGGA GTCCGGGCAG
ACGGTCATCA ACGTCACGAA CGGCAAGATC GTCCGCCTGC TGGTCGACGA CGAGCCGTTC
GACGTCCGCT ACGGGGATCT TCTCGCGCAC ACCCGCACGA TCGACTTCCG CGAGGGGGTG
CTGCGCCGGG AGGCCGACTG GGTCTCCCCG GCCGGGCAGC GGGTCAGGAT CCGCACCCAG
CGTCTCATCT CCTTCTCCCA GCGCTCCGCC GCCGCGATCC ACTACGAGAT CGAACCGGTG
GGCGACACCG CGCGGATCGT CATCCAGTCC GAGCTGGTCG CCAACGAGCA GCTTCCGGGG
CGCAGGGGCG ACCCGCGCGC CGCCGCCGTC CTGGAGTCGC CGCTCATCTC CGAACGGCAC
CGCGCCCGCG AGACCATGGT CGAGCTCGTC CACCGCACCC GGCACAGCGA CATCCGGGTC
GCCGCGGCGA TGGACCACAT CTTCGACGGC CCGCGTTCGC TCGGCGTCAC CTCGGAGAGC
GAGCCCAACA CCGGCTGGGT CACCGCGACG GCTGTCCTCA AGCCGGGCGA GACCCTGCGG
ATGGTCAAGT TCCTCGCCTA CGGCTGGTCC GAGCAGCGCT CCCTGCCGGC GCTGCGCGAC
CAGGCCACCG CCGCCCTCGT CGCTGCCCGC CAGACCGGCT GGGACGGCCT CGTCGCCGAA
CAGCGTGCGT ACCTGCAGGA CTTCTGGAAC CGGTCCGACG TCGAGGTCGA CGGCGACGCC
GAGGTCCAGC AGGCCGTCCG GTTCGCGCTC TTCCACGTCC TGCAGGCCGG CGCGCGGGCC
GAGCGCCGGG CCATCCCCGC GAAAGGGCTC ACCGGTCCCG GCTACGACGG TCACGCCTTC
TGGGACACGG AGAGCTACGT CCTGCCCGTC CTCACCTACA CCGCGCCGGC CGCCGCCGCC
GACGCGCTCC GCTGGCGGCA CTCGATCCTC CCGCTGGCCC GCGAGCGCGC CCAGTTGCTC
AACCTCGACG GCGCCGCCTA TCCCTGGCGC ACCATCCACG GCGAGGAGTG CTCCGGCTAC
TGGCCGGCCG GGACGGCCGC CTACCACGTC AACGCGGACA TCGCCGACGC CGTCCTGCGC
TACCTGTGGG CCACCGAGGA CGAGCAGTTC GAGCTCGAGG TGGGCCTGGA GATCCTCATC
GAGACGGCGC GGCTGTGGCG CTCGCTGGGC CACCACGACC TCTCCGGCCG TTTCCGCATA
GACGGGGTGA CCGGCCCGGA CGAGTACTCC GCGCTCGCCG ACAACAACGT CTACACCAAC
CTGATGGCCC AGCGGAACCT CATCGGGGCG GCCGACGCCG TCCGGCGACA CCCCGAACGC
GCGCGCGCCT TCGGGGTCGA CGCGGAGACC GCGGCGAACT GGCGCGATGC CGCCGACGAC
ATGTTCATCC CGTTCGACGA ACGCCTCGGG GTGCACCCGC AGTCCGAAGG ATTCACCGAG
CACCAGGTCT GGGACTTCGA ACGGACCAGG CCGGAGCAGT ACCCGCTGCT GCTGCACTTC
ACCTACTTCG ACCTTTACCG CAAGCAGGTC GTGAAACAGG CCGACCTGGT GCTGGCGATG
CAGCGGCGCG GCGACGCGTT CACCGCCGAG CAGAAGGCAC GCAACTTCGC CTACTACGAG
GCGCTCACCG TCCGCGACTC GTCGCTGTCC GCCTGCTGCC AGGCCGTCAT GGCGGCCGAA
TGCGGGCACA TGTCCCTCGC GCACGACTAC CTGCGCGAAG CCGCGTTCAT GGACCTGAAG
GACATCGAGC ACAACACCGG CGACGGCCTG CACATGGCCT CGCTGGCCGG CAGCTGGATC
GCGCTCGTCG AGGGCTTCGG CGGGCTGCGC GACACCGGTG AGCTGCTCTC CTTCAGCCCC
CGCCTGCCCG AGGGTCTCAG CCGGCTCGCC TTCGGCCTGC GCGTGCGCTC CCGCCAGCTC
CGCGTCGAGG TGCTCGAATC GTCCGCCACC TACACGGTGC TCGAGGGGGA GGCGATCACC
ATCCTGCACC ACGGCGAGAA GGCCCGCGTC TCACCCGACC AGCCCTGCAA ACTGGACGTC
CCACCGGTGC CGGCGCAGGA ACGCCCGAGG CAGCCCGCCG GCCGGGAGCC GCTGCGGTTC
CTGCACCACG GCAACGGCAC CGAGGTCACC CGCGCGGCCG ACCTCCACCC CGTCGACGGG
TGA
 
Protein sequence
MIVRPSYPIE SWSLTEHGLD IDDLARSESL FSLSNGHVGM RGNLDEGDPH GLPGTYLNSV 
HELRPLPYAE AGYGYPESGQ TVINVTNGKI VRLLVDDEPF DVRYGDLLAH TRTIDFREGV
LRREADWVSP AGQRVRIRTQ RLISFSQRSA AAIHYEIEPV GDTARIVIQS ELVANEQLPG
RRGDPRAAAV LESPLISERH RARETMVELV HRTRHSDIRV AAAMDHIFDG PRSLGVTSES
EPNTGWVTAT AVLKPGETLR MVKFLAYGWS EQRSLPALRD QATAALVAAR QTGWDGLVAE
QRAYLQDFWN RSDVEVDGDA EVQQAVRFAL FHVLQAGARA ERRAIPAKGL TGPGYDGHAF
WDTESYVLPV LTYTAPAAAA DALRWRHSIL PLARERAQLL NLDGAAYPWR TIHGEECSGY
WPAGTAAYHV NADIADAVLR YLWATEDEQF ELEVGLEILI ETARLWRSLG HHDLSGRFRI
DGVTGPDEYS ALADNNVYTN LMAQRNLIGA ADAVRRHPER ARAFGVDAET AANWRDAADD
MFIPFDERLG VHPQSEGFTE HQVWDFERTR PEQYPLLLHF TYFDLYRKQV VKQADLVLAM
QRRGDAFTAE QKARNFAYYE ALTVRDSSLS ACCQAVMAAE CGHMSLAHDY LREAAFMDLK
DIEHNTGDGL HMASLAGSWI ALVEGFGGLR DTGELLSFSP RLPEGLSRLA FGLRVRSRQL
RVEVLESSAT YTVLEGEAIT ILHHGEKARV SPDQPCKLDV PPVPAQERPR QPAGREPLRF
LHHGNGTEVT RAADLHPVDG