Gene Franean1_4659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4659 
Symbol 
ID5673002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5559360 
End bp5561327 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content66% 
IMG OID641243517 
Productglycoside hydrolase family protein 
Protein accessionYP_001508933 
Protein GI158316425 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.398765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.788478 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCCGTT CATCCTTCAA TGACGGCTGG TCCTTCCGCC GGAAGACCGA CGCGTTTCTG 
GAGATCCTCG GCCGGGCCGA CGCCCCCTGG CAGGACGTCC GGCTGCCCCA CGACGCGATG
GTGGGGCTCG AACGCGACAG GGCCGATACC GAAGCGGGTC AGCGCGGCTA CTACCCAAGC
GGGGAATACC AGTACAGAAA GACGTTCTTC GCGCCGGAGG AATACCGGAA CCGGCGAGTA
ACCCTTGAGT TCGAGGGTGT CTACCGGAAT GCCCGGGTGT TCATCAACGG CGATTTCGCG
GCGCAGCGCG CCTACGGCTA CTCCAATTTC TACGTCCACA CGGACCATCT ACTGAAGTAC
GGCAGCAACA ACGAGATCGT CGTGGAGGCG CGCAGTGGCA ACGACACCCG CTGGTACTCG
GGTGGCGGCG TCTACCGCAA CACGAAACTG ATCGTCGGCG ATCTGGTTCA CCTCGCACTG
GACGGGGTGA GGATCACCAC TCCGGCCGTG GACGACGACG GCACGGTCGT CGCCGGGGAC
ACCGCTCCGG TCACCACGTT CATCGGCGAC ACACTCACCC TGCGCCAGCG GCTGCCCGTG
CCACGACCGG AGCTGTGGGG GGTGGAGCGA CCCTATCTCT ACCTCTGCCG GACCAGCGTC
ACGGCGGACG GTGAGCTACT GGACCAGGAG ACCACGCGCT TCGGCATCCG TACGCTCACC
GTTGATCCGC TGCGGGGCCT GCGCATCAAC GGCGAGACGG TGAACCTGCG CGGCGCCTGC
ATCCACCACG ACAACGGGGT GATCGGCGCC GCGACCATCG ACCGCGCCGA ACAGCGCAGG
GTCGAGATCC TCAAACAGGC CGGTTTCAAC GCCATCCGCA GCGCCCACCA TCCGATGAGC
AAGGCGCTCC TCGACACCTG CGACCAGCTC GGCGTTCTCG TGATCGACGA GCTGTTCGAC
GCGTGGACCC GCTCCAAGGT GAGCCAGGAC TACGCCCTCG ACTTCGCCAC CTGGTGGGAG
TCCGACGTGC GGGCGATGGT CGACAAGGAC TTCAACCATC CCAGTGTCAT CCTCTACTCG
ATCGGGAACG AGATCCCGGA GACGGGCACC GCCGCCGGCG CGGCGATCAG CCGCCGGCTC
GCCGAGACGA TCCGCTCGAT CGACAGCACC AGGTTCGTCA CGAACGGCGT CAACGGACTC
CTCGCCGGCG GCCCCGACCT GCTCGCCTCC TTCGCCGGCG AATCCCGGAA GAAGGACAGC
GAGAGCCTCG ACGTCAACGG CTTCATGACG CGGTTCCGTG AGTTCATGCC AATCCTCATG
AGCTCCGAGG TCGTCGGTTC GAAGACCGCC GAGTCGATAG CCTGCCTCGA TGTCGCCGGC
TACAACTACC TCGAATCACG GTACGGGCAG GACGGGGCGC AGTTCCCCAA CCGGGTGATC
GTCGGAACCG AGACCTATCC CACCGACATC GACACGAACT GGCGGCTCGA GATCGGTTTC
GTCTACCCCG ACCTCGAGGT CGTCAAGCAG TACGTCGACA TCGACCACGG CGACTACCAG
GCCACCTTCC AACCGGCCAT CCTGGCCGGC GCCGCCGCTC TCGGGCTCGT CGTCGACTTC
ACCGAGCCGG CCAACGCCGC GTGCATCGCC ACCCTTGAGG CCGCGCTGCC AGCACTCACC
GGCAAGCTGG TCGACCCGGC GACCGTGCCC TCAGGCCAGC CCACCCCCGG AACATCCGAA
AGCGCCGCCT GCCGCTACCT GACCCTGTTC CAGGCGATCG CCGAGAAGGC CGGCAAGGAC
CTCACCTACC AGTCCTTCCA GCAGGCCGCG TTCTCTCTCG GTTCCTTCCA GGTCCCCACC
CTGCGGGACA AGGCCACCTA CAGCCGCGAG ACACCCCACG GCGCCGTCCC CCCGCGCCTG
TTCACGTTCG ATCCCGCGAA GAAGAACTTC TTCCCCGCCG GGAGCTGA
 
Protein sequence
MIRSSFNDGW SFRRKTDAFL EILGRADAPW QDVRLPHDAM VGLERDRADT EAGQRGYYPS 
GEYQYRKTFF APEEYRNRRV TLEFEGVYRN ARVFINGDFA AQRAYGYSNF YVHTDHLLKY
GSNNEIVVEA RSGNDTRWYS GGGVYRNTKL IVGDLVHLAL DGVRITTPAV DDDGTVVAGD
TAPVTTFIGD TLTLRQRLPV PRPELWGVER PYLYLCRTSV TADGELLDQE TTRFGIRTLT
VDPLRGLRIN GETVNLRGAC IHHDNGVIGA ATIDRAEQRR VEILKQAGFN AIRSAHHPMS
KALLDTCDQL GVLVIDELFD AWTRSKVSQD YALDFATWWE SDVRAMVDKD FNHPSVILYS
IGNEIPETGT AAGAAISRRL AETIRSIDST RFVTNGVNGL LAGGPDLLAS FAGESRKKDS
ESLDVNGFMT RFREFMPILM SSEVVGSKTA ESIACLDVAG YNYLESRYGQ DGAQFPNRVI
VGTETYPTDI DTNWRLEIGF VYPDLEVVKQ YVDIDHGDYQ ATFQPAILAG AAALGLVVDF
TEPANAACIA TLEAALPALT GKLVDPATVP SGQPTPGTSE SAACRYLTLF QAIAEKAGKD
LTYQSFQQAA FSLGSFQVPT LRDKATYSRE TPHGAVPPRL FTFDPAKKNF FPAGS