Gene Franean1_3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3693 
Symbol 
ID5672059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4371224 
End bp4372819 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content72% 
IMG OID641242576 
ProductSMP-30/gluconolaconase/LRE domain-containing protein 
Protein accessionYP_001507996 
Protein GI158315488 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3386] Gluconolactonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0345166 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTCGC AAGGGGCGCG CTACAGCGCC GCATCCATCG GCCTCGCCGA AGGATGGAGC 
CTGGACCGGC TCACCTCGCC GAGCAGACTT TTCGGCGCGA ACGGATTACG CACCGGCCCG
GACGGCCGCG TCTACGTGGC GCAGGTGACC GGCAGCCAGA TCAGCGCACT CGACGTCGAG
ACCGGCCAGC TGGAGACCAT CAGCGCGAAG GGCGGCGCGA TCATCGCCCC CGACGACGTG
GCCTTCGGCC CGCAGGGCGA CCTCTACGCC ACCGAGGTCA TGGACGGCCG GGTGAGCGTG
CGCGGCGCCG ACGGCACCAC CCGGGTGCTG CGCGACGACC TCCCGAGCGT GAACGGCATC
ACCGTCCACC AGGGCCGGCT GTTCGTCAAC GAGTGCCGGG TCGGCGGGCG CCTGATGGAG
CTCGACCTCA ACGGCGGCGC GCCCCGCGTG CTGCTCGAGG ACCTGGCGAT GCCCAACGCC
ATGGAGGTCG GCCCCGACGG TCTCCTGTAC TACCCGGTGA TGGGGACGAA CGACATCTGG
CGGATCGACC CGGAGGGCGG CGAGCCGCAG CGGGTCGTGG GGGATCTCGG CGTCCCGGAC
TCGCTGAAGT TCGACCCCGA GGGCTTCATC GTCTCCACCC AGGTCGCCTC CGGCGAGGTG
CTGCGGATCA ACCCGCGCAA CGGCGAGCGC ACCGTGCTGG CCGCCCTCAC GCCTGGCCTG
GACAACTGCA CCTTCGTCGG CGACCGGCTG TTCGTCTCGA ACTTCACCGG GCAGATCACC
GAGATCCTCG GCGGCGGGGC GACCCGGGAG ACGCTGCCGG GCGGGCTGAA CTGGCCGCTG
GACCTGACCG TCGGCGAGGA CGGCAACCTC TACATCGCCG ACGGGACGTA CTTCTACCGC
CTGCGCCCCG GGGGCCGGCC GGAGACGCTC GGCATGCTCT TCAGCCCGGG CTACCCGGGC
TTCGTCCGCG GGCTCACCGC CGTCGGCGCG GGCGAGTTCA TCGTCACCAC CTCGGGCGGG
CAGGTCACGC GCTACCGGCC GGCGAGCGCG GAGAGCGAGG TGCTCGCCGA CGAGCTCGAC
CAGCTCTACG GCGTGGCGGT CGCCCCGGGC GGCGACATCG TGGCCGCGGA GCTCGGCAAG
GGCCGCGTGC TCTCCATCCG TTCCGGACGG GTCGAGGAGC TGGCGTCCGG GCTGAACGAT
CCGCTGGGTG TCGCGTTCGG CGCGGACGGC ACGCTCTTCG TCTCCGAGTC GGGCGCCGGG
CGGGTCGTCA CGATCTCCGG CTTCCGCGTC GACACGGTGG TCGACGGCCT GGAACAGCCG
CAGGGAATCC TCGTGCGCGA CGGCCAGCTC CTCATCGTCG ACGCCGGCAC GAAGCAACTG
ATCAGCGTCG ACCTCGGCAC CGGGGCCCGG CAGACCATCG CCCGTGACCT TCCGGTGGGC
GCCCCGCCCG GCGTGACACC CAAGCCCCTG CGCGGGCTGC CGCCCTTCTC CGGGCCGCAG
GGCCCGTTCG CGGGCATCGC CGCCGGACCG GACGGCACGC TCTACCTATC TGCTGACGCC
GAGGGCAGCG TGCTCGCGCT GCGGCGGGAG GGCTGA
 
Protein sequence
MLSQGARYSA ASIGLAEGWS LDRLTSPSRL FGANGLRTGP DGRVYVAQVT GSQISALDVE 
TGQLETISAK GGAIIAPDDV AFGPQGDLYA TEVMDGRVSV RGADGTTRVL RDDLPSVNGI
TVHQGRLFVN ECRVGGRLME LDLNGGAPRV LLEDLAMPNA MEVGPDGLLY YPVMGTNDIW
RIDPEGGEPQ RVVGDLGVPD SLKFDPEGFI VSTQVASGEV LRINPRNGER TVLAALTPGL
DNCTFVGDRL FVSNFTGQIT EILGGGATRE TLPGGLNWPL DLTVGEDGNL YIADGTYFYR
LRPGGRPETL GMLFSPGYPG FVRGLTAVGA GEFIVTTSGG QVTRYRPASA ESEVLADELD
QLYGVAVAPG GDIVAAELGK GRVLSIRSGR VEELASGLND PLGVAFGADG TLFVSESGAG
RVVTISGFRV DTVVDGLEQP QGILVRDGQL LIVDAGTKQL ISVDLGTGAR QTIARDLPVG
APPGVTPKPL RGLPPFSGPQ GPFAGIAAGP DGTLYLSADA EGSVLALRRE G