Gene Franean1_5713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5713 
Symbol 
ID5674039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6932639 
End bp6934825 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content73% 
IMG OID641244566 
Productsqualene-hopene cyclase 
Protein accessionYP_001509969 
Protein GI158317461 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00366399 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.682712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGA CCTCGGACCA GTCCTCGGCT GCCCCGACGG CCGCGGCGCA GAGCCCGAAG 
ATCCCGAACC CGTCGGTGGC ACGGCCGTCG GCGGACGCCG GGTCCTTCGA GACCGCCGGC
GCAGTGCGAA CCGACTCGGT GTCGATCGAC TCGGTGTCGA CCGGCACGCC GGTCGACCCG
GTGGTGGGCG CGATGCGCCG TGGCCGCGAC CATCTGCTCT CCCTGCAGGC TGAGGAGGGC
TGGTGGAAGG GCGAGCTGGA GACCAACGTC ACCATGGACG CCGAGGACCT CATGCTTCGG
CAGTTCCTCG GCATCCTGAC CCCGTCGACG GCCACTGAGA CCGGACGCTG GATCCGTTCC
CAACAGCTCT CCGACGGCGG CTGGGCTACC TTCTACGGCG GCCCGTCCGA CCTTTCGACC
ACCATCGAGG CCTACGTCGC GCTGCGGCTC GCCGGGGACG ACCCGGACGC CCCGCACATG
CGCTCCGCCG CCGAGTGGGT GCGCTCCGCG GGCGGCATCG CCGCCTCCCG GGTGTTCACC
CGGATCTGGC TGGCGCTGTT CGGCGAGTGG TCCTGGGACG ACGTCCCGGT GCTGCCGGCG
GAGATGACCT TCCTTCCGCC GTGGTTCCCG TTGAACATCT ACGACTTCGC CTGCTGGGCC
CGCCAGACCG TGGTGGCGCT GACGATCGTC GGTTCGCTGC GGCCGGTGCG CTCGTTCGGG
TTCACCCTGG ACGAACTGCG TGTCCAGGCG CCCAAGGCGA CGAAGGCGCC GCTGCGGAGC
TGGGCCGGCG CGTTCGAGCG GCTCGATTCC GTGCTGCACC GCTACGAGAA GCGGCCCTTC
CAGCCGCTGC GCCGGCTCGC GCTGCGCCGC GCCGCCGAAT GGGTGATCGC CCGCCAGGAG
GCGGACGGCT GCTGGGGCGG CATCCAGCCG CCGATGGTGT ACTCGATCAT GGCCCTGCAT
CTCATGGGCT ACCCCCTGAA CCACCCGGTG ATCTCGATGG CGTTCCGCGC CCTCGACCGG
TTCACGATCC GCGAGGAGAC ACCGGAGGGC ACGGTGCGCC GTATCGAGGC GTGCCAGTCG
CCGGTCTGGG ACACGGCGCT GGCCGTCGTC GCGCTCGCGG ACGCCGGTCT GGGCGGTGAC
CACCCGGCTA TGGTCCGGGC CGGTCGCTGG CTCGCCGACG AGGAGGTGCG CGTCGCCGGT
GACTGGGCGG TGCGCCGTCC CACCCTCGCG CCGGGCGGCT GGGCGTTCGA GTTCGACAAC
GACTTCTACC CGGACGTCGA CGACACCGCC GAGGTGGTCA TCGCCATCCG CCGCCTGCTC
GGCGACGGTC ACGGCCCGGT GGACCACTCC GACGGCTCCG GCCCCGGCTC GGCCGCGGCC
ACCGCGGCCT CCGCCGCGGC GGAGGCCGCG GTGGCCGCGG CCGGCACGAT CGCCGCCGCG
GATCCGGAGC TCGCCGCCCG GCTGCGCGCC GCCGCGGAGC GGGGCGTCGA CTGGTCGGTG
GGCATGCGCT CGTCGAACGG TGCCTGGGCG GCGTTCGACG CCGACAACGT GCGCACCCTG
GTCAGGAAGA TCCCATTCTG CGACTTCGGC GAGGTGGTCG ACCCACCGTC GGCGGACGTC
ACCGCGCACA TGGTCGAGAT GCTCGCCCTG CTGGGTCGCT CCGACCACCC GATCACCCAG
CGCGGGGTCC GCTGGCTGCT GGACAACCAG GAGGCCGGCG GGTCGTGGTT CGGTCGCTGG
GGCGTGAACC ACGTCTACGG CACCGGCGCG GTCGTGCCCG CGCTGATCTC CGCGGGTGTA
GACGCGGAGC ACCCGGCGAT CGTCTCCTCG ATGCACTGGC TCGTCGAGCA CCAGACGCCG
GAGGGCGGCT GGGGCGAGGA CCTGCGCTCC TACCGCGACG ACGAGTGGAT CGGGCGCGGC
GAGCCGACGG CCTCGCAGAC CGCCTGGGCG CTGCTGGCGC TGCTGGCCGC CGAACCGGCG
TCCGGGACCG CCGAGTGGGA GGCGGTCGAA CGCGGCGTGC GCTGGCTCTG CGACACCCAG
CGCCCCGACG GCACCTGGGA CGAGCCGCAG TTCACCGGCA CGGGCTTCCC CTGGGACTTC
TCCATCAACT ATCACCTGTA CCGGCTGGTC TTTCCCGTGA CGGCACTCGG TCGGTACGTG
ACCCTCACCG GCAGGTCGAC GTCATGA
 
Protein sequence
MSLTSDQSSA APTAAAQSPK IPNPSVARPS ADAGSFETAG AVRTDSVSID SVSTGTPVDP 
VVGAMRRGRD HLLSLQAEEG WWKGELETNV TMDAEDLMLR QFLGILTPST ATETGRWIRS
QQLSDGGWAT FYGGPSDLST TIEAYVALRL AGDDPDAPHM RSAAEWVRSA GGIAASRVFT
RIWLALFGEW SWDDVPVLPA EMTFLPPWFP LNIYDFACWA RQTVVALTIV GSLRPVRSFG
FTLDELRVQA PKATKAPLRS WAGAFERLDS VLHRYEKRPF QPLRRLALRR AAEWVIARQE
ADGCWGGIQP PMVYSIMALH LMGYPLNHPV ISMAFRALDR FTIREETPEG TVRRIEACQS
PVWDTALAVV ALADAGLGGD HPAMVRAGRW LADEEVRVAG DWAVRRPTLA PGGWAFEFDN
DFYPDVDDTA EVVIAIRRLL GDGHGPVDHS DGSGPGSAAA TAASAAAEAA VAAAGTIAAA
DPELAARLRA AAERGVDWSV GMRSSNGAWA AFDADNVRTL VRKIPFCDFG EVVDPPSADV
TAHMVEMLAL LGRSDHPITQ RGVRWLLDNQ EAGGSWFGRW GVNHVYGTGA VVPALISAGV
DAEHPAIVSS MHWLVEHQTP EGGWGEDLRS YRDDEWIGRG EPTASQTAWA LLALLAAEPA
SGTAEWEAVE RGVRWLCDTQ RPDGTWDEPQ FTGTGFPWDF SINYHLYRLV FPVTALGRYV
TLTGRSTS