Gene Francci3_0823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0823 
Symbol 
ID3905100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp959770 
End bp961995 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content72% 
IMG OID637878156 
Productsqualene cyclase 
Protein accessionYP_479936 
Protein GI86739536 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.671451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGA CCTCGGACCC GTCCCCGGCT GCTCCAAAGG CCGCGAAGAG CTCGAAGCGC 
GTTAACATTC CCGCGCCCGC CACGCCCGAC GCTTACGGGA TCTCCCGGTC CTCGCCACCG
CTGTCCGGCG GTGGTGTGTC CGGCGGTGGT GTGTCCGGCG GTGGCGCGGC GACCGCCGAC
GGCACCCCGC CCACGACGCA GACCTCGGTC GATCCGGACC TGGCCGCGGC GATGACCGCG
GCCAACCAGG CTCGGGACCA CCTCCTCGGG CTGCAGTCCG AGGAGGGCTG GTGGAAGGGC
GACCTCGAGA CGAACGTCAC GATCGACGCC GAACACCTGT TCATGAAGCA GTTCCTCGGC
ATCCGCACCG AGGAGGAGAC CGAGCCGATC GCCCGGTGGG TGCGGTCCCA GCAACTCGCG
GACGGCGGCT GGGCCACGTA CTACGGCGGA CCGGCCGAGC TGTCCACCAC GGTCGAGGCG
TACATCGCCC TGCGTCTGGC CGGTGACGAG CCCGACGCCC CGCACATGGC GGCGGCGGCC
GCGCTCATCC GCTCCCAGGG CGGGGTGGCG GCCGCCCGGG TGTTCACCCG CATCTGGCTG
GCGACCTTCG GCGAGTGGTC CTGGGACGAC GTGCCCGTCC TGCCGCCGGA ACTGATCTTC
CTGCCGTCCT GGTTCCCGCT GAACGTCTAC GACTTCGGCT GTTGGGCCCG CCAGACGATC
GTCGCGTTGA CGATCGTCGG GTCGCTCCGG CCCGTGCGCG ACCTCGGCTT CAGCATCGAC
GAGATCAAGG TCGCGGCACC CGTGACGCCG CCGAAGCCGG CCCCGCTGCA CAGCTGGGAA
GGCGCCTTCG AGCGGTTGGA CGCGATCCTG CACCGCTATG AGCGCCGGCC GATCAAGGTG
CTGCGCACCC TGGCGCTGCG CCGGGCCACC GAGTGGGTCG TCGCCCGCCA GGAGGCCGAC
GGGTGCTGGG GGGGCATCCA GCCGCCGTGG ATCTACTCCG TCATGGCGCT GCATCTCATG
GGATACCCCC TCAACCACCC GGTGATCGCC ACGGCGTTCC GGGGAATGGA ACGCTACATC
ATCCGGCGCG AGACCCCGGA GGGGCCGACC GCCCAGATCG AGGCATGCCA GTCGCCGGTC
TGGGACACCG CGCTCGCGGT GGTCGCGCTC TCGGACGCCG GCGTCCCCGC CGACCATCCC
GCGATGGTGC GGGCCGGCCG CTGGCTGGTC GACGAAGAGG TCAGGGTGGC CGGGGACTGG
GCGGTCCGCC GTCCGGCGCT CGCCCCTGGC GGCTGGGCGT TCGAGTTCGA CAACGACTTC
TACCCGGATA CCGACGACAC CGCCGAGGTC GTGTTGGCGC TGCGTCGCCT GCTCGGTGGC
AGCCATGTCA CCCCGGGTGG CACCGTCACC CCGAGTGGCA GCGTCACCCC GGGCGGCACC
GCCGAGCTCT CGCCGGCCGC GCGTGACCGC GCCTCCCGCG GCCTCGCGGC GGTCGACCCG
CAGCTGGCCG GGGCGATGCG CGCGGCCGCG GCCCGTGGGG TCGACTGGAG CGTCGGCATG
CGGTCGTCGG ACGGGGCCTG GGGTGCCTTC GACGCCGACA ACGTGCGGAC GCTGACAGCG
AAGATCCCGT TCTGCGACTT CGGCGAGGTT GTCGACCCGC CGTCGGCGGA CGTCACCGCC
CACATCGTGG AGATGCTGGC CGACCTCGGC CGTTCCGACC ACCCGATCAC CCGGCGGGCG
GTGCAATGGC TGCTGGACAA CCAGGAACCG GGCGGCTCCT GGTTCGGCCG GTGGGGGATC
AACCACGTCT ACGGCACCGG CGCCGTGGTT CCGGCGCTGA TCGCCGCCGG CGTTCCCGCC
GACCACCCGG CGATCACAGC CGCGGTGCGC TGGCTGCTGG AGCACCAGTC GCCCGACGGT
GGATGGGGTG AGGACCCACG CTCCTACGAC GATCCGGCCT GGATCGGTCG GGGTGAGCTC
ACCGCCTCGC AGACCGCCTG GGCCCTGCTC GCGCTGCTGG CTGTCGACCC GCACAGCAAG
GCCGTCAAAC GGGGAGTGCG CTGGCTGTGC GAGACCCAGC GGCCGGATGG GACCTGGGAC
GAGCCCCAGT TCACCGGAAC CGGTTTCCCC GGTGACTTCT ACCTCAACTA CCACCTGTAC
CGGCTGGTCT TCCCGCTGAC CGCGCTCGGT CGGTACGTGA CCCTCACCGG GGTGGCCACG
CCATGA
 
Protein sequence
MSLTSDPSPA APKAAKSSKR VNIPAPATPD AYGISRSSPP LSGGGVSGGG VSGGGAATAD 
GTPPTTQTSV DPDLAAAMTA ANQARDHLLG LQSEEGWWKG DLETNVTIDA EHLFMKQFLG
IRTEEETEPI ARWVRSQQLA DGGWATYYGG PAELSTTVEA YIALRLAGDE PDAPHMAAAA
ALIRSQGGVA AARVFTRIWL ATFGEWSWDD VPVLPPELIF LPSWFPLNVY DFGCWARQTI
VALTIVGSLR PVRDLGFSID EIKVAAPVTP PKPAPLHSWE GAFERLDAIL HRYERRPIKV
LRTLALRRAT EWVVARQEAD GCWGGIQPPW IYSVMALHLM GYPLNHPVIA TAFRGMERYI
IRRETPEGPT AQIEACQSPV WDTALAVVAL SDAGVPADHP AMVRAGRWLV DEEVRVAGDW
AVRRPALAPG GWAFEFDNDF YPDTDDTAEV VLALRRLLGG SHVTPGGTVT PSGSVTPGGT
AELSPAARDR ASRGLAAVDP QLAGAMRAAA ARGVDWSVGM RSSDGAWGAF DADNVRTLTA
KIPFCDFGEV VDPPSADVTA HIVEMLADLG RSDHPITRRA VQWLLDNQEP GGSWFGRWGI
NHVYGTGAVV PALIAAGVPA DHPAITAAVR WLLEHQSPDG GWGEDPRSYD DPAWIGRGEL
TASQTAWALL ALLAVDPHSK AVKRGVRWLC ETQRPDGTWD EPQFTGTGFP GDFYLNYHLY
RLVFPLTALG RYVTLTGVAT P