Gene Franean1_4758 details

Gene Information

Locus tag: Franean1_4758
Symbol:
ID: 5673100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5681677
End bp: 5683746
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 71%
IMG OID: 641243615
Product: squalene-hopene cyclase
Protein accession: YP_001509031
Protein GI: 158316523
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
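
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are both inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 5681677
end_bp = 5683746


def gene_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residues encoded by a CDS of n_bases, assuming one terminal stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2070 689
```

Both results agree with the listed Gene Length (2070 bp) and Protein Length (689 aa).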


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCAGG GATCCGACCG ACCTCCTGTC ACGTTGGTGA TGAACGATAT GCGAGGACCG 
GATATGAACG TTTCCGATAC CGTCAGTGTC ACCCGGGAAA GCATTCCCAC GCAGACCAGC
GCCGGCGACG CCACCGCACG CGACCTCACC GCGGCTGTCG GCAGCGAGCT CACCCGCGCG
CTACGCCTCG CCACCGACCA CCTGCTCGCG CTGCAGGACG GCACGGGCTG GTGGAAGTTC
GATCTCGAGA CGAACACGAG CATGGACGCC GAGGACCTCC TGCTGCGCGA GTACCTCGGT
ATCCGCACGA CCGAGGTGAC GGCGGCCTCG GCCCGGTTCA TCCGCTCCCG GCAGAGCGAC
GACGGATCGT GGCCGCAGTA CTTCGGCGGC CCGGGCGAGC TGTCCACCAC CGTCGAGTCG
TACATCGCCC TGCGCCTCGC CGGCGACGAC GCCTCCGCAC CGCACATGCT CAGCGCCGCG
ACCTGGGTCC GCGACCACGG CGGAGTTCCC GCGACCAGGG TGTTCACCCG GATCTGGCTG
GCGCTGTTCG GTTGGTGGCG CTGGGAGGAC CTGCCGGCGC TGCCCCCGGA GATCATGCTC
CTCCCCCGCC GCGCACCGCT GAACATCTAC TCCTTCGGGT CCTGGGCGCG CCAGACCCTG
GTGTCGCTGA CGGTCGTCTC CGCCCTCCGC CCGGTGCGGC CGGCGCCGTT CGACCTCGAC
GAGCTGTACC CGGACGGACC CGCCTCCGCC TGGTCCGGCG CCGGACCCTC CAACGTGCTC
GAGAGAATCA GCACGCGGTT CACCGCGAAA GAAATCTTCC TGGGTATCGA CCGACTGCTG
CACGTCTATC ACCGGCGCCC CGTTCGATCC ATGCGCAACC ATGCGCTGCG GGCCGCGGAG
CGGTGGATAA TCGCCCGCCA GGAGGCGGAC GGATGCTTCG GCGGAATTCA GCCACCCGCG
GTCTATTCGA TAATCGCGCT GCGGCTGCTC GGGTACGAGC TCGACCATCC GGTGCTGAAG
GCCGCCCTGC GGGCCCTCGA CGACTACAGC GTTACCCTCC CCGACGGCTC CCGCATGGTC
GAGGCGTCGC AGTCGCCGGT CTGGGACACC GCGCTGGCGG TGAACGCCCT CGCGGACGCG
GGTGCCACGG CCGCGATCGC GCCCGACCAC CCGGCGCTGG TCCGCGCCGC CGGCTGGCTG
CTCGGCCAGG AGGTCCGGCA CCGGCGTGGC GACTGGGCGG TCAACCATCC CGACGTCCCG
GCGAGTGGCT GGGCGTTCGA GTTCGAGAAC GACACCTACC CCGACACCGA CGACACGGCG
GAGGTTCTGC TCGCGCTGCG CCGGGTGCGC CACCCGGCGC GCGACGAGCT GGACGCCGCC
GAGCGCCGGG CGGTGGCCTG GCTGTTCGGG CTGCAGTCCA GCGACGGCGG ATGGGGCGCA
TACGACGCGG ACAACACCAG CACCATCCCG TACCAGATCC CGTTCGCCGA CTTCGGAGCC
CTCACCGATC CGCCCTCCGC GGACGTCACC GCGCATGTCG TCGAGCTGCT CGCCGAGGCC
GGCCTCGGCG GCGACGACCG CACGCGGCGC GGGGTGGACT GGCTGCTGGA CCACCAGGAG
GCCGACGGGT CGTGGTTCGG CAGGTGGGGC GTCAACTACG TCTACGGCAC CGGCAGCGTG
ATGCCCGCGC TGCGCGCCGC GGGGCTGGAG CCGTCCCATC CGGCCATGCG GGCGGGAGCG
GACTGGCTGC TCACCCACCA GAACGCCGAC GGCGGCTGGG GGGAGGACCT GCGCTCCTAC
ACCGATCCCG AGTGGTCGGG CCGTGGTGAG TCCACCGCGT CCCAGACGGC GTGGGCGATG
TTGGCCCTGC TGACGGTGGG CGACCAGCCC GAGGTGAGCG GGGCCCTCGC GAGGGGTGCC
CGGTGGCTGG CCGATCACCA GCGGCCGGAC GGCTCCTGGG ACGAGGACCA GTTCACCGGT
ACCGGGTTCC CCGGCGACTT CTACATCAAC TACCACGGCT ACCGGCTGCT GTGGCCGATC
ATGGCCCTCG GCCGCTACCT CCGCGGGTAG
 
Protein sequence
MFQGSDRPPV TLVMNDMRGP DMNVSDTVSV TRESIPTQTS AGDATARDLT AAVGSELTRA 
LRLATDHLLA LQDGTGWWKF DLETNTSMDA EDLLLREYLG IRTTEVTAAS ARFIRSRQSD
DGSWPQYFGG PGELSTTVES YIALRLAGDD ASAPHMLSAA TWVRDHGGVP ATRVFTRIWL
ALFGWWRWED LPALPPEIML LPRRAPLNIY SFGSWARQTL VSLTVVSALR PVRPAPFDLD
ELYPDGPASA WSGAGPSNVL ERISTRFTAK EIFLGIDRLL HVYHRRPVRS MRNHALRAAE
RWIIARQEAD GCFGGIQPPA VYSIIALRLL GYELDHPVLK AALRALDDYS VTLPDGSRMV
EASQSPVWDT ALAVNALADA GATAAIAPDH PALVRAAGWL LGQEVRHRRG DWAVNHPDVP
ASGWAFEFEN DTYPDTDDTA EVLLALRRVR HPARDELDAA ERRAVAWLFG LQSSDGGWGA
YDADNTSTIP YQIPFADFGA LTDPPSADVT AHVVELLAEA GLGGDDRTRR GVDWLLDHQE
ADGSWFGRWG VNYVYGTGSV MPALRAAGLE PSHPAMRAGA DWLLTHQNAD GGWGEDLRSY
TDPEWSGRGE STASQTAWAM LALLTVGDQP EVSGALARGA RWLADHQRPD GSWDEDQFTG
TGFPGDFYIN YHGYRLLWPI MALGRYLRG
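
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate DNA with the codon assignments of translation table 11, and compute GC content. It uses only the Python standard library; the helper names (`clean`, `translate`, `gc_percent`) are illustrative assumptions, and only the first and last lines of the gene sequence are embedded to keep the example short. Table 11 also allows alternative start codons, which this sketch ignores because the gene here begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11 in TCAG codon order ("*" marks a stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon or a trailing partial codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
first_line = clean("ATGTTCCAGG GATCCGACCG ACCTCCTGTC ACGTTGGTGA TGAACGATAT GCGAGGACCG")
last_line = clean("ATGGCCCTCG GCCGCTACCT CCGCGGGTAG")

print(translate(first_line))  # MFQGSDRPPVTLVMNDMRGP
print(translate(last_line))   # MALGRYLRG
```

The two printed fragments match the beginning and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field.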