Gene Franean1_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3938 
Symbol 
ID5672299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4706526 
End bp4707779 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content74% 
IMG OID641242817 
Productcytochrome P450 
Protein accessionYP_001508234 
Protein GI158315726 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0168475 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCCG CCGAGGTCAC CGCAGGGGTC GCCGCCGAGT CCGCCCCGGT GGAGTTCACC 
GCGTTCCATC CGACCCACAA GGCCGATCCG TACGCCGTCT ACCAGCGGGT GCGGGAGGCC
CGGCCGCTGT GCCCGTTCAT GCTCGGCGAC ATCCCGGTGA CGGTGCTCAC CCGGTACGCC
GACTGCGAGG CGGTGCTGCA GAGCGACGAC TGGGTGCACG GCTACGACGC CGGCATCAGC
CCGTTCCGGG ACGGCGCCGC GAGCGCCCCG CGGTCGTTCC TGCGGATGGA CCCGCCGGAC
CACACCCGGC TGCGCGGGCT GGTCAACAAG GCGTTCACCC CGCGGATCGT CAACCAGATG
GCGCCGGGCA TCCAGGCCCT CGCCGACCGG CTGGTGGACG GCGCGCTGGC CACCGGGAGC
ATCGACGTCA TCGGCGAGTA CGCGGCCCTG ATCGCGAGCG CGACGCTCGG CCACCTGCTG
GGCGTCCCGG ACGACGTCGG CGCGTCGCTG CGCGGCTGGG CGCTGGCCAT CGCCCGCGGC
ACCGACCCCG ACAACCTGCT CACCGAGGCC GAGCTCGTCG CACGCCGGCA GGCCACCCAG
GATTTCCAGG CCTACTTCGA GGAGCTGATC GCGCAGCGCC GGGCGAACCC CACCGACGAC
CTGATCAGCC GGATGGCCGC GGCCCGCGAC CGCGACGACG CGCTCAGCGA GCTCGAGCTG
CTCGGCGTCT CGTCGCTGCT CGTCGTGGCG GGGATGGAGA CGTCCATCAA CTATGTCGGG
TCGGCGGTGC TCTCGCTGCT GCGCCACCCC GACCAGCTGG CGCTGCTGCG CGCCCGGCCG
GAGCTGCTCC CCTCCGCGGT GGAGGAGGTG CTGCGCTACG ACCCGCCGAC GCAGTTCACC
ATGCGCACCG CGTGCCGCCG GACCGAGCTG GCCGGGCACA CGTTCGCCCG CGGCGACGGC
GTCCTGCTGG TCAGCGCCGC CGCCGGCCGG GACCCGGCGG CGTTCACCGA CCCCGACCGG
TTCGACATCA CCCGCTACCA CGGGCCGCGC CCGGCCCGCC GGCACCTGGG CTTCAGCGTC
GGCATCCACT TCTGCCTGGG CGCCCCGCTC GCCCGGATCG AGGCGGCGGC GGCCATCGGG
GCCCTCGTCC GGCGCACAAC CACCCTGGAG CTCGCCGTCG ACGAGGCCGA GCTGGTCTAC
CTGCCCAGCC TCATCCACCG CGCGCTGGCC ACCCTCCCGG TGCGGGTGCG CTGA
 
Protein sequence
MVAAEVTAGV AAESAPVEFT AFHPTHKADP YAVYQRVREA RPLCPFMLGD IPVTVLTRYA 
DCEAVLQSDD WVHGYDAGIS PFRDGAASAP RSFLRMDPPD HTRLRGLVNK AFTPRIVNQM
APGIQALADR LVDGALATGS IDVIGEYAAL IASATLGHLL GVPDDVGASL RGWALAIARG
TDPDNLLTEA ELVARRQATQ DFQAYFEELI AQRRANPTDD LISRMAAARD RDDALSELEL
LGVSSLLVVA GMETSINYVG SAVLSLLRHP DQLALLRARP ELLPSAVEEV LRYDPPTQFT
MRTACRRTEL AGHTFARGDG VLLVSAAAGR DPAAFTDPDR FDITRYHGPR PARRHLGFSV
GIHFCLGAPL ARIEAAAAIG ALVRRTTTLE LAVDEAELVY LPSLIHRALA TLPVRVR