Gene Franean1_3940 details

Gene Information

Locus tag: Franean1_3940
Symbol:
ID: 5672301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4708637
End bp: 4709920
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 75%
IMG OID: 641242819
Product: cytochrome P450
Protein accession: YP_001508236
Protein GI: 158315728
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
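
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below is illustrative only and rests on two assumptions that are not stated in the record: that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length in aa, assuming one terminal stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4708637, 4709920)
print(length)                     # 1284
print(protein_length_aa(length))  # 427
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed above.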


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0174987
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGCC CCGCCACGAG CACACCGCAC GCTGTCGCGG GCGCGCCCGC GGTGCCCGAC 
GCGGAGGCGG TGCTCGCGCA GGCCGTGATC GAGTCCTTCG ATCCGGCGTG GCGGCACGAC
CCCTATCCCG CGTACGGCAC GCTGCGCCGG GCCGGCACCT TCCTGCCGGG CCCGCTGGCC
GGCACCATGC TCGTCCCCGG CCATGCCGAG TGCGCCGCCA TCCTCGCCGA CCCGGTGTGG
AGCCACGCCG AGGAGTCGGA GCTGCTCCAC CCGGACAGTG ATGTCGAGCT GCCCGGCTCG
TTCCTGTGGA TGGAGCCGCC GGACCACACC CGGCTGCGCG GCCTGGTGAG CCGGGCGTTC
ACCCCACGGA CCATCGAGGC GACCCGCCCG CTGGCCCGCC GGGTCGTGGA CGGGCTGATC
GACGACGCCC TGGCGGCGGG GGAGCTGGAC CTGATCGAGG GGCTCGCCTA CCCGCTGCCG
CTGACCATGA TCTGCGAGCT GCTCGGGGTG CCCATCTCCG AGCACCCCGC GGTGCGCCGG
ATGTCGGCCG GGATCGCGCG CGGGCTCGAC CCCGACGTCC TGCTCTCACC GGCCGAGCTC
GCCGCGCGGA CCGCCGCCGT GGAGGAGTTC CGCGAGTTCT TCGGCGCGCT AGTCACCGCC
CGCCGGGCCG ACCCCCGCGA CGACCTGATC AGCGCGCTCG CCCAGGTGCA CGCCGAGGGT
GACCGGCTCA CCACGACCGA GCTGCTCGGC ACCCTGCTGA TCCTCGTGGT CGCCGGGCAC
GAGACCACCG TCAACCTGAT CGGCAACGGG GTGCTCGCGC TGCTGCGTGA CCCCGCGCAG
CTCGACGCGC TGCGGCGCGA TCCCGGGCTC GCGCTGCCCG CGGTGGAGGA GATCCTGCGG
TTCGACGCGC CGGCGCAGGT GACCACGCGC ACCGCCCGGG CGGAGGTGAC CGTGGCGGGC
CGGACGTTCA CCCCCGGCGA GGCCGTCATC TGCATGCTCG GCTCGGCCAA CCGTGACCCC
CGCGCCTTCG ACCGGCCCGA CGAGTTCCTC GTCGACCGCT ACGCGGGCGG CGCGCGGGTG
AGCCGCCATC TCGCCCTCGG GATGGGCCTG CACTACTGCC TGGGTGCCCC GTTGGTCCGG
CTCGAGGTCG GTGAGGTGCT GCGGGGCATC GCCACCCGCC TGACCGGGAT GACCCTGCTG
GCCGATCCGC CGCCGTACCG GCCGAACATC GTCGTCCGTG GGATGTCGTC GCTGCCTGTG
CGGGTCACCG GCCGACCCGG CTGA
 
Protein sequence
MTGPATSTPH AVAGAPAVPD AEAVLAQAVI ESFDPAWRHD PYPAYGTLRR AGTFLPGPLA 
GTMLVPGHAE CAAILADPVW SHAEESELLH PDSDVELPGS FLWMEPPDHT RLRGLVSRAF
TPRTIEATRP LARRVVDGLI DDALAAGELD LIEGLAYPLP LTMICELLGV PISEHPAVRR
MSAGIARGLD PDVLLSPAEL AARTAAVEEF REFFGALVTA RRADPRDDLI SALAQVHAEG
DRLTTTELLG TLLILVVAGH ETTVNLIGNG VLALLRDPAQ LDALRRDPGL ALPAVEEILR
FDAPAQVTTR TARAEVTVAG RTFTPGEAVI CMLGSANRDP RAFDRPDEFL VDRYAGGARV
SRHLALGMGL HYCLGAPLVR LEVGEVLRGI ATRLTGMTLL ADPPPYRPNI VVRGMSSLPV
RVTGRPG
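
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any analysis. As a minimal sketch, the Python below removes the spacing and computes a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and the record does not say how its own GC value was computed or rounded, so this is one plausible way to do it and may differ from the database's method.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGACGGGCC CCGCCACGAG CACACCGCAC "
    "GCTGTCGCGG GCGCGCCCGC GGTGCCCGAC"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare with the listed GC content.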