Gene Franean1_4786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4786 
Symbol 
ID5673127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5711653 
End bp5712660 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content67% 
IMG OID641243642 
Productcytochrome P450 monooxygenase 
Protein accessionYP_001509058 
Protein GI158316550 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.396578 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGTA GCGGTCCGGT CTACTGGGAT CCCTTCGACC GCGATATCGC CGGCGACCCC 
TACCCGGTCT ACCAACGGCT GCGCGCGGAG GCGCCGCTCT ACTACAATGA CCGGCAAGAC
TTCTACGCCC TCAGCCGCCA CGAGGACATC GACCGCTGCC TGACCGACTG GAAGACGTTC
TCGTCGGCCC GCGGGCCGAT CCTGGAAATC ATCAAGGCGA ACGTCGAAAT TCCGCCGGGC
ACGTTGCTGA TGGAGGACCC GCCGGCTCAC GACATCCACC GCGCTCTTCT CGCGCGCGTT
TTCACCCCGC GGCGGGTCAC CTCGCTCGAG CCGCAGGTCC GGGACTTCTG CCGGCGCTGC
CTTGACAGAC TCGTCGACGT CGACAGCTTC GACCTCATGG CGGAGTTCGC CAACGAGGTG
CCGATGCGCG TGATCGGGAT GCTGCTCGGC ATCCCGGAGT CGGACCAGCC GGCGATCCGC
GAGCGCGCGG ACGCGAAACT GCGCACCGAG CCGGGGCAGC AGATGAAGGT GTCCCAGCAG
GCACTCATGG ACTCGGACCT GTTCGCGGAG TACATCGACT GGCGGGCCGA GCATCCGTCG
GACGACCTGA TGACGGAGCT GCTACGCGCC GAGTTCGAGG AGACGCTGCG GTTCGAGCCG
ACCGGTCACG CGATCGCCCG CTACGTCACG ACCGGCGTCG AGTTACATGG CCGCACGGTG
CCTGCCGGCA GCGCGATGAT GCTCCTCATC GCCTCGGCGA ACCGGGACGA GAACAGCTGG
TCGGACCCTG ACCGGTTCGA CGTCCACCGC GGAACCGGTC ACCTCCGGAC CTTCGGCCTC
GGAACCCATT ACTGCCTCGG AGCCGCGCTG GCCCGGCTTG AAGCCAGGGT GGCGCTCGAG
GAGATCCTGA AACGCTTCCC GCGATGGAAC GTCGACTGGG AGAACTCCGC GCTTTCGTCG
ACGTCGACGA TGCGCGGCTG GGAGACGCTG CCGATCACCG TCGGTTAG
 
Protein sequence
MSRSGPVYWD PFDRDIAGDP YPVYQRLRAE APLYYNDRQD FYALSRHEDI DRCLTDWKTF 
SSARGPILEI IKANVEIPPG TLLMEDPPAH DIHRALLARV FTPRRVTSLE PQVRDFCRRC
LDRLVDVDSF DLMAEFANEV PMRVIGMLLG IPESDQPAIR ERADAKLRTE PGQQMKVSQQ
ALMDSDLFAE YIDWRAEHPS DDLMTELLRA EFEETLRFEP TGHAIARYVT TGVELHGRTV
PAGSAMMLLI ASANRDENSW SDPDRFDVHR GTGHLRTFGL GTHYCLGAAL ARLEARVALE
EILKRFPRWN VDWENSALSS TSTMRGWETL PITVG