Gene Franean1_4585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4585 
Symbol 
ID5672932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5467729 
End bp5468928 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID641243448 
Productcytochrome P450 
Protein accessionYP_001508864 
Protein GI158316356 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.189818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTTA CCGAACCGGT GCCGAGCCGG TGCCCGGCCG TGACCTACTC ACCCCACGAA 
CAGCGTCCGG TCGGAGAGTG GACCGCCTTC TTCGACCAGC TGCGCGACGA GGCGCCGGTG
GTGCGCAACA CCTTCGCCAA CGGCTACTAC GTGCTCACCC GGTACGAGGA CATCCTCAGT
GCCTACCAGG ACACCGACAC CTTCTCGACC CAGGCGGTCA CGGTCCTGGA GCCGGACCCC
AGCTACCGCT GGATCCCGCA CATGCTCGGC GGCAACGAGC ACCGGCAGTG GCGGCGGCAG
CTCGGCCCGT ACTTCTCGCC GCGGGCGATC GAGGGGCTCG ACGACCGGAT CCGGGCGCGG
GCGGTCGGGC TCATCGAGTC GTTCGCCGAC CGCGGGTCCT GTGACGTGAT CACCGACTTC
TCGTTCCACT TCCCGACGAC GATCTTCCTC GAGCTGATGG GGCTCCCGGT CGGCGACCTC
GACCGGTTCA TGGCCTGGGA AGCCAACATC CTGCACTCGA ACGGCTCGAC GCCGGAGGAG
ATCGCCCACA ACCGGACGAC GGCGATGGCG GCGGTGTACG AGTACTTCGG CTCGGTCATC
GCCGATCGCC GGCGCAGGCC CGGGGACGAC CTGGTGAGCC ACGCGATCGC CTTCAAGGTC
GACGGCCGGC CGGTGACCGA CGACGAGGTC CTCTCCTACT GCTTCTTCAT GTTCATGGCG
GGGTTGGACA CCGTCGCGGC CGCGCTCGGC TACTCGCTCT ACCACCTGGC GACCCACCCC
GAGGACCGGG CGCGGATCGT GGCGGACCCG GCGCTGATCC CGTCGGCCAT CGAGGAGATC
CTGCGGGCCT ACGCGTTCAC CATCCCGGCG CGCAAGGTCA CCAGGGACAT CGAGGTCGCC
GGGTGCCCGA TAGCGGCCGG ATCGATGGTG CAGCTGCCCA TCAAGGCGGC GATGCGCGAC
GGCGCCGCCT TCGCCAGCGC CTCGGAGGTG CTGATCGACC GCAAGCCGAA CAACCACATC
GCCTTCGGTG CCGGCCCGCA CCGCTGCCTG GGTTCGCACC TCGCGCGCCA CGAGCTGGCG
ATCGCGCTGG AGGAGTGGCA CAAGCGGATC CCCGACTACC GCCTCGCGGA CGATGCCGTC
ATCACGGAGC TCGGTCGCAG CTCCGGCCCC GACACGGTGC CGGTCGTCTG GGCGCGCTAG
 
Protein sequence
MSVTEPVPSR CPAVTYSPHE QRPVGEWTAF FDQLRDEAPV VRNTFANGYY VLTRYEDILS 
AYQDTDTFST QAVTVLEPDP SYRWIPHMLG GNEHRQWRRQ LGPYFSPRAI EGLDDRIRAR
AVGLIESFAD RGSCDVITDF SFHFPTTIFL ELMGLPVGDL DRFMAWEANI LHSNGSTPEE
IAHNRTTAMA AVYEYFGSVI ADRRRRPGDD LVSHAIAFKV DGRPVTDDEV LSYCFFMFMA
GLDTVAAALG YSLYHLATHP EDRARIVADP ALIPSAIEEI LRAYAFTIPA RKVTRDIEVA
GCPIAAGSMV QLPIKAAMRD GAAFASASEV LIDRKPNNHI AFGAGPHRCL GSHLARHELA
IALEEWHKRI PDYRLADDAV ITELGRSSGP DTVPVVWAR