Gene Franean1_3863 details

Gene Information

Locus tag: Franean1_3863
Symbol:
ID: 5672226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4590957
End bp: 4592183
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 69%
IMG OID: 641242741
Product: cytochrome P450
Protein accession: YP_001508161
Protein GI: 158315653
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.793072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGG TGACGACGGA GACCAGCGAC GCCGCCGGCG ACGTCTACTA CGACCCGTAC 
GACTTCGAGA TCGACGCCGA CCCGTACCCG GTGTGGCGGC GGATGCGGGA CTCCGTGCCG
CTGTACTACA ACGCCAAGTA CGACTTCTTC GCCATCAGCC GGTTCGACGA CGTCGAGAAG
GTCATGGGCG ACTTCGAGAC CTACCGGTCG GGCCGCGGCT CCGTCCTCGA GATCATCAGG
TCGAACATCG ACTTCCCGCC GGGGAACATC CTGTTCGAAG ACCCGCCCGT GCACGACATC
CACCGCAGCA TCCTCGCCCG GGTCTTCACC CCGCGGAAGA TGCTCGCGAT CGAGCCGAAG
GTGCGCGAGT TCTGCGCCCG CTCGCTCGAC TCGCTGGTGG CCGAGGGCAA CTTCGACTTC
ATCGCCGACC TCGGCGCCCA GATGCCCATG CGGACGATCG GCATGCTGCT CGGCATCCCC
GAGCAGGACC AGGAGGCGAT CCGCGACGCC GTCGACGAGG GCCTCACCCT CACGGAGGGC
GCGCCGAAGC CGCTGAACGA GGACCCCCTC GCGCGCTCGG AGGGCATGTT CGCCGACTAC
CTCGACTGGC GGGCACGCAA CCCCTCGGAC GACCTCATGA CCGAGCTGAT CACCGCCGAG
TTCGAGGACG AGACGGGCAC GACCCGCCGG CTGACCCGCG CTGAGGTCCT CACCTACGTG
AACATGCTGT CGAGCGCCGG CAACGAGACC ACCACCCGGC TGATCGGCTG GACCGGGAAG
GTCCTCTCCG ACCACCCCGA CCAGCTCCGG CAGGTCGCAC GGGACAGGTC GATGGTCAAC
CAGGTGATCG AGGAGGTGCT GCGCTTCGAG GCGCCCTCCC CCGTCCAGGC CCGCTACGTC
GCCAGGGACG TCGAGGTGCA CGGCCAGACG GTGCCGGAGG GCAGCGTCAT GGTGCTGCTC
AACGGCTCGG CCAACCGCGA CGAGCGCCAG TTCGTCAACG GCGACAGCTT CGACATCCAC
CGGTCGATCA GCCGTCATGT CAGCTTCGGC CGCGGGCTGC ACTTCTGCCT GGGCGCCGCG
CTGGCCCGCC TCGAGGGACG GGTGGCGCTG GACGAGGTGC TCAAGCGCTG GGACCGCTGG
GAGGTCGACT ACGATCGCGC CGTCCAGGCC CGCACCTCCA CCGTCCGCGG CTGGGCCAAG
CTCCCGGTCA CGGCGACGCC GAGGTGA
 
Protein sequence
MTTVTTETSD AAGDVYYDPY DFEIDADPYP VWRRMRDSVP LYYNAKYDFF AISRFDDVEK 
VMGDFETYRS GRGSVLEIIR SNIDFPPGNI LFEDPPVHDI HRSILARVFT PRKMLAIEPK
VREFCARSLD SLVAEGNFDF IADLGAQMPM RTIGMLLGIP EQDQEAIRDA VDEGLTLTEG
APKPLNEDPL ARSEGMFADY LDWRARNPSD DLMTELITAE FEDETGTTRR LTRAEVLTYV
NMLSSAGNET TTRLIGWTGK VLSDHPDQLR QVARDRSMVN QVIEEVLRFE APSPVQARYV
ARDVEVHGQT VPEGSVMVLL NGSANRDERQ FVNGDSFDIH RSISRHVSFG RGLHFCLGAA
LARLEGRVAL DEVLKRWDRW EVDYDRAVQA RTSTVRGWAK LPVTATPR
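
The length fields in this record can be cross-checked against its coordinates. The sketch below is illustrative only and is not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(4590957, 4592183)
print(length)                  # 1227
print(protein_length(length))  # 408
```

Under these assumptions the computed values agree with the listed Gene Length (1227 bp) and Protein Length (408 aa).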