Gene Franean1_2720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2720 
Symbol 
ID5671111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3219820 
End bp3221046 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content69% 
IMG OID641241632 
Productcytochrome P450 
Protein accessionYP_001507052 
Protein GI158314544 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCG AGGCAGCGGC CACCGTCTTC ACGGATCCGC GGGCGTACGC CGACAACGAG 
CGCTTCCACA CCGCCACCGC GCTGCTGCGG CGCGAGTCAC CCATCCACCG GATCGAGCAT
CCGAACTTCC ACCCGTTCTG GGCCGTCACC AAGCACGAGG ACGTCATGGC GATCTCCCGC
GCCTCCGACA TCTGGATCAA CGAGCCCCGG CCCGCGCTGG GCCCCAAGCC CAAGGACGCC
GAGCGGGAGA ACATCCCGAT CCGATCGCTG GTCCAGATGG ATGCCCCGGA CCACCCGGTC
TACCGCCACG TGAGCGCGGA CTGGTTCAAG CCGGCCGGTG TCCGGCGGCT GCGCGACCGC
ATCGCCGAGC TGGCGAAGCG CTTCGTGGAC CGCATGGCCG ACCAGGGCGG CGAGTGCGAC
TTCTTCACCG ACATCGTGTC GCACTACCCG CTGTACGTGA TCCTTTCCCT GCTCGGCCTG
CCCGAGGAGG ACTTCCCCCG AATGCTCAGG CTGACCCAGG AGCTGTTCGG CGCGGACGAC
GAGGACCTGG CCAGGGACCA GGACAAGCAG GCGCAGAGGG CGCCCCTGGT GGACTTCTTC
AACTACTTCC AGGCGCTGAT CCAGGACCGC CGCGAGAACC CCACCGACGA CCTGGGATCC
GTGATCGCCA ACGCCACGAT CCAGGGCGAG CAGATCGGCA AGCTCGAGGC GGCCGGCTAC
TACACGCTGA TCGCGACGGC GGGGCACGAC ACCACCAGCG CGGCGCTCGC CGGCGGTCTG
CACGCCCTGT TGGAGAGCCC GGGTCAGTGG CAGCGCCTGG TCGACGACCC GAGGATGGTG
GCGACCGGCG TCGACGAGAT GATCCGGTGG GTCTCCCCGG TCAAGCATTT CATGCGCACC
GCCCGCGAGG ACACGGTCGT GCGCGGCGTC GCGCTCGCGG CGGGGGAGTC GGTCCTGCTC
TCCTATCCGT CGGCCAACCG GGACGAGGAC GTCTTCGAGA ACCCGGACAC CTTCGACGTC
GGGCGATCCC CGAACCGGCA TGTCGCCTTC GGGTTCGGCG CCCACTACTG CCTGGGCACC
CACCTGGCCC GCCTCGAGGG CCAGGCGCTC TACGCGGAGC TGGTCCCCCG GGTGCGGTCG
ATCGAGCTGG CCGGAACGCC CGAGTACATG GAGGCCCTGT TCGTCGGGGG GCCGAAGCGG
CTGCCGATCC GCTACACGAT GGCCTGA
 
Protein sequence
MDIEAAATVF TDPRAYADNE RFHTATALLR RESPIHRIEH PNFHPFWAVT KHEDVMAISR 
ASDIWINEPR PALGPKPKDA ERENIPIRSL VQMDAPDHPV YRHVSADWFK PAGVRRLRDR
IAELAKRFVD RMADQGGECD FFTDIVSHYP LYVILSLLGL PEEDFPRMLR LTQELFGADD
EDLARDQDKQ AQRAPLVDFF NYFQALIQDR RENPTDDLGS VIANATIQGE QIGKLEAAGY
YTLIATAGHD TTSAALAGGL HALLESPGQW QRLVDDPRMV ATGVDEMIRW VSPVKHFMRT
AREDTVVRGV ALAAGESVLL SYPSANRDED VFENPDTFDV GRSPNRHVAF GFGAHYCLGT
HLARLEGQAL YAELVPRVRS IELAGTPEYM EALFVGGPKR LPIRYTMA