Gene Franean1_5491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5491 
Symbol 
ID5673822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6645147 
End bp6646613 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content78% 
IMG OID641244346 
Productputative cytochrome P450 
Protein accessionYP_001509752 
Protein GI158317244 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.767325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.292754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATC CCAACCCCCG CCAGCTCGGC ACGTGCGCGA GCGGCATGCC CCGCGCGTTC 
GGCGCCGTCC GCCAGCTCGG CGCGCTGCGC CAGTACGGCG CGTCCTGCCC GGCACCGACC
ACCCGCCGCG GCGGGCCCGA GGACACCCCG TCCGGCGAGC CCGTGCCCGG ACCATCCAGC
GGCCCGCCGA GCGGACCGCG CGGCGGGCCG GTGCGCGGCA CGGCGGGCGC CCACCCGCCG
GCCCGGCCGG ATCTCGACGC GCGCTGGCCG GCGCACGCCA CCGCCGTCCG GCTACCGGCC
GACGGCCCCG GGGACGACCG CGCCGCGTTC TACCGGCGTA TCCGCGACGA GCACGGCCCG
GTCGCCCCGG TCCTGCTCGA GGGGGACGTC CCGGCCTGGC TGGTGCTCGG CTACCGCGAG
GTCGTCTACG TCGCCGGCAC CCCGGCCCTG TTCGGCCGGG ACTGCCGGAT CTGGAACGCC
TGGGATCTCG TCCCGCCGGC CTGGCCGCTG CAGTCGATGG TCGGCCAGCG GCCGTCGCTA
CGCGCCCTCG ACGGGCCCGC CCACGCCCGC CGGCTGGGCG CCATCGGCGA CGTGCTGGGC
ATGTTCGAGC CGCACGGCCT GCGGGCGCGG GCGGCCAGCG GCGCCGACGC CCTGATCGAC
GCGATGGCCA GCGACGGCCG CGCCGAACTC ATGACCCAGT ACGCCAGCCC GCTGCCGGCG
ATGGTCGCAG CCGGGATGTG GGGGCTGCCC GACACCGACC TGACCGCGCT CGCCCGGGAT
CTGACCCTGC TCGTCAGCGG CGGGACGGGG GCCCGCGACG CGCGTCGGCG GGCCCACGCG
ATGCTCACCC GGGTGGTCCG GCGCCGCCGC GCGCAACCGG GCGACGACGC GGTGTCGGCG
CTGCTCACCC ATCCGGCCGG CCTGCGCGAC GACGAGGTCG TCGAGGACGT CATGGCGACC
CTGATCGCCG GCCAGGCGCC GACCGCGGAC TGGATCGGGA ACACGCTGCG GCTGATGCTG
ACCGACCCCC GGTTCGCCAC CGACCTCGCC GGTGGACGGC TCAGCGCGGG GGACGCGATG
GCCGAGGTGC TGTGGGCGGA CCCACCGATC CAGAACCTCG TCGCGCGGTG GGCCCGGCGC
GACGTCCGGC TCGGCGGGCG CTGGATCCGC GCGGGCGACC TGCTCATCTT TGGCCTGTCG
GCCGCGAACT CCGATCCGTG GGCGCGTCCC CAGCCCTCCG GGCGCCCGTC GGGCAACCAC
GCCCACCTGG CGTTCGGCCA CGGCGAGCAC CGCTGCCCCT TCCCCGCCCA GATGACGGCC
GAGACGATCG CGACCACCGC GATCGAGGTG CTCCTGGACC GCCTCCCCGA TCTCGAGCTC
GCCGTGGAGC CGCGGGACCT GCGCTGGCGC CCGTCGGTGT GGGCGCGTGG GGTCGCCGCG
CTCCCGGTGC GCTTCACGCC CCCCTGA
 
Protein sequence
MTHPNPRQLG TCASGMPRAF GAVRQLGALR QYGASCPAPT TRRGGPEDTP SGEPVPGPSS 
GPPSGPRGGP VRGTAGAHPP ARPDLDARWP AHATAVRLPA DGPGDDRAAF YRRIRDEHGP
VAPVLLEGDV PAWLVLGYRE VVYVAGTPAL FGRDCRIWNA WDLVPPAWPL QSMVGQRPSL
RALDGPAHAR RLGAIGDVLG MFEPHGLRAR AASGADALID AMASDGRAEL MTQYASPLPA
MVAAGMWGLP DTDLTALARD LTLLVSGGTG ARDARRRAHA MLTRVVRRRR AQPGDDAVSA
LLTHPAGLRD DEVVEDVMAT LIAGQAPTAD WIGNTLRLML TDPRFATDLA GGRLSAGDAM
AEVLWADPPI QNLVARWARR DVRLGGRWIR AGDLLIFGLS AANSDPWARP QPSGRPSGNH
AHLAFGHGEH RCPFPAQMTA ETIATTAIEV LLDRLPDLEL AVEPRDLRWR PSVWARGVAA
LPVRFTPP