Gene Franean1_4477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4477 
Symbol 
ID5672827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5341903 
End bp5343165 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content66% 
IMG OID641243344 
Productcytochrome P450 
Protein accessionYP_001508760 
Protein GI158316252 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.752604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACGG TCGGAGACAG GGAACTGAAT GGCGCCGTCG GAATCGTTCA TACGTCGATC 
GAGGCTGTGA ACGACGGCGC CAACCCTTAT CAGACGCTCG CCCCCTTACA TGATTATGGC
GAGGCGGTCA TCATGCCGGA GGGGTACCTC GCGGTCTGGG GTCACCAGGC GTGTATGGAC
ATCATGCGGT CGTCGGCATG GGGCCGGCAT CTTCCTGATT CCTCCATCCG GGCCGCCTGG
CAGCACGATC TGACCGCGGA GCAGGCCGAG CTTCTGCGGC AGGAGGAGCC ACCGCACATC
GCACCGTGGT TGCAGACCTT CGACGGCCCT GAACACGCAC GCCAGCGGTC GCTCGTCAGC
AAGCCATTCA CGCCCCGCCG GCTGCAGGTC ATGCGGCAGC GGACAACGGA GGTCGTCGGC
CGGCTCGCCG CCGCCGCGCC ACGAGGTGTG CCGTTCGACT TCATGTCCAC CATCGCCTTT
CCGATTCCGA ACCAGGTTGT CGGCGAACTC GTGGGTCTAC CCCTGCAGGA TCGTGACTGG
TTCGCCGAGC GGGCGGTGCT GCTGCTGGCG GAGCGCGACC CCCGGTCCAG CTTCGATCAG
CTGCGGAGGT CGACCCGGGC CATCCGTGAG CTGGGTGACT ACATCCGCGG ACTGCTTCGA
GGCGAAACGT GCCCGACGGA AGGACTGGCC TCGGACCTGC TCGAGGCGGA GGAGACCGGC
GCGCGACTCA CCGAGCCCGA ACTTCTGTCG CTGATGCTCC TGATGTATGT CGCGGGACAC
GGCACAACAG CGCACTCGAT GGGTAATGGC TTGTACGTAC TGCTCCAGCA CCCGGATCAG
CTGGCGGCGC TGCGCGCCGA CCGGGGGCTG ACCCGCTCGG CCGTCGAGGA GATCCTCCGC
TGGGACAGCG GGGTGACATC GGTCGACTAC AGCGCCGTGG AAGATACCGA CATCGACGGC
ATCCGCGTTC CGGCCGGAAC GCCGGCGCAT CTGTTCCTTT CCGCCGCGAA CCGGGATCCC
CGGGTCTATA CAGATCCCGG TAGCTTTGAC ATCAGGCGAA CCGAGGGGCC GACGGTCGTG
TTCGGCGCGG GGCCGCATTT CTGCCTGGGC GCGGCACTTG CCCGCCTGGA GCTGGAGATC
GCCCTCGACG TTCTCCTGGC CGGCTTCGCC TCGATCGAAC TGGCGACATC GACACCGCCC
CGCGGCGACT CCTTCAACTA TCGCTACTTC ACGCAGCTGC CCATTGTGGT CAGCGACAAC
TGA
 
Protein sequence
MSTVGDRELN GAVGIVHTSI EAVNDGANPY QTLAPLHDYG EAVIMPEGYL AVWGHQACMD 
IMRSSAWGRH LPDSSIRAAW QHDLTAEQAE LLRQEEPPHI APWLQTFDGP EHARQRSLVS
KPFTPRRLQV MRQRTTEVVG RLAAAAPRGV PFDFMSTIAF PIPNQVVGEL VGLPLQDRDW
FAERAVLLLA ERDPRSSFDQ LRRSTRAIRE LGDYIRGLLR GETCPTEGLA SDLLEAEETG
ARLTEPELLS LMLLMYVAGH GTTAHSMGNG LYVLLQHPDQ LAALRADRGL TRSAVEEILR
WDSGVTSVDY SAVEDTDIDG IRVPAGTPAH LFLSAANRDP RVYTDPGSFD IRRTEGPTVV
FGAGPHFCLG AALARLELEI ALDVLLAGFA SIELATSTPP RGDSFNYRYF TQLPIVVSDN