Gene Franean1_5157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5157 
Symbol 
ID5673491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6179770 
End bp6181194 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content73% 
IMG OID641244007 
Productcytochrome P450 
Protein accessionYP_001509421 
Protein GI158316913 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0481875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00610054 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGATCG ACCGCCTCCC CACCGCGGGG CACACCGGGG CGCAGGCACC GGGGCCGGGC 
CGGGCCACCC CGGCGTTCCT GCGCGAGCTG CTCGCCGACC CGCTCCGCCT GTTCACCCGG
CTGCGCACCA GTTACGGCCC GATCGTCCGG GTGCCGGTCG GGCGCGGCGG CTTCCACCTC
GTGTGCGGCC CCGAGGCGGT CGAGCAGGTC CTGGTGGGCG AGCAGCGGGC CTACGCGAAG
GGCCTGCGCC GGCGCACCAT GCCGCCGGGC GAGGGAATCC AGCCGCTGTC CCTGCTGCTC
GGTTCGGGCC TGCTCACCAG CGGCGGAGAC CTCCACCGGA CCCGCCGCCG GCTGATCCAG
CCGATGTTCC ACCGCGAGCG GATCGCCGGC TACGGCGCCG CGATCTCCGA GCTGTCCCGC
GCCACCGCGC TGGGCTGGGC GGACGGCTCC CGCCGCGAGG TCCACACCGA CATGAGCGAG
CTGACCCTCG CGATCGTCGC CCGCACCGTG TTCGGCGTCG ACGTCGACAG CGAGGTGGTC
CGGCGGGTGC GCCGCGCCGT CGCCGCGAAC ATGCGGCTGT CCCAGCTGGC CGTTCTCCCG
GGAGCCATAC GCCTGCAGCA GCACCTGCCG ATCGGCCCGC TGCGAGCGGC ACGCGACGCG
CGGGACGACC TCACCGCCGT GGTCATGGAG ATGATCGAGC AGCGCAGATC ACTCGACGCC
GCCGGCTCCG ACCTGCTGTC CACCCTGCTG GCCACCCGCG ACGCCGACAC CGGCGCCCCC
CTGGACGACA CCTCGATCCG CGACGAGGCG CTGACCATCC TGCTGGCCGG TCACGAGACC
ACCGCGAACG CGATGGCCTG GGCCTACCAC CTGCTCGCCA CCAACCCGCA GGCCCGCGAC
CGGATGCACA CCGAACTCGA CGACGTCCTC AACGGCCGGA AACCGACCAC CGCGGACCTG
GCCGAGCTGC CCTACACCCG GGCCGTGTTC AGCGAGACCC TGCGCCTGTA CCCACCCGCG
TGGATCCTCC TGCGCCGCAC GACGAGGGAC GTCACCCTCA CCGGGTACCA CCTGCCCGCG
GACACGAACG TCCTGCTGAG CCAGTGGGTG ATCCACCGGG ACCCCACCTG GTGGCCCGCC
CCCGAGGAGT TCCGCCCCCA GCGCTGGCTG ACCCCCGACC CCACCCGCCC GAAGTACGCC
TACTTCCCCT TCGGCGGCGG AACCCGCCAG TGCATCGGCA ACACCTTCGC CGAGATGGAA
GGAGCACTGG CCCTGGCTGC GATCAGCTCC ATCCGAACCC TGACCCCCAC CCCAGGCCGC
CCGGTAACCC CCATCCCCCG CGTAACCCTG CGCCCCCAGC CCCTACAGAT GACCGCCCAC
CCCCGCACAC CTCACCCCAC CCCCGCCACC CACCAGCCTC ACTGA
 
Protein sequence
MSIDRLPTAG HTGAQAPGPG RATPAFLREL LADPLRLFTR LRTSYGPIVR VPVGRGGFHL 
VCGPEAVEQV LVGEQRAYAK GLRRRTMPPG EGIQPLSLLL GSGLLTSGGD LHRTRRRLIQ
PMFHRERIAG YGAAISELSR ATALGWADGS RREVHTDMSE LTLAIVARTV FGVDVDSEVV
RRVRRAVAAN MRLSQLAVLP GAIRLQQHLP IGPLRAARDA RDDLTAVVME MIEQRRSLDA
AGSDLLSTLL ATRDADTGAP LDDTSIRDEA LTILLAGHET TANAMAWAYH LLATNPQARD
RMHTELDDVL NGRKPTTADL AELPYTRAVF SETLRLYPPA WILLRRTTRD VTLTGYHLPA
DTNVLLSQWV IHRDPTWWPA PEEFRPQRWL TPDPTRPKYA YFPFGGGTRQ CIGNTFAEME
GALALAAISS IRTLTPTPGR PVTPIPRVTL RPQPLQMTAH PRTPHPTPAT HQPH