Gene Franean1_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1904 
Symbol 
ID5670305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2284523 
End bp2285854 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content74% 
IMG OID641240825 
Productcytochrome P450 
Protein accessionYP_001506247 
Protein GI158313739 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGC GGGCACACGA AGCCGCCGTC GACGAAGCCG CAGCCGCCGT TGACGGAACC 
CCCGCCGGCA GGGCCCCCGG CGGGGGCGCC GCCGGCGGAG GGCGGGCGGC GATCCGCTTC
AACCCCTTCG GCGCGGAGTT CCGCCGCGAC CCCTATCCGC TGTACCGGCG GATGCGCGAG
GCACAGCCCG TCCACCGCAC GATGGGCATG TGGGTCCTGA CCCGGCACGC CGACGTGCAG
GGCGTCCTGC GGGACCGCAC GTTCAGCGCC GGGCTCATCC CCCAGTTGGT CAGCCGGCGG
GCCGGCCAGC TGGCCCGGGC GAACGTCGAC CGGGTGGTGC GGCTCGGCGA GAAGTCCCTG
GTGTTCACCG ACAACCCGGA CCACGCGCGG CTGCGCGGGC TGGTGAACCG GGTGTTCACC
GCCACCGAGG TGGCCGCGCT GCGCCCCCGC GTCGAGGAGG TCGCCGGTGG CCTGGTGCGG
CGCGCCCTGG CCGCCGGCGG CATGGACGCC ATCGCCGACC TCGCGGCGCC GCTGCCCGTC
ACGGTCATGT GCGACTGGAT GGCGCTGCCG GACGGCCTGC GCGCGCAGGT CGGCCCGTGG
ACGCACGACA TCCGGTTCCT GCTCGAACCC GGGCTGATGC GGGCCGGCGA CTTCGAGCAC
GTGTGCGACG TCGTCGAGAC GTTCGCGCGG GCCCTGGACG GGGTCGTCGC CGAGCGCCGG
CAGCGCCCCG GCGGCGACCT GATCAGCCGC CTGCTCGCGG CCCGCACGGC CGGCGGGGAC
GCACTCACCC CCGAGGAACT GATCTTCGTC TGCATCATGT GCTTCGTCGC GGGCAACGAG
ACCACGAAGT CGCTGCTGGG CAATGGTCTC CTCGCGCTGC TGCGGCACCC CGACCAGGCG
GCCCGCCTGC ACCGCGCGCC CGAGCTGGCC GGCGACGCCG TCACCGAGGC GCTGCGCTAT
GACAGCCCGC TGCAGATGAC CAAGCGCGTC GCGACCCGCG ACGTGGACGT CGACGGCCGG
CGGATCCGGG CCGGAGACCA GGTGCTGCTG TGTCTCGGCG CCGCGAACCG CGACCCGGCC
GTGTTCGACC GGCCGGACGA GTTCGACGTC ACCCGCGCGC CGAAGGCGCA CCTGGCCTTC
GGCCACGGCA TGCACGGCTG TCTCGGCGGT CTCCTCGCCG AGATGCAGGC CCAGGTCGTC
TTCGAGCGCC TCTTCGGGGC GACCACCAGC ATCGAGCCAC GGACCGACCA GCTCGAGTGG
CAGGACCACA GCTCCATCGT CCGGGGGCTG CGACACCTCC CGGTGACTCT GCACCCGGCC
GCGCCGCGAT GA
 
Protein sequence
MTPRAHEAAV DEAAAAVDGT PAGRAPGGGA AGGGRAAIRF NPFGAEFRRD PYPLYRRMRE 
AQPVHRTMGM WVLTRHADVQ GVLRDRTFSA GLIPQLVSRR AGQLARANVD RVVRLGEKSL
VFTDNPDHAR LRGLVNRVFT ATEVAALRPR VEEVAGGLVR RALAAGGMDA IADLAAPLPV
TVMCDWMALP DGLRAQVGPW THDIRFLLEP GLMRAGDFEH VCDVVETFAR ALDGVVAERR
QRPGGDLISR LLAARTAGGD ALTPEELIFV CIMCFVAGNE TTKSLLGNGL LALLRHPDQA
ARLHRAPELA GDAVTEALRY DSPLQMTKRV ATRDVDVDGR RIRAGDQVLL CLGAANRDPA
VFDRPDEFDV TRAPKAHLAF GHGMHGCLGG LLAEMQAQVV FERLFGATTS IEPRTDQLEW
QDHSSIVRGL RHLPVTLHPA APR