Gene Franean1_3470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3470 
Symbol 
ID5671841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4099763 
End bp4100977 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content73% 
IMG OID641242358 
Productcytochrome P450 
Protein accessionYP_001507778 
Protein GI158315270 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0620803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.682712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA ACCAAGCGGT CGTGGCGACT CCGCCCGCCT TTCCCATGGA CCGTGGATGT 
CCTTACCACC CGCCGGCCGG GTACGCGCAG CTCCAGCAGG ACGGGCCGAT CACCCGGGCG
ACGCTGTTCG ACGGCCGGGA GGTGTGGGTG GTCACCGGCT ACGAGGAGGC CCGCCGGCTC
CTCGTCGACC CGCGGCTGTC CTCGGACCGT TCCCGGCCCG ACTTCCCGGT GCTGGTGCCG
CGGATGGCCG CGGCCAAGCT CGTCGCGCTC GTCGGGATGG ACCCGCCGGA GCACGACATC
CAGCGCCGCA TGCTGATCGG CAGCTTCACC GTGCGGCGGG CGAACGCGCT GCGGCCGGAC
ATCGAACGGA TCGTCGGCGG GCGCGTCGAC GCCCTGCTCG CGCACGAGCC GGGCGAGGTC
GTCGACCTCG TGCCCGAGTT CGCGTTGCCG ATCCCGTCCA CCGTGATCTG CGAGCTGCTC
GGCGTGCCCT ACGGCGACCA CGAGTTCTTC GAGGAGCAGA CCCGGCGGAT GGTGATCGCG
ACCAGCACGG CGGCCGAGGC CGCGGCCGCG TCGCGGGCCC TGGTCGACTA CTTCGACGAG
CTGATCGCCA GGAAGCGGGA GCGGCCCGGG GAGGGGCTGC TCGACGAGCT GATCGCCGAG
CGGCTCGTCA CCGGCCAGAT CGGGCAGGAC GATCTCGCGT CGATGGCGAT GTTCCTGCTC
GTCGCCGGGC ACGAGACGAC CGCGAACATG CTCGGGCTGA GCGTGCTGGC GCTGCTGGAA
CACCCGGACC AGCGGGCCCG GCTGATCGAG GACCCGGCCG GGCGGGCCGC CGGCGCGACC
GAGGAGCTGC TGCGCTTCCT GTCGGTGGCC GACGAGATCC AGCGGATCGC CGCCGCCGAC
ATCGAGGTCG CCGGGGTCGT CATCCGGGCC GGTGACGGGG TGTACCTGCC GACGGCGGCG
GCGAACCGGA CCGCGGCGAC GTTCCCCGAC CCCGACGCCC TCGACATCGG CCGGGTCCCG
CGGGGACATC TCGCCTTCGG CTACGGCATC CACCAGTGCA TCGGGCAGAA CCTGGCCCGG
GTGGAGCTGG AGATCGGCCT GCGCGAGCTG TTCGGCCGCA TCCCGACGCT GCGGCTGGCC
GAGCCGGTCG AGGCGCTCGG GGCGAAGCCC GGCGGCTCGG TGCAGGGCGT CTACCGGCTG
CCCGTCGTCT GGTAG
 
Protein sequence
MSSNQAVVAT PPAFPMDRGC PYHPPAGYAQ LQQDGPITRA TLFDGREVWV VTGYEEARRL 
LVDPRLSSDR SRPDFPVLVP RMAAAKLVAL VGMDPPEHDI QRRMLIGSFT VRRANALRPD
IERIVGGRVD ALLAHEPGEV VDLVPEFALP IPSTVICELL GVPYGDHEFF EEQTRRMVIA
TSTAAEAAAA SRALVDYFDE LIARKRERPG EGLLDELIAE RLVTGQIGQD DLASMAMFLL
VAGHETTANM LGLSVLALLE HPDQRARLIE DPAGRAAGAT EELLRFLSVA DEIQRIAAAD
IEVAGVVIRA GDGVYLPTAA ANRTAATFPD PDALDIGRVP RGHLAFGYGI HQCIGQNLAR
VELEIGLREL FGRIPTLRLA EPVEALGAKP GGSVQGVYRL PVVW