Gene Franean1_4706 details


Gene Information

Locus tag: Franean1_4706
Symbol:
ID: 5673048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5621524
End bp: 5622741
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 65%
IMG OID: 641243563
Product: cytochrome P450
Protein accession: YP_001508979
Protein GI: 158316471
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
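
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (both are assumptions about this record's conventions; the function names are hypothetical and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(5621524, 5622741)
    print(length)                  # 1218
    print(protein_length(length))  # 405
```

Under these assumptions the computed values match the 1218 bp and 405 aa listed in the record.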


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.15095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.897652
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAAC CAGTACAACC TGCTGACAGG ATCGTCGGCG ACGTGGACGA CGACGTCGAC 
TACGTCGACG GCCAAATCAC GGACTTCCAC GCACGCTTAG CCGCGCTTCG TGCCGAGAAG
GGCGTCGCCC GCCTTCGGTT CGGCCCCGAC ACCGGCCTCA TGCTGCTCCG CCACGCCGAT
GTGGTCGTCG CTCTACGAGA TGAGACCCGC TTCTCGAAAT CCGGGGCGTT CCGGCCGATC
ACGTTCCCGT TCCTAGGCCC CAATATCACC GGCTATGACG GTCACGAACA CAATGTGAAG
CGTGCCCTGG TGTCGCCGAC GTTCCGCCGG ACAATGATCC CGCGTTACAT CCAACCCGTC
ATCCGGCCGA TCGCTGAGGA GCTCGTCGCC GACCTCGCAA CACTCGGCGA GGCCGACCTC
ATGGCCACGT TCGCCAAGAA GTACCCTATG CGGATCACCA GCCGCCTCCT CGGTATCCCG
TCCGACGAGG AGGACAAGCT GGCGAGCTGG GCCTTCTCCA TGCTCCATAT CGCAGGCGAC
CCCGACGGCG CCATGAAGGC CAATGCAGAG TTCACCGAGT ACGTCGGACC GCTTATCGAC
ACCCGGCGCG CCCACCCCCG TGACGATCTT CTTTCGGCGC TGCTGACCGA GGAGGTCGAA
GGCCAACACC TCGACCACGA CGAGGTTCTC GGCTTCCTTC GCCTGCTGTT CCCTGCCGGC
GTCGACACGA CCTGGCAGGC GCTCGGCAGT CTCGTGCACG CGGTCCTCGA GCATCCCGAG
GTCCACCAGA GGCTCCGCCG CGACGAGGAG GAAAGGGCCT GGGCAGTCGA GGAAACGCTC
CGCTGGGAGT CACCCGTAGC AGCTGATTCG CGGCTGACCC TGCAAGACGT CGTCGTCTCA
GGCGTCGAGA TCGCCGCCGG AGAACTTGTG CGACTTGGCC TATCCGTGGC CAACCGAGAC
CCCGACGTCT TCCCGGACCC CGATCGCTGG AACCTGGACC GCAGACCGAC AAACCACATC
ACGTTCGGGC TCGGCCGCCA CTTCTGCCTC GGCGCCCACC TGGCGCGCGT CGAACTGCAG
GTGGCACTCG ACGTGCTGCT GCAGCGGTTG CCCAACCTCC GACTTCTCGA GCAACCCCAA
ATCACCGGCA TAGGCATCCG CGGCCCCAAG ACCCTCCGAG TCGCGTGGGA CGCGCCCTCC
ACACCTGGTG CACCCTGA
 
Protein sequence
MLKPVQPADR IVGDVDDDVD YVDGQITDFH ARLAALRAEK GVARLRFGPD TGLMLLRHAD 
VVVALRDETR FSKSGAFRPI TFPFLGPNIT GYDGHEHNVK RALVSPTFRR TMIPRYIQPV
IRPIAEELVA DLATLGEADL MATFAKKYPM RITSRLLGIP SDEEDKLASW AFSMLHIAGD
PDGAMKANAE FTEYVGPLID TRRAHPRDDL LSALLTEEVE GQHLDHDEVL GFLRLLFPAG
VDTTWQALGS LVHAVLEHPE VHQRLRRDEE ERAWAVEETL RWESPVAADS RLTLQDVVVS
GVEIAAGELV RLGLSVANRD PDVFPDPDRW NLDRRPTNHI TFGLGRHFCL GAHLARVELQ
VALDVLLQRL PNLRLLEQPQ ITGIGIRGPK TLRVAWDAPS TPGAP
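
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal Python sketch of how such a listing could be cleaned, checked for GC content, and translated. It assumes only the standard codon assignments, which translation table 11 shares with the standard code for elongation codons, and it does not handle the alternative start codons that table 11 permits. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated as TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(listing: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(listing.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    head = clean("ATGTTGAAAC CAGTACAACC TGCTGACAGG")
    print(translate(head))  # MLKPVQPADR
```

Passing the full gene listing through clean() and then gc_fraction() or translate() gives values that can be compared against the GC content and protein sequence fields of the record.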