Gene Franean1_4689 details


Gene Information

Locus tag: Franean1_4689
Symbol:
ID: 5673031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5603347
End bp: 5604681
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 68%
IMG OID: 641243546
Product: cytochrome P450
Protein accession: YP_001508962
Protein GI: 158316454
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
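
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The following is a minimal Python sketch, assuming inclusive 1-based coordinates and a terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5603347, 5604681)
print(length, protein_length(length))  # 1335 444
```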


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.110488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGAAT TCGAGGCAAT GGACTTCTTC CGCGACGAGA CTCTCGTCGC GGACCCGTAC 
CCTTACCTCG ACGCCCTGCG GCGGAAATGC CCCGTACAAC GGGAACGCCA CCACGACGTG
GTGATGGTGA CCGGCTACGA GGAGGCGGTG GAGGTCTTCC ACGACTCCGA GGCGTTCTCG
TCCTGCGTCT CGGTGACAGG CCCGTTCCCC GGCTTCCCGG TCCCGCTCGA CGGTGACGAC
GTCTCCGCGC TGATCGAACG GCATCGCCAT GAGCTGCCGA TGAACGACCA GCTCCCGACG
ATGGACCCGC CCACCCACAC CGATCATCGC GCGCTGCTGA TGCGGCTGAT CACCCCTAAG
CGCCTCAAGG AGAACGAGGC GCTGATGTGG GACCTTGCCG ACCGCATGCT CGACCCGTTC
CTCACTCCTG GTGAGGGAGA GTTCATCAGC GGATTCGCCG GACCGTTCAC ACTGCTCGTC
ATCGCGGACC TTCTGGGCGT CCCCGAGGAG GACCAGGACG AGTTTCTCGA CAAACTGCAG
CGCCAGCCGG CACAGACCGG CGGCGTCGGC GGCACCGGAG CGGAGACCCT GGCCCACAGC
CCGCTGGAGT TTCTCTACGG GAAGTTCACC GGCTACATCG AGGACCGTCG CCGCAACCCC
CGCGCCGACG TGCTGACCGG TCTGGCCGGC GCGACGTTCC CGGACGGATC GACACCCGAG
GTTATCGACG TGGTGCGGGT GGCCGCGAAC CTCTTCTCCG CCGGTCAGGA AACCACGGTG
CGCCTGCTCA GCTCGGCGCT GAAGATCCTC GCCGAGCGGC CCGACCTCCA GCGGCAGCTT
CGTGTCGAGC GGGAGCGCAT CCCGGCCTTC ATCGAGGAGA CCCTGCGCTG GGAGAGCCCG
GTCAAGGGCG ACTTCCGGCT CTCCCGTGTG CCGGTCACCG TGGGTGGGGT GCAGCTGCCC
GCCGGCACCA CGGTGATGGT GGTCAACGGG GCGGCCAACC GCGACCCACG CCGCTTCGAG
AACCCGGAGA CGTTCGACGT CGCCCGTTCC AACGCCCGCC AGCACCTGGC CTTCGGGCGT
GGGATCCACA GCTGCCCCGG CGCGCCGTTG GCACGGGCCG AGGCACGGGC GAGTCTTGAA
CGGCTGTTGG ACCGCACCAC CGACATCCGC GTCAACGAGC GGGTGCACGG CCCGGCCGGC
AACCGCCGCT ACGAGTACAT GCCCACCTTC ATCCTGCGTG GGCTGACCGC CCTGCACCTG
GAGTTCGACC TCGCGCCAGC ACCGCCACGT GACTTCCCGC CCGCCGGCTC ACCTGTCGGA
TGGATCAAGG GCTGA
 
Protein sequence
MREFEAMDFF RDETLVADPY PYLDALRRKC PVQRERHHDV VMVTGYEEAV EVFHDSEAFS 
SCVSVTGPFP GFPVPLDGDD VSALIERHRH ELPMNDQLPT MDPPTHTDHR ALLMRLITPK
RLKENEALMW DLADRMLDPF LTPGEGEFIS GFAGPFTLLV IADLLGVPEE DQDEFLDKLQ
RQPAQTGGVG GTGAETLAHS PLEFLYGKFT GYIEDRRRNP RADVLTGLAG ATFPDGSTPE
VIDVVRVAAN LFSAGQETTV RLLSSALKIL AERPDLQRQL RVERERIPAF IEETLRWESP
VKGDFRLSRV PVTVGGVQLP AGTTVMVVNG AANRDPRRFE NPETFDVARS NARQHLAFGR
GIHSCPGAPL ARAEARASLE RLLDRTTDIR VNERVHGPAG NRRYEYMPTF ILRGLTALHL
EFDLAPAPPR DFPPAGSPVG WIKG
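
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the NCBI bacterial, archaeal and plant plastid code) predicts: GTG is an accepted initiation codon and is read as methionine at the start position. The sketch below illustrates this on the first and last blocks of the gene sequence. It is a hedged example rather than the database's own method: the helper name `translate_cds` is hypothetical, and the start-codon set is the one listed for table 11 as recalled from the NCBI genetic code tables.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (third base varies fastest).
# Table 11 uses the same codon-to-residue mapping as the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons for table 11 (assumption: as listed by NCBI).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # any accepted start codon initiates with methionine
        if aa == "*":
            break  # stop codon ends translation and is not emitted
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGCGCGAAT TCGAGGCAAT GGACTTCTTC"))  # MREFEAMDFF
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("TGGATCAAGG GCTGA"))  # WIKG
```

Both outputs match the corresponding ends of the protein sequence listed above.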