Gene Franean1_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3937 
Symbol 
ID5672298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4705083 
End bp4706375 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content70% 
IMG OID641242816 
Productcytochrome P450 
Protein accessionYP_001508233 
Protein GI158315725 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00640549 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGTGAGC TCGAGACCAA GGACTTCTTC CGGGACGAGG AGCTCGTCGC GGACCCGTAC 
CCGTTCCTCG AGGCGATGCG CGGGAAGTGC CCCGTGCAGC GCGAGAATCA CCACGACGTG
GTGATGGTGA CGGGGTACGA CGAGGCGGTC CAGGTCTTCC ACGACTCGGC GACCTTCTCC
TCGTGCGTCT CGGTGACGGG TCCGTTCCCG GGTTTCCCCG TCCCGCTCGA GGGCGACGAC
GTCACCGAGC TGATCGAGCG GCACCGCGGT GAGCTGCCGA TGAACGACCA GCTCCCCACG
CTCGACCCGC CCACGCACAC CGCGCACCGC GCGCTGCTGA TGCGGCTGAT CACGCCGAAG
CGCCTCAAGG AGAACGAGGC GCAGATGTGG CGGCTCGTCG ACCAGATGGT CGAGCCGTAC
CTGGCCGGCG GCGAGGGCGA GTTCATCACC GGCTTCGCCG GGCCGTTCAC CCTGCTGGTG
ATCGCCGACC TGCTGGGCGT GCCCGAGGAG GACCAGGAGG AGTTCCTCGA CCGCCTCCAG
CGCCAGCCGC AGGAGAGCGG CGGCATCGGC AGCACCGGCG ACGACCACAT GGCGCACAAC
CCGCTGGAGT TCCTCTACAA CAAGTTCACC GCCTACATCG AGGACCGCCG GCGCGAGCCC
CGCGAGGACG TCCTCACCGG GCTGGCGCTG GCGACGTTCC CCGACGGGTC GACCCCGGAG
GTCATCGACG CCGTCCGGGT CGCGGCCAAC CTGTTCTCCG CCGGACAGGA GACCACCGTC
CGGCTACTCT CCTCGGCACT GAAGATCCTC GCCGAGGACC GCGAGCTCCA GCAGCTGCTA
CGGGCCGAGC CGGACCGCGT CGGCAACTTC ATCGAGGAGA CGCTGCGGCT GGAGAGCCCG
GTCAAGGGCG ACTTCCGGCT CTCCCGGGTG CCGACCACCG TCGGCGGCGT CGACCTGCCC
GCCGGCACCA CGGTCATGGT CGTCAACGGC GCCGCGAACC GCGACCCGCG CCGCTTCGAG
AACCCGAGCG TGTTCGACGT CGCCCGCCCG AACGCCCGCC ACCACGTGGC GTTCGGCCGT
GGCATCCACA CCTGCCCCGG CGCCCCGCTC GCCCGCGCCG AGGCGCGTGC GAGCATCGAG
CGGCTGCTCG AGCGCACCAC CGACATCCGG ATCTCCGAAA GCGTGCACGG CCCCGCGGAC
GACCGCCGGT ACAGCTACCT GCCCACCTTC ATCCTGCGTG GGCTGACGCA CCTCAACCTC
GAGTTCACCC TCGCAGAGAG CAAGACGCCA TGA
 
Protein sequence
MSELETKDFF RDEELVADPY PFLEAMRGKC PVQRENHHDV VMVTGYDEAV QVFHDSATFS 
SCVSVTGPFP GFPVPLEGDD VTELIERHRG ELPMNDQLPT LDPPTHTAHR ALLMRLITPK
RLKENEAQMW RLVDQMVEPY LAGGEGEFIT GFAGPFTLLV IADLLGVPEE DQEEFLDRLQ
RQPQESGGIG STGDDHMAHN PLEFLYNKFT AYIEDRRREP REDVLTGLAL ATFPDGSTPE
VIDAVRVAAN LFSAGQETTV RLLSSALKIL AEDRELQQLL RAEPDRVGNF IEETLRLESP
VKGDFRLSRV PTTVGGVDLP AGTTVMVVNG AANRDPRRFE NPSVFDVARP NARHHVAFGR
GIHTCPGAPL ARAEARASIE RLLERTTDIR ISESVHGPAD DRRYSYLPTF ILRGLTHLNL
EFTLAESKTP