Gene Franean1_4218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4218 
Symbol 
ID5672573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5024124 
End bp5025311 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content69% 
IMG OID641243091 
Productcytochrome P450 
Protein accessionYP_001508508 
Protein GI158316000 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.247532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGACG ATCTGTACTG GGACCCGTTC GACAAGGAGA TCGACGTCAA CCCGCACCCG 
CTGTGGAAGC GGATGCGCGA CGAGGCGCCG GTCTACCACA ACGAGAAGTT CGACTTCTAC
GCGCTCTCCC GCTTCACCGA CGTCGACACG GCCCACCTCG ATCCCGCGAC CTACAGCTCC
AAGTACGGCA CGGTCCTCGA GCTCATGAAG TCGGAACCAT GGGACACCGG CCAGATAATC
TTCATGGACC CGCCCACGCA CACCACCCTG CGCGTCCTGG TCTCCCGGGC CTTCACCCCG
CGCCGCGTCG GCGGTCTCGA GGGCGTCATC CGCGACCTGT GCGCCGAGCT GCTCGACCCC
CAGGTCGGCG GCGGCGGGTT CGACTTCGTG CAGGACTTCG CCGCCCAGCT CCCCTCGCTG
GTCATCTCCC AGCTCATCGG CGTCGACCCC GCCGACCGGG AGGACGTCCG CAAGATGATC
GACGGGACGT TCTACCTGGA CCCGGAGAAG GGCATGTTCA ACGAGACGGC CATGGCGGCC
ACCGCCAAGT TCCACGGCTA CCTCAACGGG CAGATCCAGG AGCGGGTCAA GAACCCCCGG
GACGACATGA TGACCGCGCT CACCCAGGCG GAGATCACCA CCGACGACGG CACCCGCCGG
CTCAGCCTCA GCGAGGCCAC GGACTTCACT GCGCTGCTCG TCTCGGCGGG CACGGAGACC
GTCGCCCGGC TGCTCGGCTG GGCCTGCGTC CTGCTGGCCG CCCACCCCGA CCAGCGGGCC
GACCTGGCCG CCGACCCGTC GCTGCTGGGC GGCGCCGTCG AGGAGACGCT GCGCTACGAG
GCGCCGTCCC CCGTCCAGGG CCGCGTCACC ACCCGCGACG TCGAGCTGCA CGGCACCGAG
ATCCCCGCCA AGTCCAAGGT CCTGCTGCTC ACCGGCTCGG CCGGGCGTGA CGACCGCAAG
TACGACGACC CGGACCGCTA CGACATCCGG CGCAGGTTCG ACAGCCATGT CTCCTTCGGG
CACGGCGTGC ACTTCTGCCT CGGTGCCGCG CTCGCCCGCA TGGAGGGACG CATCGCCCTG
GAGGAGACCC TGCGCCGCTT CCCGACCTGG GACGTCGACC ACGGCAACAC GGTGCGCCTG
CACACCAGCA CCGTGCGCGG CTACGAGGAG CTGCCGATCA TCGTCTAG
 
Protein sequence
MTDDLYWDPF DKEIDVNPHP LWKRMRDEAP VYHNEKFDFY ALSRFTDVDT AHLDPATYSS 
KYGTVLELMK SEPWDTGQII FMDPPTHTTL RVLVSRAFTP RRVGGLEGVI RDLCAELLDP
QVGGGGFDFV QDFAAQLPSL VISQLIGVDP ADREDVRKMI DGTFYLDPEK GMFNETAMAA
TAKFHGYLNG QIQERVKNPR DDMMTALTQA EITTDDGTRR LSLSEATDFT ALLVSAGTET
VARLLGWACV LLAAHPDQRA DLAADPSLLG GAVEETLRYE APSPVQGRVT TRDVELHGTE
IPAKSKVLLL TGSAGRDDRK YDDPDRYDIR RRFDSHVSFG HGVHFCLGAA LARMEGRIAL
EETLRRFPTW DVDHGNTVRL HTSTVRGYEE LPIIV