Gene Franean1_4029 details


Gene Information

Locus tag: Franean1_4029
Symbol:
ID: 5672387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4804105
End bp: 4805301
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 67%
IMG OID: 641242905
Product: cytochrome P450
Protein accession: YP_001508322
Protein GI: 158315814
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
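
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon (the record states the gene is not spliced). The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4804105, 4805301)
print(length)                  # 1197
print(protein_length(length))  # 398
```

Both values agree with the Gene Length and Protein Length fields of the record.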


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.717223
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTAA CCAGCCCCAG CGACGTCTAC TACGATCCTT ACGACGCCCA GATCGACGCC 
GACCCCTACC CGGTCTGGCG GCGCATGCGC GACGAGGCAT CGCTCTACTA CAACGAGAAG
TACGACTTCT ACGCCCTCAG CCGCTTCGAG GACGTCGAGC CCTGCCTGAG TGACTGGAAC
ACCTACCGGT CCGGCCGGGG ATCCATCCTG GAGCTCATCA AGGCCAACAT CGAGCTGCCC
TCGGGAATCA TTCTCTTCGA GGACCCGCCG ATCCACGACA TTCACCGCAG CCTGCTCGCC
CGGGTCTTCA CCCCGCGGAA GATGAACGCG CTGGAGCCGA AAATCCGCGA GTTCTGCGCG
CGTTCCCTCG ATCCCCTTGT CGGGACCGAG CGCTTCGATT TCATCCGGGA CCTCGGCGCG
CAGATGCCGA TGCGCACGAT CGGCTTTCTC CTGGGAATCC CGGAATCCGA CCAGGAGGCG
ATCCGGGACC GTCTCGACGA GGGCCTGCAG CTGCGCGAGG GTGAAGAGCT CTCGGTCTCG
GCGGAGGACT TCAACGCCGA CGAGTTCGGC GCCTACATCG ACTGGCGGGC CGAGCATCCC
TCCGACGACC TGATGACGGA GCTCCTGAAC GCCGAGTTCG AGGACGAGAC GGGCACCGTC
CGCAAGCTCC ACCGAGAGGA AGTGCTCACT TACGTCACGA TGCTCGCCGG GGCCGGGAAC
GAGACGACCA CGCGACTCAT CGGCTGGACC GGAAAGATTC TCGCCGAGAA CCCCGACCAG
CGGCGCGAAC TCGTCGCGGA CCGCTCGCTC ATTCCGAACG CGATCGAGGA GCTGCTGCGT
TTCGAGGCGC CCTCACCGGT GCAGGCGCGC TATGTCGCCC GCGACGTCGA ACACCACGGC
CACACCGTGC CCGAGGGCAG CATCATGGTG CTGCTGAACG GCTCGGCGAA CCGGGACGAG
CGCCGCTTCG CCGACCCTGA CCGCTTCGAC GTCCACCGCG ACGTCGGCCG CCATCTCAGC
TTCGGCTATG GCATCCACCA CTGCCTCGGG GCGGCGCTGG CCCGACTCGA GGGCAGGGTC
GCCCTGGACG AGGTCCTCAG CCGGTTCCCG ACCTGGGAGA TCGACTGGGA CAACGCCGTC
CAGGCCCGCA CCTCGACGGT CCGCGGCTGG GAGACGATGC CCGCCTTCGT CCGGTAG
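
The gene sequence is displayed in blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of the record. It assumes the sequence contains only the letters A, C, G and T; the helper names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a displayed sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGCCGTTAA CCAGCCCCAG CGACGTCTAC TACGATCCTT ACGACGCCCA GATCGACGCC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_content(seq)))
```

To compare against the reported GC content, pass the full 1197 bp sequence rather than the first line.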
 
Protein sequence
MPLTSPSDVY YDPYDAQIDA DPYPVWRRMR DEASLYYNEK YDFYALSRFE DVEPCLSDWN 
TYRSGRGSIL ELIKANIELP SGIILFEDPP IHDIHRSLLA RVFTPRKMNA LEPKIREFCA
RSLDPLVGTE RFDFIRDLGA QMPMRTIGFL LGIPESDQEA IRDRLDEGLQ LREGEELSVS
AEDFNADEFG AYIDWRAEHP SDDLMTELLN AEFEDETGTV RKLHREEVLT YVTMLAGAGN
ETTTRLIGWT GKILAENPDQ RRELVADRSL IPNAIEELLR FEAPSPVQAR YVARDVEHHG
HTVPEGSIMV LLNGSANRDE RRFADPDRFD VHRDVGRHLS FGYGIHHCLG AALARLEGRV
ALDEVLSRFP TWEIDWDNAV QARTSTVRGW ETMPAFVR
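
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). It is an illustrative implementation, not the database's own tooling, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCGTTAA CCAGCCCCAG CGACGTCTAC"))  # MPLTSPSDVY
```

The output matches the first ten residues of the protein sequence shown above.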