Gene Franean1_4324 details


Gene Information

Locus tag: Franean1_4324
Symbol:
ID: 5672679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5165885
End bp: 5167072
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 70%
IMG OID: 641243197
Product: cytochrome P450
Protein accession: YP_001508614
Protein GI: 158316106
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
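
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5165885
end_bp = 5167072

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1188 395
```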


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAAC TAAGCTGGGA TCCGTTCGAC AAGGTGATCC ACCTTGCGCC GTACGACGTG 
TGGCGACGGA TGCGCGACGA GGCGCCGGTG TACCGCAACG ACAGGCTCGA CTTCTTCGCG
CTGTCGCGGC ACGCCGACGT CGAGGCCGCC CACCGGGATC CGGCGACCTA CAGCTCGGCG
CACGGCACCG TCCTGGAGAT CATGTCGCCG GAGCCGATGC AGACCGGGCT CATCATCTTC
ATCGACCCGC CGACGCACAC CGAGCTGCGC ACCCTGGTCT CCCGGGCGTT CACCCCCCGG
CGGATCTCGG CGCTCGAGGA CTCCATCCGG GCGCTGTGCG CCGAGATGCT CGACCCGCAG
GTGGGCGGGA GCGGGTTCGA CTACGTGCAG GACTTCGCGG CCCAGCTCCC GTCCAAGGTC
ATCTCCGAGC TGATCGGGGT CGACCCGGCG GACCGCGAGG ACGTCCGCCA GCTCATCGAC
CAGACCTTCC ACCTCGAGGA AGGCGCCGGG ATGATCAACG ACATCTCGTT CGGCGCCCAG
ATCAAGCTGC ACACCTACTG GAGCGAGCAG ATCGAGCTCC GCCGCCGCCA GCCCCGCGAC
GACATGATGA CCGCGCTGGT CGAGGCGGAG GTCAAGAGCG AGACCGGCTC CCGGCGTCTC
ACCACCCAGG AGGCCGCCGA CTTCACGAAC CTGCTCGTCA GCGCCGGCAC CGAGACGGTC
GCCCGGCTGC TCGGCTGGGC GGGGTTCGTC CTGGCCGCGC ACCCCGACCA GCGCGCCGAG
ATCACCGCCG ACCCGTCACT GATCGGCAAC ACGATCGAGG AGCTGCTGCG CTACGAGGCG
CCGTCCCCGG TGCAGGGCCG GGTGCTGACC AGGGAGGTGG AGCTGCACGG CACGGTCCTC
CCCGCCAAGT CGAAGGTCCT GCTGCTCACC GGCTCGGCCG GCCGGGACGA ACGGAAGTAC
CCCGACGCCG ACCGGTTCGA CATCCACCGC CGGTTCGACA GCCATGTCTC CTTCGGCCAC
GGCGTGCACT TCTGCCTCGG CGCATCCCTC GCCCGGCTCG AGGGACGGGT CGCGCTGCAG
GAGACCCTGC ACCGTTTCCC CGAATGGGAC GTCGACCACG ACCGCGCGGT CCGCCTGCAC
ACCAGCACCG TCCGCGGCTA CGAAAAGCTC CCGATCACCC TCGGCTAG
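
The GC content field reported for this gene can be recomputed from the sequence above. This is a minimal sketch, assuming the sequence is pasted as text with the spaces and line breaks shown here; `gc_content` is a hypothetical helper, not a function of the source database.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases into blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGACGGAAC TAAGCTGGGA TCCGTTCGAC AAGGTGATCC ACCTTGCGCC GTACGACGTG"
print(f"{gc_content(first_line):.1f}%")
```

Passing the full gene sequence instead of `first_line` gives the value to compare with the GC content field.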
 
Protein sequence
MTELSWDPFD KVIHLAPYDV WRRMRDEAPV YRNDRLDFFA LSRHADVEAA HRDPATYSSA 
HGTVLEIMSP EPMQTGLIIF IDPPTHTELR TLVSRAFTPR RISALEDSIR ALCAEMLDPQ
VGGSGFDYVQ DFAAQLPSKV ISELIGVDPA DREDVRQLID QTFHLEEGAG MINDISFGAQ
IKLHTYWSEQ IELRRRQPRD DMMTALVEAE VKSETGSRRL TTQEAADFTN LLVSAGTETV
ARLLGWAGFV LAAHPDQRAE ITADPSLIGN TIEELLRYEA PSPVQGRVLT REVELHGTVL
PAKSKVLLLT GSAGRDERKY PDADRFDIHR RFDSHVSFGH GVHFCLGASL ARLEGRVALQ
ETLHRFPEWD VDHDRAVRLH TSTVRGYEKL PITLG
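
The protein sequence above corresponds to the gene sequence read under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below assumes the standard NCBI layout of that table, with codons ordered over T, C, A, G; `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Amino acids for NCBI translation table 11, in the usual codon order
# (first, second, third base each cycling through T, C, A, G).
TABLE_11 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), TABLE_11)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases into blocks of ten.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TO_AA[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACGGAAC TAAGCTGGGA TCCGTTCGAC"))  # MTELSWDPFD
```

This sketch does not handle the alternative start codons that table 11 permits. The gene here starts with ATG, so the distinction does not matter for this record.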