Gene Franean1_4491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4491 
Symbol 
ID5672841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5357824 
End bp5359089 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content67% 
IMG OID641243358 
Productcytochrome P450 
Protein accessionYP_001508774 
Protein GI158316266 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTTG ACGACACCGA GGCGTCCGCG CCCACGACCC AGCGCGGGTA CGCCACGGCC 
CCCACGCTCG CGGGGTTCTC GCACGCGGCG AACGAGCGTC TCAATGCCGA CCCGTGGGGC
GAGCTCGACC GGCTCCGCGA CGAGTCGCCG ACATTCCGTA GCGACATGCC GAATCCTCTC
GTCCCCGGCG CGTCGCTGTG GTATCTCCTC GACTACGAGA GCGTCTACAC CGCGCTGCGT
GACTGGGAGA CCTTCTCCAA CGTGGGATCG GCGCACCCGT TCTCCGACAG CGACCCGTAC
AGCATGATCC CCGGCGAGCT GGACCCGCCG GACCACACCA AGTTCCGGCG GCCGCTGAAC
GCCCACTTCT CGCCCGGCGC GATCCGCGCG CTCGAGCCGG ATATCCGCCG GACGGCCGTT
GAGCTCATCG AGTCGTTCAA GGATTCCGGC CAGTGCGACT TCGTCACCGA CTTCGCGCTG
CACTTTCCTA CCCGGGTCTT CGAGCGGATG TTCGGCGTCC CGCTCGAGGA TCACGACCAG
CTCACCGCGT GGGTGCATAC CTTCGGCCAG CAGATGGCGA CACAGACCGC GATCGACAAG
GCCGTCGCCG CCGAGCAGGA GGTGCTGGCC TACCTGGGGA AGAAGCTGGA CGAGCGCGAG
CAGTCCCCCA GGGAGGACCT GCTCGGCGCG ATCGCCTTCA TGGAAGTCGA CGGCGCCCGG
ATCAGTCGCA AGGAGCAGGT GGCGGTCGCC TACCTGATGT TCCAGGCGGG TATGGACACG
GTGGCGAGCC AGCTGGGCTG GTCCTTCCGC CACCTCGCCG AGAACGAGGT CGACCGGCAG
GCGATCCTCG CCGACCCGAA GCTGATTCCC TCGACGGTGG AGGAGCTCCT GCGCTCCTAC
GACATCCTCT CGCACACCAT GATCGTCGCC AAGGATGTCG AGTTCAACGG CTGCCCGATG
AAGAAAGGCG ACCGGGTGGT CACCATGATC TCGGCGGCGA ACCGGGACCC GAACGAGTTC
CCGGACCCGG ACACCTTCGA CGTCTCCCGC AAGCCGAACC GGCACATGGC CTTCGGGGTG
GGGCCGCACC GCTGCATCGG TGCGCACCTG GCCCGGATCG AGCTGAACAT CGCGCTGGAG
GAGTGGCACC AGCGGATCCC GAATTACAAG GTGGCCGAGG GCGCCGAGTT CGGCCAGTCC
ATGAAATGGG CGGTCACCTC GATGGAATCG CTCCCGCTCG AATGGGATGT CGAGGCGGTG
AACTGA
 
Protein sequence
MSVDDTEASA PTTQRGYATA PTLAGFSHAA NERLNADPWG ELDRLRDESP TFRSDMPNPL 
VPGASLWYLL DYESVYTALR DWETFSNVGS AHPFSDSDPY SMIPGELDPP DHTKFRRPLN
AHFSPGAIRA LEPDIRRTAV ELIESFKDSG QCDFVTDFAL HFPTRVFERM FGVPLEDHDQ
LTAWVHTFGQ QMATQTAIDK AVAAEQEVLA YLGKKLDERE QSPREDLLGA IAFMEVDGAR
ISRKEQVAVA YLMFQAGMDT VASQLGWSFR HLAENEVDRQ AILADPKLIP STVEELLRSY
DILSHTMIVA KDVEFNGCPM KKGDRVVTMI SAANRDPNEF PDPDTFDVSR KPNRHMAFGV
GPHRCIGAHL ARIELNIALE EWHQRIPNYK VAEGAEFGQS MKWAVTSMES LPLEWDVEAV
N