Gene Franean1_3475 details


Gene Information

Locus tag: Franean1_3475
Symbol:
ID: 5671846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4131608
End bp: 4132804
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 73%
IMG OID: 641242363
Product: cytochrome P450
Protein accession: YP_001507783
Protein GI: 158315275
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
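
The length fields above can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4131608
end_bp = 4132804

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1197 398"
```

Under these assumptions the computed values agree with the listed Gene Length (1197 bp) and Protein Length (398 aa).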


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000451038
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00686408
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTGGGA CCGAGATCCC CGAGTACCCG CTGGCGCGGA CGTGCCCGTT CCACCCGCCC 
GCCGGTTACG CCCGCTACCG CGAGCACGGG CCGGTGAACC CGGTGCGGCT CTACGACGGC
CGGCGGGTGT GGGCCGTCAC CGGGCACGCC GAGGCCCGCG AGGTGCTGCT GAACACGCGG
CTGTTCTCCT CGGAGCGCGC CGACCCGCGC TATCCGGCGA CCAGCCCCCG TTTCGAGGCG
GCCCGCAAGG TCCGCAACTT CATCGGCATG GACCCGCCGG ACCACACCGC GCAGCGGCGG
ATGCTGCAGT CCAGCTTCAC CATGCGCCGG ATCAACGGCC TGCGCCCCGG CATCCAGCGG
CTCGTCGACG AACTGCTCGA CGCGATCGTC GCCAAGGGCC CGGTGGTCGA CCTGGTGCCC
GAGTTCGCGC TGCCGATCCC GTCGATCGTC ATCTCCGAGC TGCTCGGGGT GCCCTACGGG
GACCACGCTT TCTTCGAGCA GCAGTCCCGG CGGGTGGCCA GCGGCACCTC GACGCTGGAG
GAGAGCGCGG ACGCGTTCAC CCAGCTGCTC CAGTACCTCG ACGGGCTCAT CCAGGACAAG
GAGCGCTCCG CCGGCGACGG CCTGCTCGAC GTCCTCATCG CCGAGCAGGT GCGCCCCGGG
GTCCTGACCC GGCGCGAGCT CGTCGACATC TCGCTGCTGC TGCTCGTGGC CGGCCATGAG
ACGACGGCCA GCGCCATCGC GCTCGGCGTG GTCGCCCTGC TCGAGCACCC CGACCAGCTC
GCCGCGCTGC GCGCCGACCC CGCGCTGCTG CCCAGCGCCG TCGAGGAGCT GCTGCGCTTC
ACCACGATCG CCGACAGCGT GGCCCGGTTC GCGACGGCCG ACACCGAACT GGCCGGCCAG
CCCGTCGCCG CCGGGGACGG TGTTCTCGTC GTGCTCTCCG CCGCGAACCG CGACGGCACG
GTCTTCCCCG ACCCGGACCT CCTCGACCTG GCCCGCCGCG CCCGCAGCCA CGTGGCCTTC
GGCCACGGCG CGCACCAGTG CATCGGGCAC AACATCGCCC GCGCCGAGCT GGAGATCGCG
TTCTCCACGC TGTTCGCCCG CCTCCCCGGC CTGCGGCTCG CGGTGCCGCT CGACCGGCTG
CCCGGCAAGG ACGCCGGCGG GGTGCAGGGC GTCTTCGAGC TGCCCGTCGC CTGGTGA
 
Protein sequence
MTGTEIPEYP LARTCPFHPP AGYARYREHG PVNPVRLYDG RRVWAVTGHA EAREVLLNTR 
LFSSERADPR YPATSPRFEA ARKVRNFIGM DPPDHTAQRR MLQSSFTMRR INGLRPGIQR
LVDELLDAIV AKGPVVDLVP EFALPIPSIV ISELLGVPYG DHAFFEQQSR RVASGTSTLE
ESADAFTQLL QYLDGLIQDK ERSAGDGLLD VLIAEQVRPG VLTRRELVDI SLLLLVAGHE
TTASAIALGV VALLEHPDQL AALRADPALL PSAVEELLRF TTIADSVARF ATADTELAGQ
PVAAGDGVLV VLSAANRDGT VFPDPDLLDL ARRARSHVAF GHGAHQCIGH NIARAELEIA
FSTLFARLPG LRLAVPLDRL PGKDAGGVQG VFELPVAW
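
The relationship between the two sequence blocks above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        return 0.0
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGACTGGGA CCGAGATCCC CGAGTACCCG"
print(translate(first_bases))  # prints "MTGTEIPEYP"
```

The printed peptide matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.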