Gene Franean1_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1901 
Symbol 
ID5670302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2281017 
End bp2282252 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content71% 
IMG OID641240822 
Productcytochrome P450 
Protein accessionYP_001506244 
Protein GI158313736 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.908058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACA CCCGAACCCC GCGGGCCGCC AGCGGGCCGT ACGTCTTCAA TCCGTTCGCC 
GACGGCTTCG CCGAGGACCC CTACCCGCAC TATGCCCAGC TGCGCGAAAA CGCGCCGGCG
CACCGGCATC CGCTGGGATT CTGGCTTGTC TCCCGGTACG AGGATGTGGC GAGAATCCAG
CGCTCCGGGC ATTCCGTCGA CGAACGGCAC ATCACCGAGC TTCCGGAGTG GAAAAGCGAG
TCGCGGACCC TGGGCAAGCA GAACCGGCTC ATGCACGGCC TGTCCATGCT CGACCAGGAT
CCACCGAACC ACACCAGGCT GCGCCGGCTG GTGACGAAGG CGTTCACCCG GCGCGCCGTC
GACGCGCTCG AAGGGCGGAT CGAACGCATC GTCGACGACG CTCTCGACCG GATGGCCGAG
GCGGGCCGGG TCGACCTCGT CGCGGAGCTG GCCTTCCCGC TCCCGTTCAC CGTGATCTCC
GAGCTGCTGG GCATCCCGGT GCTGGAGCAC GGCAGGCTCC GCGAGCTGAC CGGGACGATG
GTGCGGGCCC TCGAGCCGCT GCCCGACCCG GGGCTGCAGG CCGAGATCCG GGCGGCGAAC
GACGAGGTGG CCGCCATCAT GCGCGCGGTG ACCGACTGGA AACGTGACAA CCCCGGCGAC
GACCTGCTGA CCGCGCTCAT CGGCGCGGAG CACGACGGTG ACGTTCTCAG CGCGGAGGAG
CTGGTCGCGC AGGTCATGCT GCTGTACGTC GCCGGCCACG AGACGACGGT GAACCTCGTC
GCGGGCGGGA TCCTCACGCT GCTGCGCCAC CCCGACCAGA TGCGCCGGCT GCGCGACGAG
CCCGAGCTCG CCGGGAACGC GGTGGAGGAG CTGCTGCGCT ACGACAGCCC GGTGCACCTG
ATGCGGCGGA TCACCCTGGA GCCGCTGTCC GTGCGTGGCA CGGAGATCCC GCCCGGCGTG
TTCGTGACGG TGTGTCTCGC GGCGGCCAAC CGGGATCCCG ACTTCTGGGG CCCGGACGCC
GACGAGGTCC GCCTCGACCG TACCGACGCG CACCGGCACG TGTCCTTCGG CGCCGGGATC
CACCACTGTG TGGGCGCGGC GCTGGCCCGG TTGGAGGCCC GGGTGGCAAT CTCCCGCTTC
GTCGGGCGAT TCCCCGCGCC GGCGCTCGAG GACGTCCGCT GGAACGGCCG GATCAACGTC
CGCGGCCCGG CCTCGCTGAC GGTCGCCGTC CGGTAA
 
Protein sequence
MTDTRTPRAA SGPYVFNPFA DGFAEDPYPH YAQLRENAPA HRHPLGFWLV SRYEDVARIQ 
RSGHSVDERH ITELPEWKSE SRTLGKQNRL MHGLSMLDQD PPNHTRLRRL VTKAFTRRAV
DALEGRIERI VDDALDRMAE AGRVDLVAEL AFPLPFTVIS ELLGIPVLEH GRLRELTGTM
VRALEPLPDP GLQAEIRAAN DEVAAIMRAV TDWKRDNPGD DLLTALIGAE HDGDVLSAEE
LVAQVMLLYV AGHETTVNLV AGGILTLLRH PDQMRRLRDE PELAGNAVEE LLRYDSPVHL
MRRITLEPLS VRGTEIPPGV FVTVCLAAAN RDPDFWGPDA DEVRLDRTDA HRHVSFGAGI
HHCVGAALAR LEARVAISRF VGRFPAPALE DVRWNGRINV RGPASLTVAV R