Gene Franean1_5387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5387 
Symbol 
ID5673719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6495897 
End bp6497153 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content65% 
IMG OID641244243 
Productcytochrome P450 
Protein accessionYP_001509649 
Protein GI158317141 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.659302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGAC CAGATAGCAT TGTGTTGCTT GACCGGGCGC GTTTGCGCGA GGTTTTCGAT 
CTCCGCAATG AAGCGAACCT CGGCACGGTC GCGGGCTATG AGGAAGACCC GTACCCGAGG
TGGCACGAGC TTCGGGAGCT GGCAGCGGTA CATCCCGGGA CGTTGCACGA GTTGACGGGT
TTCAGCGGCC CGGTGTTGTT TCAAGGGTTG CCGTTCGAGG ACCGGCCGCA CTTCACCGCG
TTCACGTTCG CGGCGTGTGA CGAGGCGTTG AAGAACCAGG AGGTTTTTGC TTCGTCGCCG
GTAGCGGTCG ATCTCGAGGG CGGACGGCTC GCTCCACTGA ACAGCATATT CTCCATGGCC
GGTGCCCAGC ACCGCCGTTA CCGGAGGCTG GTGCAGTCGT CGTTCGTGCC ACCGCGGATG
GCCTGGTGGA CCGAGAAGTG GATCGAAACG ACGGTACACG CATTGATCGA CTGGTTCGCC
GGCGACGGCT CTGCGGACCT GAACGTCGAT TTCTCCGCAG CGGTACCGGT ACTGACGATC
ACGGGCAGCT TCGGGGTGGC GGTCGAGCAG GCGATCGCGA TCCGGGAGGC TTTGAGTAGC
CCGGAGCGGC TCGTACCGCT GCTGGCGCCG ATCATCGCCG CCCGCCGCGA GACTCGCGAG
GACGATCTCA TCAGCGTCCT GGTCGACGCC GAGGTGCAGG ACGACGACGG GAACCCTCAC
CGCCTGTCGG ATGCCGAGAT CTATTCGTTC GCGGTGTTGC TCCTCATGGC GGGATCGGGC
ACGACATGGC GACAGATGGG GATTGTACTG ACCGCGTTGT TACAGCGCCC GGAAATCCTC
GACGCGGTAC GCCGGGACCG GCAGCTGCTG CGTAACGCGA TCGACGAGTC GTTGCGGTGG
ATGCCCACCA ATCCCATGTT CTCCCGGTTC CTGACCAAGG ACGTTGAATT CCACGGCGCG
CATCTTCCGA AGGGCGCGGT GTTGCACCTC GCGCTGGGTG CCGGCAGCCG GGATCCCCGC
CGCTGGGAAC GGCCCGACGA GTTCGACGTG ACCCGCCCAC CGAAGCCGTC GCTGGGGTTC
GGCGGGGGAC CCCACGTGTG CCTGGGGATG CATGTCGCCC GGGCCGAGAT GTACACGGGC
ATCGGCGCCC TGCTCGACCG GCTGCCGAAC CTTCGGCTCG ATCCCGACGC CAACCCTCCC
CGCATCATCG GCATGTACCA CCGCGGCCCG ACAGCGATCC CCGTGCTGTT CGGTTGA
 
Protein sequence
MTRPDSIVLL DRARLREVFD LRNEANLGTV AGYEEDPYPR WHELRELAAV HPGTLHELTG 
FSGPVLFQGL PFEDRPHFTA FTFAACDEAL KNQEVFASSP VAVDLEGGRL APLNSIFSMA
GAQHRRYRRL VQSSFVPPRM AWWTEKWIET TVHALIDWFA GDGSADLNVD FSAAVPVLTI
TGSFGVAVEQ AIAIREALSS PERLVPLLAP IIAARRETRE DDLISVLVDA EVQDDDGNPH
RLSDAEIYSF AVLLLMAGSG TTWRQMGIVL TALLQRPEIL DAVRRDRQLL RNAIDESLRW
MPTNPMFSRF LTKDVEFHGA HLPKGAVLHL ALGAGSRDPR RWERPDEFDV TRPPKPSLGF
GGGPHVCLGM HVARAEMYTG IGALLDRLPN LRLDPDANPP RIIGMYHRGP TAIPVLFG