Gene Francci3_2664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2664 
Symbol 
ID3904888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3144476 
End bp3145801 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content69% 
IMG OID637879989 
Productcytochrome P450 
Protein accessionYP_481755 
Protein GI86741355 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.100989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGA GGGGTTCCAG GGGACCGGTG CCGGGCGACG GAGGTCCGCC CCTGGTGGGA 
TACACCCTGC GCTACCTGCA TGACCCTGCC CAGCACTGGC GGCAGCGCTA CGACCGGTAT
GGCCCGGTGT CATGGGAGCG GACCTTCGGG CTGCGGGTGG TGAGCCTGCT CGGCCCGGAC
GCCACCGGCT TGGCGCTGCG CAACCATGAG CAGGCATTCG CGAACGGCCC GGGGCAGCAA
CGGATAGCCG GACCGTTCTT TCGGCGCGGA TTGAGCATGC TCGACTTCGA CGAGCACCGC
CACCACCGTC GGATCCTCGC CGGCGCGTTC GCCCCGGACA GACTGCGCGG CTATCTCGCC
GGGATGAACC CGTCCATCGA ACGGGGCGTC GCGGGGTGGC GGCCGGGCGC GAGGTTCCAG
GTTTATCCCG CGGTCAAACA GCTCACCCTC GAACTGGCGA CCCGGATCTT CATGGGTGAA
CGGCTGGGAC CGGAAGCCGA CCGGTTCAAC GCCGCGCTGT TCGCCTGCAT CCGCGCGCCG
GGCGCGGTGG TCCGCGTGCC GGCACCCGGG CTACGTTGGT CGCGTGGCTT GGCCGGGCGC
CGGTACCTGG AGGAGTTCCT GCGGCTGCGG GTGCCCGCGA AACGAGCCGG GAGCGGCACC
GACATGTTCA GCCGGCTCTG CCACGCCGAG GCCGAGGACG GCAGCAGGCT GAGCGACGAC
GACGTCGTCA ACCACATGAT CCTGATGATG GTGGCCGCGC ACGACACGTC CACGATCACG
ATGACCAGCA TGAGCTACTA TTTGGCACGG CATCCGGAGT GGCAGCAGCG GTGCCGCGAG
GAGTCGCTCG CCCTCGGCAC GCCAGCGGTG GACCATGCCG ACCTCGATCG GCTGCCGTCA
CTCGCTCTGG TCATGAAGGA GGCGCTGCGC CTGGTCACGC CGGTGCCGAT CCTCCTGCGC
GCCACAGTGA AAGACATCGA CGTGCTCGGC GTCACGGTGC CCGCCGGTAC CGTGGCCGCC
CTCGCGCTCG CCTTCACCCA CCAGATGCCG GAGTACTGGC CGAGCCCGGA ACGGTTCGAC
CCGGAGCGGT TCGCGGACCA CCGTCGCGAG GACAAGGTAC ATCCCTACGC CTGGCAACCG
TTCGGTGGCG GGCCACATAC CTGCATCGGT CTGCACTTCG CCGGTCAGCA GGTGAAGGCG
ATCCTGCACC AGATGCTGCT GCGGTACCGG TGGAGCCTGG CACCGGGCTA CCGGATCTCA
TTAGACCGTT TCCCGCTGCC TGTTCCACGG GACGGGCTAC CGGTCCAGCT GGAAAAGATC
ACCTGA
 
Protein sequence
MAVRGSRGPV PGDGGPPLVG YTLRYLHDPA QHWRQRYDRY GPVSWERTFG LRVVSLLGPD 
ATGLALRNHE QAFANGPGQQ RIAGPFFRRG LSMLDFDEHR HHRRILAGAF APDRLRGYLA
GMNPSIERGV AGWRPGARFQ VYPAVKQLTL ELATRIFMGE RLGPEADRFN AALFACIRAP
GAVVRVPAPG LRWSRGLAGR RYLEEFLRLR VPAKRAGSGT DMFSRLCHAE AEDGSRLSDD
DVVNHMILMM VAAHDTSTIT MTSMSYYLAR HPEWQQRCRE ESLALGTPAV DHADLDRLPS
LALVMKEALR LVTPVPILLR ATVKDIDVLG VTVPAGTVAA LALAFTHQMP EYWPSPERFD
PERFADHRRE DKVHPYAWQP FGGGPHTCIG LHFAGQQVKA ILHQMLLRYR WSLAPGYRIS
LDRFPLPVPR DGLPVQLEKI T