Gene Francci3_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1604 
Symbol 
ID3903739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1925242 
End bp1926552 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID637878941 
Productcytochrome P450 
Protein accessionYP_480709 
Protein GI86740309 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.454248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGG GCGGGATGAC GTCCCCGACA CCCAGCACCG CGACCGCCGA CACCGCGACC 
GCCGACACCG CGACCGCCGA AGCCACCCGG GCACCGATCC GCTTCAACCC CTTCGGCGCC
GAGTTCCGGC GCAGTCCGTA CCCGCTCTAT GCGCGGCTTC GAGGCGCGCA GCCCGTCCAC
CGGACGCTGG GCATGTGGGT GCTCACGCGG CACGCCGACG TGCGCGGTGT CCTGCACGAC
CGGACGTTCA GCGCCGGGCT CATCCCCCAG CTGGTCAGCC GGCAGGCGTC GCGGCTGTCG
CGGGACGACG TGGCGCGGAT CGGGCGGCTG GCGACCAAGT CGCTCGTCTT CACCGACAAC
CCCGACCACG CCCGGCTGCG TGGCCTGGTG AACCGGGTAT TCACGGCGCA GGCGGTCGCC
GAGCTGCGCC CCCGCGTCCA CGAGGTCGCG GAGCGGCTGG TGCGGCGAGC CTGGGACGAC
GGCGGCATGG ACGTCGTCGC CGATCTCGCG GGGCCCCTAC CGCTCACGGT CATGTGCGAC
TGGATGGCCC TTCCGGACAG CCTGCGCGAG CGCGTCGGGC CTTGGACGCA TGACATCCGC
TTCCTGCTCG AGCCGGGGCT GATGAAGACC GAGGACTTCA CCCGGGTATC CGACGTCGTC
GAGACATTCG CGCAGGCCCT GGACGACGTG GTCACCGAAC GCCGGTCCCG GCCGGGTGAC
GATCTGATCA GCCGGCTGCT CGCCGCGCGG ACGGCCGGGG GCGACCGGCT CAGCGATGAG
GAAGTCGTCT TCGTCTGCAT CATGTGCTTC GTGGCCGGCA ACGAGACCAC GAAATCCCTC
ATCGGCAACG GTCTCCTCGC GTTGCTCCAG CACCCGGATC AGGACGCGCG CCTGCGCCGT
CGGCCGGAGC TGCTGGGCGG CGCCGTCGAC GAGGCTCTGC GCTACGACAG TCCCCTGCAG
CTGACCAAGC GCGTCGCGAC GCGCGAGGTC GAGATCGACG GAAGCCGGAT CCGGGCAGGC
GATCAGGTTC TGCTGTGCCT CGGCGCCGCG AACCGCGACC CCGCGGTGTT CAGCCGGCCG
GACGAGTTCG ACATCACCCG CGACGCACGA GGGCATCTCG CCTTCGGCCA CGGGATGCAC
GGGTGCCTGG GCGGGATTCT GGCGCAGTTG CAGGCCCAGG TCGCCCTTGA TCGGTTGTAC
CGGCGAGCCG AACGCCTGGA ACTCCTGGTC ACGCAACCCG ACTGGCAGGA TCACAGCTTC
ATCGTCCGCG GGCTGAAGCA GCTTCCGGTC TCGGTGCGGG GTGTCGGGTG A
 
Protein sequence
MSTGGMTSPT PSTATADTAT ADTATAEATR APIRFNPFGA EFRRSPYPLY ARLRGAQPVH 
RTLGMWVLTR HADVRGVLHD RTFSAGLIPQ LVSRQASRLS RDDVARIGRL ATKSLVFTDN
PDHARLRGLV NRVFTAQAVA ELRPRVHEVA ERLVRRAWDD GGMDVVADLA GPLPLTVMCD
WMALPDSLRE RVGPWTHDIR FLLEPGLMKT EDFTRVSDVV ETFAQALDDV VTERRSRPGD
DLISRLLAAR TAGGDRLSDE EVVFVCIMCF VAGNETTKSL IGNGLLALLQ HPDQDARLRR
RPELLGGAVD EALRYDSPLQ LTKRVATREV EIDGSRIRAG DQVLLCLGAA NRDPAVFSRP
DEFDITRDAR GHLAFGHGMH GCLGGILAQL QAQVALDRLY RRAERLELLV TQPDWQDHSF
IVRGLKQLPV SVRGVG