Gene Francci3_1608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1608 
Symbol 
ID3903743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1928994 
End bp1930244 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID637878945 
Productcytochrome P450 
Protein accessionYP_480713 
Protein GI86740313 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.894041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCCG ATTCCCCGTC CGGTGCGGCC GCGGTCGGCA CGCCGCCGGC CTTCGGCGCG 
TTCGATCCGG CCCGACGCCA TGATCCCTAC CCCAGCTACC ACGCCCTACG GGAGGCGGAT
CCGTTCTACC GTCTGCCGCT GGGCGCCCAG CAGGTCACGC TGCTCACCCG CTACCAGGAC
TGCATGCACG TTCTGCAGGA CGCCGCCTGG GGCCGGGGTG AGGGGGGAAC CAACGCCTGG
CGTTCCGCCA ACAGCTTCGA CGGCGGGCTG CGGTCGCTCC TCGGCGTGAA CCCGCCCGAT
CACACCCGGC TGCGCGGGCT GGTCAGCAAG GCGTTCACTC CCCGGGTCAT CTCCGGGCTG
CGGCCACAGA TCACCGTCCT GGTCGAGTCG CTGCTCGACG CCGCGCTCGC GGCTGGCGAG
GTGGATCTGA TCGACGCGTT CGCCCGTCCC CTTCCCCTGA GAATCATCTG CGATCTGCTG
GGTGTGCCCG TGCGGGACGA GGAGACGTTC CGGGCCTGGG GAACCGCCCT CACCCGCGGC
CTCGATCCGG ACTATCTGCT CACACCCGAC GAGCTGGCCC TGCGAGGGAA GGCCACCGTC
GAGTTCGACG CCTACTTCAC GGATCTGATC GCGGCCAGAC GGGCTCGGCC GACCGACGAC
CTGCTCGGTC TTCTCGTGGC GGTGCGGGAG CAGGGTGACA GCCTGACCGA GGCCGAACTG
CTGGAGCTGT GCGCTCTGCT GCTGGTCGCC GGGTACGAGA CCACGATCAA CCTGATCGGC
AACGCGGTGC TCGCCCTGCT GCGTGACACC GACCAGCTCA CCGCGCTGCG GGCCGACCCC
GACCTGGCGC CCGCACTCGT TGACGAGACG CTGCGCCACG ACCCGCCCGT CCAGTTCGTC
GGGCGTCTCG CGCTGCGTGG CACCGAGGTC GCCGGGCACT CCTTCGCCGC GGGCGAGGTC
GGCGTCATCA TGCTGGCCGC CGCCGGGCGG GATCCGCGGA CCTTCGCGGA GCCGGACCGG
TTCGACATCC GCCGGTACGC CGGGCCGACA CCCGCCCCCC GTCATCTCGG CTTCGGGTTG
GGGATCCACT ACTGTCTCGG CGCGCCGCTC GCCCGACTGG AGGCGGAGAT CGCCCTGCAG
GCACTCGTGC GCCGGGCCGG TAGCCTCACG CCGGCCAGCG ACCCTCCCTC CTATCGCCCG
CATCTGGCCG TGCGTGGCCT GGAGACACTC CCCATCCGGC TGTCGCCCTG A
 
Protein sequence
MDADSPSGAA AVGTPPAFGA FDPARRHDPY PSYHALREAD PFYRLPLGAQ QVTLLTRYQD 
CMHVLQDAAW GRGEGGTNAW RSANSFDGGL RSLLGVNPPD HTRLRGLVSK AFTPRVISGL
RPQITVLVES LLDAALAAGE VDLIDAFARP LPLRIICDLL GVPVRDEETF RAWGTALTRG
LDPDYLLTPD ELALRGKATV EFDAYFTDLI AARRARPTDD LLGLLVAVRE QGDSLTEAEL
LELCALLLVA GYETTINLIG NAVLALLRDT DQLTALRADP DLAPALVDET LRHDPPVQFV
GRLALRGTEV AGHSFAAGEV GVIMLAAAGR DPRTFAEPDR FDIRRYAGPT PAPRHLGFGL
GIHYCLGAPL ARLEAEIALQ ALVRRAGSLT PASDPPSYRP HLAVRGLETL PIRLSP