Gene Francci3_1930 details

Gene Information

Locus tag: Francci3_1930
Symbol:
ID: 3904292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2269043
End bp: 2270332
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 68%
IMG OID: 637879267
Product: cytochrome P450
Protein accession: YP_481034
Protein GI: 86740634
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
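
The start, end, gene length, and protein length fields above are mutually consistent, which a short sketch can check. It assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, chosen only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2269043, 2270332)
print(length, protein_length(length))  # prints: 1290 429
```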


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.708059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTCA CAACCGATTC CGTATTCGCC GGGGCGGACG CCGGTACCGG CGTCGCGGCC 
GACCACGGGA AACCGCTCCC GGACCAGGCG CCTGATCTGC TGGCCGCGTT GTTCGATCCC
GCGAACAGGC CGAATCCCTA TCCGCTCTAT GCGCGCCTGC GGGCAGCCGG CCGGCTGCAC
GAGACGCCGT TCGGACTGCG GGTCGCCACC CGGCACGACG ACTGTGTAGC GGTACTGTCC
AACCAGAGCT GGGGGCACGA TCAGGAGGCC CACCAGCTGC ATCCCACCTT GCCCGCCGAG
GAGTTCCCGG CGACGTTCCT GTGGATGGAG CCGCCGGACC ACACCCGGCT GCGCGGCCTG
GTGAGTAAGG CGTTCACCCC GGGTCGGGTC GCCGATCTGC GGCCGCGTAT CACCGCCCTC
GTCGACGATC TGCTCGACAC CGCGCTGCGG GCCGGAGAGT TCGACCTCAT CGAGACGATC
GCGTACCCGC TGCCACTCAC GATCATTTGT GAGATCCTCG GGGTGCCGGC GGCCGACCAC
GGCATCATCC AGATCTGGTC GCAGGCGTTG GCCAGGGCGT TCGACCCGGA CGTCCTGATG
TCTCCGGAGG CGTTGGCGGA GCGTAACGAC GCCATCCCCG AGTTTCTCGC CTACTTCCGT
GCGCTGGTCG CGCGCAGCCG CCGCAGCGGT GGCGACGATC TGCTCAGCGC GCTGGCTGCC
GTCGAGGAGC AGGGTGATCG GCTGACCGAG GACGAACTGC TCGGCACCTG CGTGACCCTG
CTGATTGCCG GCCACGAGAC GACGGTCAAC CTGGTCGGCA ACGGGGCGCT GGCGTTGCTG
CGTAACCCCG ACCAGACGGC GTTGCTGCGC GACGAACCGG ATCTCATCCG GCCCGCCGTC
GACGAGCTGC TACGCTATGA CTCCCCGATC CACCTCAACA CCCGGGCAGC GACCCGGGAG
ATGACGGTGG GTGGCCGAAC CTTTTCCCCG GGTGAGGGTG TGGTGGCGTT GATCGCTTGC
GCCAATCGTG ATCCGGAGGC ATTCGACGAC CCGGACCGTC TCGATGTGCG CCGCTACGTG
GCAGGCTCCG GCGCGTCCCG GCACCTGTCG TTCAGCCTCG GTCACCACTA CTGCCTCGGC
GCGCCGTTGG CGTTGCTCGA GATGGAGATC TTCCTGGCGG GGTTCCTCCA CCGGGTCCGG
TCCGCCGAGC TTCTCGTCGA TTCGCCGCCC TACAAATCCA ACCTGCTCAT CCGAGGACTG
GCGGAGCTCC CCGTCCGCTT CCGGGGATAG
 
Protein sequence
MAVTTDSVFA GADAGTGVAA DHGKPLPDQA PDLLAALFDP ANRPNPYPLY ARLRAAGRLH 
ETPFGLRVAT RHDDCVAVLS NQSWGHDQEA HQLHPTLPAE EFPATFLWME PPDHTRLRGL
VSKAFTPGRV ADLRPRITAL VDDLLDTALR AGEFDLIETI AYPLPLTIIC EILGVPAADH
GIIQIWSQAL ARAFDPDVLM SPEALAERND AIPEFLAYFR ALVARSRRSG GDDLLSALAA
VEEQGDRLTE DELLGTCVTL LIAGHETTVN LVGNGALALL RNPDQTALLR DEPDLIRPAV
DELLRYDSPI HLNTRAATRE MTVGGRTFSP GEGVVALIAC ANRDPEAFDD PDRLDVRRYV
AGSGASRHLS FSLGHHYCLG APLALLEMEI FLAGFLHRVR SAELLVDSPP YKSNLLIRGL
AELPVRFRG
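
The sequences above are laid out in space-separated blocks of ten, so they need cleaning before any computation. The sketch below shows one way to strip the layout, compute GC content, and translate a CDS. It relies on the fact that NCBI translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ in permitted start codons), and it stops at the first stop codon. The helper names `clean`, `gc_percent`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence listed above.
print(translate("ATGGCAGTCA CAACCGATTC C"))  # prints: MAVTTDS
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field listed in this record.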