Gene Francci3_2111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2111 
Symbol 
ID3905638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2477263 
End bp2478798 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content73% 
IMG OID637879446 
Productmonooxygenase, FAD-binding 
Protein accessionYP_481212 
Protein GI86740812 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0054618 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.499354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGT CGGCACCGCG GCTGCCGGTG CTGGTCGCCG GCGCCGGACC AGCCGGTCTC 
ATGGCCACGA TCGAGCTGAC CCGGCGCGGG GTGCCGGTGC GCTGCATCGA CCGGGCCGGC
GGGCCGAGCA CCCTGTCCAA GGCACTCGGG GTGTGGCCAC GCACCCGTGA ACTGATCCGT
CGGATCGGGG GCGATGAGGC GCTGGCATCG AGATCCCTGC CGCAGACGCA GATGCGCTAC
TACTCCTCGG GGAAGGTCAT CGCCAACCTG CGCTACCGGA CGGCCACCCG GCCGCTGATC
TGCCCGCAGC CCGGCGTCGA GGAGGTACTG CGGGAGGTCC TGACCGGCCT CGGCGGCTCG
CCGGAATGGC GCACCGAACT GCTCGACCTC GACCAGTGCG ACGACCGGGT GCGGGTGCGG
GTGCGGTATC CGGACGGCGC GGAGCGGATC GAGGAGTTCG CCTACCTGGT CGGCGCCGAT
GGCGCGAGCA GCACGGTCCG GGCCCAGCTC GGCATCGGCT TCGACGGCGA CACCTATGAG
CTGCGCTTCG TGGTGGCCGA CGCGCTGGCG GACACCGCAC TCGATCCGAC GATGACCCAC
TACTTCTGCT CCACGCGCGG CATCCTGGTC GCCTGCGGGC TGCCCTCCGG CCGGTGGCGG
GTCTTCACCT CCGCGCCACC GGACTTCACC CAGCAGGGAG CCGACCTCGA CGCCGTCCAG
CGGCTGGTCG ACGAGCGGGG CCCGGGCGGG ATCGTCCTGC GTGACCCGGA CTGGCTCAGC
GTGTTCTCGG TGCACGCCAG GCAGGCCGAG CGGACCAGGG TCGGGCACGT GTTCCTGGTG
GGGGACGCGG CGCACATCCA CAGCCCGGCC GGCGGGCAGG GCTTGAACAC AGGCGTCACC
GACGCGCACA ACCTGGCCTG GAAGATGGCG TTCGTCTGGC ACGGCCGGGC GGATCCGGAT
CTGCTGGACA CCTACGCGGC GGAGCGTGGG CAGGTAGCCC GGGCGGTCGT CCGCCAGGCC
GACGTGCAGA CGCGGATCTG GCTGCTGCGC CGCGGCTACC AGGTGGCGCT TCGGGACACG
CTGCTGCGGG CCGCCTCAGC CCTACGACTG TTCGACATCT CCTACGTTCC CTGGCTGGCC
GGCCTGCGCA CCAGGTACCG GGTGGCGGCG TCGGAGGGCC GCGCGGTGGC CGGATTCCAG
CCGGGCGCAC TGATCCCGCT GCCGCTGCGG TCCGAGCTCG ACGACCTGCG CTACACCCTG
CTGATCTCGC GGCCCGAGCG GCACGGTTCC GGCGGCGCCT CGTTCGACGC ACTGGCGGAC
CTGTGCCGCG ACCACTTCGC GGACCGGGTG GACGTGCGGG TGCTCGACGG AAACGGCCGG
GCGCGGGGCG CTGTCGCCGC CCTGGTGCGC CCGGACGGGC ATGTCGACAC GGCCTCCCGT
GACGCCGCAC CGGTGCGGAC CCGACTGACC GCACTGTTCA CCGCACCGCC GACGAGGCAG
CCTGACGGCC TCGGCGACCT GAGGAGAAAA ACGTGA
 
Protein sequence
MTPSAPRLPV LVAGAGPAGL MATIELTRRG VPVRCIDRAG GPSTLSKALG VWPRTRELIR 
RIGGDEALAS RSLPQTQMRY YSSGKVIANL RYRTATRPLI CPQPGVEEVL REVLTGLGGS
PEWRTELLDL DQCDDRVRVR VRYPDGAERI EEFAYLVGAD GASSTVRAQL GIGFDGDTYE
LRFVVADALA DTALDPTMTH YFCSTRGILV ACGLPSGRWR VFTSAPPDFT QQGADLDAVQ
RLVDERGPGG IVLRDPDWLS VFSVHARQAE RTRVGHVFLV GDAAHIHSPA GGQGLNTGVT
DAHNLAWKMA FVWHGRADPD LLDTYAAERG QVARAVVRQA DVQTRIWLLR RGYQVALRDT
LLRAASALRL FDISYVPWLA GLRTRYRVAA SEGRAVAGFQ PGALIPLPLR SELDDLRYTL
LISRPERHGS GGASFDALAD LCRDHFADRV DVRVLDGNGR ARGAVAALVR PDGHVDTASR
DAAPVRTRLT ALFTAPPTRQ PDGLGDLRRK T