Gene Francci3_3201 details

Gene Information

Locus tag: Francci3_3201
Symbol: pyrC
ID: 3906167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3794449
End bp: 3795825
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 74%
IMG OID: 637880525
Product: dihydroorotase
Protein accession: YP_482287
Protein GI: 86741887
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0334723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGC CCGCGGCGGA AACGTCGGAG GAAACGCCGG TGGGGAGGTC GGCGGGGACC 
TCCTGGGTCC TGCGGCGGGT CCGCCCGCTC GGCGGGGACC CCGTCGACGT CGTCCTCGCG
GACGGGGTGG TGGCCGCCTG GCGACCGGCC GGCTCCACGC ACGCCGGGGC CGGGGGACTG
CCCGCCGGCA CCACCGTGCT GGACACGGAC GGGCTGATCC TGCTCCCCGG CCTGGTCGAC
CTGCACACCC ACCTGCGGGA ACCCGGCCGG GAGGACGCCG AGACGGTCGC CTCCGGCACC
CGCGCCGCGG CCCTCGGTGG CTACACCACC GTGTTCGCGA TGGCGAACAC CAATCCGGTC
GCCGACACCG CGGGGGTCGT CGAGCAGGTG TGGCGGCTCG GCCTGGACGC GGGTCACTGC
GATGTCCGGC CGGTCGGCGC GGTCACCGTC GGACTTGCCG GCGAGCGGCT CGCCGAACTC
GGCGCCATGG CCTCCTCCGC GGCGGGCGTG CGGGTCTTCT CCGACGACGG GCACTGCGTG
TCGGACGCGC TGCTCATGCG CCGGGCGCTG GAGTACGTCA AGGCGTTCGA TGGGGTGATC
GCCCAGCATG CGCAGGAACC GCGGCTGACC GAGAACGCCC AGATGAACGA GGGCACGGTG
GCTGCCAGGC TGGGGCTGCC GGGGTGGCCC GCGGTCGCCG AGGAGGCGAT CATCGCCCGG
GACGCGCTGC TGGCCGGGCA CGTCGGCTCC CGACTGCACG TCTGTCACGT CTCCACCGCC
GGATCGGTAG AGATCATCCG GTGGGCCAAG GCGAAGGGCT GGAACGTCAC CGCCGAGGTG
ACCCCGCACC ACCTGCTGCT CACCGACGAC CTGGTCTGCT CGTTCGACCC GGTCTACAAG
GTCAACCCGC CGCTGCGCAC CGCCGAGGAC GTCGCCGCGC TGCGCGCCGG GCTCGCCGAC
GGCACGATCG ACTGCGTCGC CACCGACCAC GCTCCGCACG CGCTGGAGGA CAAGGAGACG
GAGTGGGCCG CCGCGCGTCC CGGCATGCTC GGTCTCGAGA CGGCGCTGTC GGTGGTCATC
GAGACGATGG TCATCCCGGG CCGGCTGGAC TGGGCCGGGG TCGCCGAGCG GATGGCACTG
GCCCCGGCAA GGATCGGTGG CCTGCCGCGA ACCGCGGCCG AGGTGTGGAG TTCCATAGCA
GTGGGCGCGC CCGCGACCGT CACCCTGCTT GACCCGGCGC CGTGGCGGAT GGTCGAGCCG
GACGCGCTCG CCAGCCGCAG CCGCAACACG CCCTATGCGG GCCGGTCGCT GCCGGGAACG
ATCCGCGCCA CGTTCCTGCG GGGACGGCCC ACCGTGCTCG ACGGGAAGAT CGTATGA
 
Protein sequence
MTTPAAETSE ETPVGRSAGT SWVLRRVRPL GGDPVDVVLA DGVVAAWRPA GSTHAGAGGL 
PAGTTVLDTD GLILLPGLVD LHTHLREPGR EDAETVASGT RAAALGGYTT VFAMANTNPV
ADTAGVVEQV WRLGLDAGHC DVRPVGAVTV GLAGERLAEL GAMASSAAGV RVFSDDGHCV
SDALLMRRAL EYVKAFDGVI AQHAQEPRLT ENAQMNEGTV AARLGLPGWP AVAEEAIIAR
DALLAGHVGS RLHVCHVSTA GSVEIIRWAK AKGWNVTAEV TPHHLLLTDD LVCSFDPVYK
VNPPLRTAED VAALRAGLAD GTIDCVATDH APHALEDKET EWAAARPGML GLETALSVVI
ETMVIPGRLD WAGVAERMAL APARIGGLPR TAAEVWSSIA VGAPATVTLL DPAPWRMVEP
DALASRSRNT PYAGRSLPGT IRATFLRGRP TVLDGKIV
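
Consistency check (sketch)

The lengths in the record agree with each other: End bp minus Start bp plus one gives 1377 bp, which is 459 codons, and leaving out the terminal stop codon (TGA) gives the listed 458 aa. The Python sketch below shows one way such fields could be re-derived from a sequence block in the space-grouped layout used above. The helper names (clean, gc_fraction, expected_protein_length) are hypothetical and not part of any database tool, and the length rule assumes an unspliced CDS that includes its stop codon.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


def expected_protein_length(cds_length_bp: int) -> int:
    """Protein length implied by a CDS length.

    Assumption: the CDS is unspliced and its last codon is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First row of the gene sequence, copied from the record above.
first_row = "ATGACGACGC CCGCGGCGGA AACGTCGGAG GAAACGCCGG TGGGGAGGTC GGCGGGGACC"

print(len(clean(first_row)))               # 60 bases per full row
print(3795825 - 3794449 + 1)               # 1377, the listed gene length
print(expected_protein_length(1377))       # 458, the listed protein length
print(round(gc_fraction(first_row), 2))    # GC fraction of the first row only
```

To compare against the listed GC content of 74%, pass the complete gene sequence to gc_fraction rather than a single row.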