Gene Franean1_1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1707 
SymbolpyrC 
ID5670109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2038535 
End bp2039890 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content74% 
IMG OID641240625 
Productdihydroorotase 
Protein accessionYP_001506051 
Protein GI158313543 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0209676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGG TGATCGTGGT GCGTGGCGTC CGTCCGCTGG GCGGTGACGC GCGGGATCTG 
GTGATCGCCG GCGGGACGAT CGCCGCTGTC GCGCCGGCCG GGGGCGGCGA GCACACCGGC
GTCGTGGCCG AGGCCACCGC GGGCTCCGCC GACGGCACCC GCCCCGCCCC CCTGGTCGTC
GACGCCACCG GGCTCGTGGC GCTGCCCGGG CTCGTTGACC TGCACACCCA CCTGCGCGAG
CCTGGCCGGG AGGACGCCGA GACGGTCGAC TCCGGTACCC GCGCGGCGGC GCTCGGCGGC
TACACGACGG TGTTCGCGAT GGCCAACACC GACCCCGTCG CCGACACCGC CGGGGTGGTC
GAGCAGGTGT GGCGGCTCGG CCAGGACGCC GGGCACTGCG ACGTGCGCCC GGTCGGCGCG
GTGACCAGGG GGCTCGCCGG CGAGCGGCTC GCCGAGCTGG GCGCCATGGC CTCCTCCGCG
GCGGCGGTCC GGATGTTCTC CGACGACGGT CACTGTGTGT CCGACGCGTT GCTCATGCGC
CGCGCGCTCG AGTACGTCAA GGCCTTCGAC GGCGTGATCG CGCAGCACGC GGAGGAGCCG
AGGCTGACCG CCGGGGCGCA GATGAACGAG GGCGCCATGG CCGCGCGGCT GGGCCTGCCC
GGCTGGCCGG CCGTCGCAGA GGAGGCGATC ATCGCCCGGG ACGCCCTGCT GACCGGGCAC
GTCGGGTCGC GCCTGCACAT CTGCCACGTG TCCACGGCGG GCTCGGTCGA GCTCATCCGG
TGGGCCAAGG CGAAGGGCTG GCGGGTGACC GCGGAGGTCA CCCCGCACCA CCTGCTGCTC
ACCGAGGAGC TCGTCGCCTC GTACGACCCG GTGTACAAGG TCAATCCGCC GTTGCGCACC
GCGGCCGACA CAGAGGCACT GCGAGCCGGT CTCGCGGACG GCACGATCGA CTGTGTGGCC
ACCGATCACG CCCCGCACGC CTCCGAGGAC AAGGAGACGG AGTGGGCCGC CGCCCGGCCC
GGGATGCTCG GCCTCGAGAC AGCGCTGTCC ATCGTGATCC AGACCATGGT CGAAACCGGC
CGGCTCGACT GGGCCGGTGT CGCCGACCGG ATGGCGCTGG CGCCGGCCCG GATCGGCGCC
GTCGCCGACA CGCCGCGTGA CCCGGCGAGC TTCGCCCAGG TGGGCGCGCC CGCCACCCTG
ACCCTGCTCG ACCCGGAGGC GCGACGGGTG ATCGATCCAC TCGCCGTCGC CAGCCGGAGC
AGCAACACTC CGTACGGTGG CCGCACGCTG CCCGGTGCCA TTCGGGCGAC GTTCCTGCGA
GGCCGGCCCA CCGTGCTCGA CGGGAAGATC ATATGA
 
Protein sequence
MTSVIVVRGV RPLGGDARDL VIAGGTIAAV APAGGGEHTG VVAEATAGSA DGTRPAPLVV 
DATGLVALPG LVDLHTHLRE PGREDAETVD SGTRAAALGG YTTVFAMANT DPVADTAGVV
EQVWRLGQDA GHCDVRPVGA VTRGLAGERL AELGAMASSA AAVRMFSDDG HCVSDALLMR
RALEYVKAFD GVIAQHAEEP RLTAGAQMNE GAMAARLGLP GWPAVAEEAI IARDALLTGH
VGSRLHICHV STAGSVELIR WAKAKGWRVT AEVTPHHLLL TEELVASYDP VYKVNPPLRT
AADTEALRAG LADGTIDCVA TDHAPHASED KETEWAAARP GMLGLETALS IVIQTMVETG
RLDWAGVADR MALAPARIGA VADTPRDPAS FAQVGAPATL TLLDPEARRV IDPLAVASRS
SNTPYGGRTL PGAIRATFLR GRPTVLDGKI I