Gene GSU1272 details


Gene Information

Locus tag: GSU1272
Symbol: pyrC
ID: 2686576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1386623
End bp: 1387900
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 64%
IMG OID: 637125946
Product: dihydroorotase, multifunctional complex type
Protein accession: NP_952325
Protein GI: 39996374
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
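
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts one terminal stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1386623
END_BP = 1387900
GENE_LENGTH_BP = 1278
PROTEIN_LENGTH_AA = 425


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))             # 1278
    print(expected_protein_length(GENE_LENGTH_BP))   # 425
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.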


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0255922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCTGC TGATAAAAGG TGGGAGGGTG ATTGACCCGT CCCAGGGAAT TGACGAAGTT 
CTGGATATCC TCGTGGAGAA TGGCGCAATC AAGGAACTCG GCAAGGGACT CGCGGCTCCG
GCCGGGGCCG GGGTCGTGGA CGCCGCCGGC CTGATCGTCA CGCCGGGCCT CATTGATATG
CATGTGCACC TGCGGGACCC GGGGCTCGAG TACAAGGAAG ATATCGTAAC AGGCACCAGG
GCGGCTGCGG CCGGCGGCTT CACGTCGGTG GCCTGCATGC CCAACACCAA GCCGGTGAAC
GACAACAAGG CCGTGACCAG CTACATCGTC GCCAAGGCCA AGGCCGAGGG GCTCGTCAAC
GTCTTCCCCG TGGGGTCCAT TACTCAGGGG AGCAAGGGGG ATGCCCTGGC CGAGATGGGG
GACCTGAAGG AAGCAGGCTG CGTGGCGGTT TCCGACGACG GCCGGCCCGT GACCAGTTCC
GAGCTCATGC GCCGGGCCCT GGAGTACGCC AAGGGAATGG GAATCATGGT CATCTCCCAT
GCCGAGGATC TCTCCCTGGT GGGCGAGGGG GTCATGAACG AGGGCTTCGT CTCCACGGAG
CTGGGGCTCA AGGGAATACC CTGGGCCGCC GAGGACGCTG CCACCGCCCG TGACGTGTAC
CTGGCCGAGT TCACCAACTC GCCGCTCCAC ATCGCCCACG TCTCCACAAT GGGGTCATTG
CGGATCATCC GTAACGCCAA GGCCCGCGGC GTGAAGGTTA CCTGCGAGAC GGCGCCCCAC
TACTTCAGCC TCACCGACGA TGCAGTGCGC GGCTACAACA CCAATGCCAA GATGAATCCG
CCGCTCCGTA CGGCCGATGA TCTGGCCGCG GTCAAAGAGG CCCTGAAGGA CGGCACCATC
GACGCCATCG CCACCGACCA CGCCCCCCAC CATCTGGATG AGAAGGACGT GGAGTTCAAC
GTGGCTTTGA ACGGCATCAT CGGCCTGGAA ACCTCCCTGC CGCTGTCGCT GAAGCTGGTG
GAGGAGGGAG TGTTGACCCT GCCGGCACTG GTTGAGAAGA TGGCGTGCAA CCCGGCCGCG
ATTCTCGGCA TTGACCGGGG CACGCTCCGG CAAGGCGCGG TTGCCGACAT CACGGTTATT
GATCCGGCGG CCGTCTGGAC GGTGGAGGCC GGTGCGCTCG CCAGCAAGTC CAAGAACTCA
CCCTTCCTCG GCTGGGAGAT GAAAGGTGCC GCGGCATACA CCATCGTCGG CGGCACGGTG
GTCCACAGCA GAGGATGA
 
Protein sequence
MNLLIKGGRV IDPSQGIDEV LDILVENGAI KELGKGLAAP AGAGVVDAAG LIVTPGLIDM 
HVHLRDPGLE YKEDIVTGTR AAAAGGFTSV ACMPNTKPVN DNKAVTSYIV AKAKAEGLVN
VFPVGSITQG SKGDALAEMG DLKEAGCVAV SDDGRPVTSS ELMRRALEYA KGMGIMVISH
AEDLSLVGEG VMNEGFVSTE LGLKGIPWAA EDAATARDVY LAEFTNSPLH IAHVSTMGSL
RIIRNAKARG VKVTCETAPH YFSLTDDAVR GYNTNAKMNP PLRTADDLAA VKEALKDGTI
DAIATDHAPH HLDEKDVEFN VALNGIIGLE TSLPLSLKLV EEGVLTLPAL VEKMACNPAA
ILGIDRGTLR QGAVADITVI DPAAVWTVEA GALASKSKNS PFLGWEMKGA AAYTIVGGTV
VHSRG
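
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The sketch below is one possible way to strip the spacing, compute GC content, and translate the gene sequence for comparison with the protein sequence. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ only in the permitted start codons). The helper names are hypothetical, and only the first line of each sequence is included to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein.
FIRST_GENE_LINE = (
    "ATGAACCTGC TGATAAAAGG TGGGAGGGTG ATTGACCCGT CCCAGGGAAT TGACGAAGTT"
)
FIRST_PROTEIN_RESIDUES = "MNLLIKGGRV IDPSQGIDEV"

if __name__ == "__main__":
    print(translate(FIRST_GENE_LINE) == clean(FIRST_PROTEIN_RESIDUES))  # True
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way. A gene that begins with an alternative start codon such as GTG or TTG would need its first residue set to M separately, which this sketch does not do; the gene shown here begins with ATG.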