Gene Plut_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_1013 
SymbolpyrC 
ID3745793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp1150727 
End bp1152076 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content60% 
IMG OID637769048 
Productdihydroorotase 
Protein accessionYP_374918 
Protein GI78186875 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0448021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATAG TTTTTCAGGA GGCGCATATC ATCAGCCCCT CTGACGGCAT TGACGCCAGG 
GGCTCAATCA GGGTATCCGA CAGCGGGGTC ATCGAGACCC TCTCGATTGG AGAGACCCCG
CTTGAGCCTT ATGCAGAAGA AAAAGTCATT GCTATGAGGG GTAAAATACT CTCCCCCGGC
CTTTTCGACA TGCACTGCCA TTTCCGCGAA CCCGGACAGG AATACAAGGA GACGCTTCTG
AGCGGCTCGG CCGCCGCCGC AGCCGGAGGA TTCACCGGAG TGGCGCTGAT GCCGAACACC
ACGCCCGTCA TCGACAGCCC TCTTGGGGTC ACCTTCATCG GCTACCATGC CGGGAACCTC
CCTGTCGACC TTGAGGTGAT TGCCTCCATG ACGGAAGGAA GCCGCGGTGA GAAGCTCACG
GCGTTCGGAA GCCTGAAAGC CTACGGAGTG AGGGCCGTCT CGGATGACGG CACCGCCATC
CAGGCAAGCC AGTCGATGCG CCTGGCGTTT GAATACGCCT CCAACTTCGA CATGCTCATC
ATCCAGCACT GTGAAGACCG GTCCCTGACG ACGGGAGGAG TCATGAATGA AGGAATGTGG
TCGTCGAAAC TCGGACTCAA GGGAATCCCC GACATCTCAG AGGCCATGAT GCTTGCCCGT
GACCTCATGC TGCTCCGCTG GCTCGAAGAG CACAAGCTGC ACGACCCGCT CTGCCGCCCC
CGATACCACG CAGCCCACGT CAGCACGGCA GCATCGCTCC AGCTGATCCG GGAAGCCAAG
CGGGACGGCC TGCAGGTGAC CTGCGAGGTC ACCCCGCACC ACTTCACCCT GACCGACCAG
GACCTCTACC TGGCCGAAAA GAAAGGCAAC TTCATCATGA AGCCTCCGCT TACCTCCCCG
AAAAACCGGG ACGCGGTGCT GGAAGCCCTT GCCGACGGCA CAGCGGACGC CATTGCCACC
GACCATGCGC CGCATGCCCT CCACGAAAAG GAGTGCCCCC CCGGCGAAGC TTCGTTCGGC
ATCATCGGAC TGGAAACCTC GCTGGGCCTC ACCATGACGG AGCTGGTGAT GAAAGGAGTC
ATCACGATGC ATCGGGCTAT TGAACTTCTG TCGGTCAATC CGAGAAGAAT CCTGCGGCTC
CCCCCCATCC GCATCCGCGA AGGGGAAAAA GCCAACTTCA CCCTCATCGA TCCTGAAGCC
GTCTGGACCG TATCTGCCGA TCATCTCCGC TCCAAATCCG CCAACACCCC GTTCATCGGC
CGCCAGCTGA AAGGCCGCCC TATGGGAATC TTCCACAAGG GCCGACTTAC TGCAAGCGCC
CGGGGCATAA TTGACGCCCC GGAAGGGTGA
 
Protein sequence
MSIVFQEAHI ISPSDGIDAR GSIRVSDSGV IETLSIGETP LEPYAEEKVI AMRGKILSPG 
LFDMHCHFRE PGQEYKETLL SGSAAAAAGG FTGVALMPNT TPVIDSPLGV TFIGYHAGNL
PVDLEVIASM TEGSRGEKLT AFGSLKAYGV RAVSDDGTAI QASQSMRLAF EYASNFDMLI
IQHCEDRSLT TGGVMNEGMW SSKLGLKGIP DISEAMMLAR DLMLLRWLEE HKLHDPLCRP
RYHAAHVSTA ASLQLIREAK RDGLQVTCEV TPHHFTLTDQ DLYLAEKKGN FIMKPPLTSP
KNRDAVLEAL ADGTADAIAT DHAPHALHEK ECPPGEASFG IIGLETSLGL TMTELVMKGV
ITMHRAIELL SVNPRRILRL PPIRIREGEK ANFTLIDPEA VWTVSADHLR SKSANTPFIG
RQLKGRPMGI FHKGRLTASA RGIIDAPEG