Gene Cag_0898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0898 
SymbolpyrC 
ID3748088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1231394 
End bp1232722 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content50% 
IMG OID637773429 
Productdihydroorotase 
Protein accessionYP_379206 
Protein GI78188868 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTATA TTTTTCAAAA CGCCCACCTA CTTAATCCCC TTGAAAAACT TGATGCCGTT 
GGCACCCTCA CCGTTACAAG CGATGGCACT ATTGCCGCCG TAACGCTTGG CAACGAAGCC
CCTCCCATTA CCGCTGACGA CCAGCTTATT AATTGTGAAG GAAAGATGAT TGTGCCGGGG
CTTTTTGATA TGCACTGCCA TTTTCGTGAA CCGGGGCAGG AGTACAAGGA GACGCTTGAG
AGCGGGGCAG AAGCCGCTTT AGCGGGTGGC TTTACCGGTG TGGCGCTTAT GCCCAACACA
CGCCCCGTCA TTGATAGCCC ACTTGGCGTT GCCTACATTC GCCACCACAG CACCACGCTT
CCCGTTGATT TGGAGGTTAT TGGCGCCATG ACGGTAGAAA GCAAAGGCGA ACATCTTGCA
CCGTATGGCA AGTTTAGCTC TTACGGCGTT ACCGCAATTT CGGATGATGG TGCCGCAATT
CAAAGCAGCC AAATGATGCG CTTAGCGCTG GAGTATGCTT CAAATTTTGA TTTACTCATT
ATTCAGCATT GCGAAGATCG CTCGCTGAGT GCAGGCGGCG TTATGAACGA AGGGCTGTAC
TCAACACGGT TGGGCTTAAA GGGGATTCCC GAAGTAGCCG AAGCCATCAC ACTTGGGCGC
GACCTCATGC TGTTGCGTTA CCTTGAAGAG CACAAATTAC ACACGCCACT GCGTCGCCCA
CGCTACCACG TTGCCCACAT AAGCACCCGT CAAGCAATTG AGTTGGTGCG CCAAGCAAAA
ATGGAAGGGT TGCAAGTAAC CTGCGAAATT ACCCCTCACC ACTTCACTTT GTGCGATCAA
GAGCTTTTTG AAGCCGAACG CAAAGGCAAT TTTATTATGA AACCGCCGCT TGCCTCACAA
GCCACGCGCG AGCACCTGCT TGCTGCCCTT GCTGATGGCA CCATTGATGC CATTGCCACC
GACCACGCTC CACACGCTCT GCACGAAAAA GAGTGCCCAC CCGACCAAGC CTCATTTGGC
ATTATTGGGC TTGAAACCTC GCTTGCGCTT ACCATTACAG AGTTAGTACA AAAAGAGGTT
ATTTCAATGG CACGCGCTAT TGAGTTACTC TCGGTTAATC CACGCGCTAT TATGCGGCTC
AAACCAATTC GCTTTGCAGC AGGTGAAGCT GCCAACTTCA CCATTATTGA TCCCAATGCA
GAATGGGTTG TAACGGCTGA ACATATTCGC TCAAAATCAT CCAACACACC ATTTATTGGA
CGCACCCTTC GCGGTAAATC GTTAGGAACA TTCCATAAAG GTGCATTGCG TATGACGGTT
GAGGAATAA
 
Protein sequence
MNYIFQNAHL LNPLEKLDAV GTLTVTSDGT IAAVTLGNEA PPITADDQLI NCEGKMIVPG 
LFDMHCHFRE PGQEYKETLE SGAEAALAGG FTGVALMPNT RPVIDSPLGV AYIRHHSTTL
PVDLEVIGAM TVESKGEHLA PYGKFSSYGV TAISDDGAAI QSSQMMRLAL EYASNFDLLI
IQHCEDRSLS AGGVMNEGLY STRLGLKGIP EVAEAITLGR DLMLLRYLEE HKLHTPLRRP
RYHVAHISTR QAIELVRQAK MEGLQVTCEI TPHHFTLCDQ ELFEAERKGN FIMKPPLASQ
ATREHLLAAL ADGTIDAIAT DHAPHALHEK ECPPDQASFG IIGLETSLAL TITELVQKEV
ISMARAIELL SVNPRAIMRL KPIRFAAGEA ANFTIIDPNA EWVVTAEHIR SKSSNTPFIG
RTLRGKSLGT FHKGALRMTV EE