Gene Cpha266_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1336 
SymbolpyrC 
ID4569707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1531191 
End bp1532531 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content50% 
IMG OID639765925 
Productdihydroorotase 
Protein accessionYP_911791 
Protein GI119357147 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCA TTTTTCAAAA CGCCCATATC ATCAACCCAC AAAGCAACCT TGATTATACC 
GGATCAATAA GGGTGTCCGT TGATGGTTTC ATTGAAGAGA TTATTCAAGG CGAATGCGAT
AAAAGCCCCG ACGACAGAAT AATCGATCTT CAGGGAAAAC TGCTGGTGCC GGGTCTTTTC
GACATGCACT GCCATTTCCG TGAACCGGGG CAGGAGTATA AGGAAACCCT TGAAAGCGGC
TCGAAAGCTG CCGTAGCCGG CGGATTTACC GGGGTAGCCC TCATGCCTAA CACAAAACCG
GTGATCGACA GCCCGCTCGG AGTAGCATAC ATACGTCACA ACGCACAGCA GTTGCCGGTT
GATCTTGAGG TTATCGGCGC AATGAGCGAA GGAAGCAAGG GAGAACAGCT TGCACCATAC
GGAAAATTCC GTTCGTATGG GGTAAAAGCT GTTTCCGATG ACGGAACGGC GATTCAGAAC
AGCCAGAATA TGCGACTGGT GTTCCAGTAC GCATCAAATT TCGATCTTCT TGTCATTCAG
CATTGCGAAG ACAAATCCAT GACCGCCGAA GCCGTCATGA ACGAAGGGGT ATTTTCAACA
AAACTTGGCC TTAAAGGAAT ACCTGATGTA TCCGAAGCGG TCATGCTGTG CCGTGATCTT
CACCTGATCC GCTATATCGT GGAGCACGAA TTGCACGATC CGGCCAACAA ACCGAGATAC
CATGTGGCAC ACATCAGCAC CAAAGCCTCC CTCGACCTTG TCCGGCAGGC TAAAGCCGAA
GGCTTGCAGG TAACGTGCGA GGTTACGCCT CACCATTTCA CCCTTACCGA CGAAGATCTT
TTCAATGCGC CCTCAAAAGG CAATTTCATC ATGAAGCCCC CGCTCCCCTC AAAGAAGAAC
AGGGCAGCTA TCCTTGAAGC AATTGCAGAC GGAACGATTG ATGCCATTGC TACCGATCAC
GCCCCTCATG CTCCACATGA AAAAGAGTGC CCTCCCGATC AGGCGTCATT CGGCATTATC
GGTCTGGAAA CCGCTGTAGG ACTCACAATA ACCGAACTGG TCGAACCGGG AATCATAACG
CTTTCAAGAG CGATTGAGCT GATGTCAGTC AATCCCCGTA AAATTCTTCA GCTTGATCCC
CTGCTGTTCG CGCAGGGAGA AAGGGCTAAT TTTACCATTA TTGATCCAGA GGAGGAGTGG
GCGCTTACCG CGAATGCCGT TAAATCAAAG TCAACGAATA CGCCCTTTCT TGGCCGTAAA
CTCAAGGGCA GGGCGATTGC TGTTTACCAC AAAGGGATGT TTCATGAAAG CGTAACTTCA
CAAGAGCATT TTAGCGTGTA A
 
Protein sequence
MSTIFQNAHI INPQSNLDYT GSIRVSVDGF IEEIIQGECD KSPDDRIIDL QGKLLVPGLF 
DMHCHFREPG QEYKETLESG SKAAVAGGFT GVALMPNTKP VIDSPLGVAY IRHNAQQLPV
DLEVIGAMSE GSKGEQLAPY GKFRSYGVKA VSDDGTAIQN SQNMRLVFQY ASNFDLLVIQ
HCEDKSMTAE AVMNEGVFST KLGLKGIPDV SEAVMLCRDL HLIRYIVEHE LHDPANKPRY
HVAHISTKAS LDLVRQAKAE GLQVTCEVTP HHFTLTDEDL FNAPSKGNFI MKPPLPSKKN
RAAILEAIAD GTIDAIATDH APHAPHEKEC PPDQASFGII GLETAVGLTI TELVEPGIIT
LSRAIELMSV NPRKILQLDP LLFAQGERAN FTIIDPEEEW ALTANAVKSK STNTPFLGRK
LKGRAIAVYH KGMFHESVTS QEHFSV