Gene Paes_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1038 
SymbolpyrC 
ID6459999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1140380 
End bp1141702 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content52% 
IMG OID642725038 
Productdihydroorotase 
Protein accessionYP_002015724 
Protein GI194333864 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.933737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00310904 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCATCA TTTTTCATCA GGCGAGAATC ATCAATCCCG CCCAGAACCT TGATACGACA 
GGATCGATCC GTATTTCCGA TTCAGGCCGG ATTGAAACCA TCGTTACAGG AAACGATCCC
CTGCCCCCTC AGGATACCGA CCGCGTCATT GATATGAACG GCAAACTTCT CGTTCCGGGC
CTGTTCGATA TGCACTGCCA CTTCAGAGAG CCTGGACAGG AATACAAAGA AACACTCGAA
ACAGGATCAA GAGCCGCTGT CGCAGGAGGA TTTACGGGTG TGGCTCTTAT GCCGAATACC
AAACCCGTCA TCGACAACCC GCAGGGAGTC GCCTATATCC GTCAGACGGC CATGGAGCTG
CCGATCGACA TCGAAGTCAT CTCGGCGATG ACCAAAGAAA GCAGGGGGGA AACTCTGGCC
CCTTTCGGAA AACTCTTTGC AAGCGGCGTC AAAGCCGTAT CGGATGACGG AACCGCTATT
CAGAACAGCC AGATCATGCG CCTGGCGTTT GAGTACGCCG CCAACTTCAA CTTACTCTTC
ATCCAGCACT GTGAAGACAC CAGCCTTACC TCGGGCGGGG TCATGAATGA AGGTGAATAT
TCTGCGATGA TGGGTCTTAA AGGAATTCCC GACGTCGCAG AAGCAATCAC GCTCAGCCGT
GACCTGCTGC TTATCGACTA CCTCCGAAAA CATAAACTCT CACCGCCTCT GACTGCACCA
CGTTATCATG TCGCCCATAT CAGCACAAGA AGCGCCCTGG ATCTTGTACG CAAAGCAAGG
AAAGAAGGCA TGGCAATAAC CTGCGAAGTC ACGCCGCATC ATTTCACCCT CACCGAAGAG
GCCCTTTTCA AGGCAGAGCA CAAAGGCAAC TTCATCATGA AACCGCCACT CTGCAGCCTT
GATAACCACG CAGCAATCCT TGAAGCGATT GTCGACGGCA CGATCGACGC CATTGCAACC
GATCACGCGC CACATGCCGA ACATGAAAAG CAATGCCCTC CCGATCAGGC ATCATTCGGC
ATCATCGGTC TGGAAACAGC GGTGGGACTG ACCTTCAGCG AACTGGTCCA CACAGGACGC
ATATCCGTCA GCCGGGCTAT AGAGATGCTC TCAGTCAACC CAAGACGGAT TATGGATATT
GAACCTGTGC TTTTCGAACC CCAGAGAGCG GCTAACTTTA CGCTGATAGA TCCCGATGCC
ACCTGGACCT GGAAGAGCGA GCATATCAAG TCGAAAGCAA AAAACTCTCC GTTTATCGGC
CGAACAATGA AAGGCAAAGC GATCGGTATC TGTCATAAAG GAAAACTGCT TGGACTTGAC
TGA
 
Protein sequence
MSIIFHQARI INPAQNLDTT GSIRISDSGR IETIVTGNDP LPPQDTDRVI DMNGKLLVPG 
LFDMHCHFRE PGQEYKETLE TGSRAAVAGG FTGVALMPNT KPVIDNPQGV AYIRQTAMEL
PIDIEVISAM TKESRGETLA PFGKLFASGV KAVSDDGTAI QNSQIMRLAF EYAANFNLLF
IQHCEDTSLT SGGVMNEGEY SAMMGLKGIP DVAEAITLSR DLLLIDYLRK HKLSPPLTAP
RYHVAHISTR SALDLVRKAR KEGMAITCEV TPHHFTLTEE ALFKAEHKGN FIMKPPLCSL
DNHAAILEAI VDGTIDAIAT DHAPHAEHEK QCPPDQASFG IIGLETAVGL TFSELVHTGR
ISVSRAIEML SVNPRRIMDI EPVLFEPQRA ANFTLIDPDA TWTWKSEHIK SKAKNSPFIG
RTMKGKAIGI CHKGKLLGLD