Gene Phep_3229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3229 
Symbol 
ID8254348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3834190 
End bp3835524 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content45% 
IMG OID644936882 
Productdihydroorotase 
Protein accessionYP_003093486 
Protein GI255533114 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.174621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCA TTCTTATAAA AGGAGCCTCT GTAGTAAACG AAGGCCAGAT TGTTGTAGCC 
GATGTACTAC TTAAAAACGG CTTTATTGAA AAAATAGCCC CAAACATTGA TATTGCGGCC
CATCAGGAGA TCAATGCTGA AGGCCTGCAC CTTTTTCCGG GAATGATAGA CGACCAGGTG
CATTTTCGTG AGCCGGGCCT AACCCATAAA GCAGATATTT TTTCGGAAAG CATGGCCGCT
GTTGCCGGCG GGATTACTTC TTTCATGGAA ATGCCAAATA CAGTACCCAA TACACTTACA
CAGAAACTGC TGGCCGATAA ATATGCCATT GCCTCAGAAA TGTCGCTGGC CAATTACTCC
TTTTTCATGG GTGCATCTAA CGACAATCTG GACGAAGTAT TAAAAACAGA CCCTAAAAAC
GTTTGCGGCA TTAAGGTATT TATGGGTTCT TCTACAGGCA ATATGCTGGT AGACAATGAA
AAGGTACTGG AAAACATCTT CAAAGAAGCG CCCATGCTGG TGGCTACGCA TTGCGAAGAT
GAGCAGACTA TCCGGCATAA CCTTGCCGTT TATAAAGAAA AATACGGAGA AAATATCACC
ATAGCTATGC ACCCGCTGAT CCGGAGTGCC GAAGCCTGTT ATAAATCCTC TTCAATGGCT
GTAGAACTGG CCAAAAAGTA CCATACACGT CTCCATATCC TGCACATTTC TACGGCAAGG
GAAGTTGCAC TATTTGACAA TAAAACACCA CTTGCAGATA AAAAAATAAC CGCTGAAGCC
TGTGTACATC ATTTATGGTT CGACGATCAT GATTATGCGG TTAAAGGAAA CTGGATCAAA
TGGAACCCCG CTGTTAAAAG CGCTGCAGAT AAAGCAGGTA TCCTGAAAGG GGTACTGGAC
GGCCATATCG ATATCATTGC TACTGATCAT GCCCCACATA CCATTGAAGA AAAGGAACAG
CCTTATTTAC AGGCGCCCTC TGGTGGCCCA CTGGTTCAGC ATGCACTGCC CGCACTGTTC
GAAATGTATC ACCAGGGTAA AATATCGCTG GTACAGATTG CCGAAAAAAC AGCACACAAT
GTGGCAGTAT GTTTCAATAT CGATAAAAGG GGCTTTATCA GGGAAGGCTA CTGGGCCGAC
CTGGTACTGG TAAACCTGAA CGATCCTTTT ACCGTAACCA AAATGAACGT GCTGTACAAA
TGTGGCTGGT CGCCTTTTGA AGGGCAAACC TTCAGAGCCG AAGTTACCCA TACCTTTGTA
TCCGGCAACC TTGCTTATCA GAAAGGGAAA TTCACTACCC AGGAAACCGG CAAACGACTG
GCCTTTAACC GCTGA
 
Protein sequence
MNTILIKGAS VVNEGQIVVA DVLLKNGFIE KIAPNIDIAA HQEINAEGLH LFPGMIDDQV 
HFREPGLTHK ADIFSESMAA VAGGITSFME MPNTVPNTLT QKLLADKYAI ASEMSLANYS
FFMGASNDNL DEVLKTDPKN VCGIKVFMGS STGNMLVDNE KVLENIFKEA PMLVATHCED
EQTIRHNLAV YKEKYGENIT IAMHPLIRSA EACYKSSSMA VELAKKYHTR LHILHISTAR
EVALFDNKTP LADKKITAEA CVHHLWFDDH DYAVKGNWIK WNPAVKSAAD KAGILKGVLD
GHIDIIATDH APHTIEEKEQ PYLQAPSGGP LVQHALPALF EMYHQGKISL VQIAEKTAHN
VAVCFNIDKR GFIREGYWAD LVLVNLNDPF TVTKMNVLYK CGWSPFEGQT FRAEVTHTFV
SGNLAYQKGK FTTQETGKRL AFNR