Gene Caul_4357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4357 
Symbol 
ID5901818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4734799 
End bp4736133 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content70% 
IMG OID641564875 
Productdihydroorotase 
Protein accessionYP_001685975 
Protein GI167648312 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.655368 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGA CCTTCGACCT GATCGTGCGC GGCGGCGAGG TGGTCAATCA CGCGGGGCGG 
GGCTTCACCG ACATCGGCGT GCGCAATGGC AAGATCGTCG CGATCGGCGA CCTTGGCCAA
GCCTCGGCCG GCGAGATCTT CGACGCCTCG GGCCTGACCG TGCTGCCGGG CGTCATCGAC
AGCCAGGTCC ACTTCCGCGA ACCCGGCCTG GAGTGGAAGG AAGACCTGGA AAGCGGCTCG
CGCGGCGCCG CCCTGGGAGG CGTGGTCGCG GTGTTCGAGA TGCCCAACAC CGAGCCCACC
ACCACCGACC CCGACGCCCT GGCCGACAAG CTGGCCCGGG CCAAGGGCCG CATGCACACC
GACCACGCCT TCTATGTCGG CGGCACCCAC GAGAACGCCG CCTTTCTGGG CGAGCTGGAG
CGCCTGCCCG GCTGCTGCGG CATCAAGGTG TTTATGGGGG CCTCGACGGG CACCCTGCTG
GTGCAGGACG ACGAGGGGGT CGAGGCCGTT CTGCGCAGCG TCAACCGCCG CGCCGCCTTC
CACTCCGAGG ACGAGTACCG CCTGGCCGAG CGCCGCGGCC TGGCCCGTCC CGGCGACTGG
ACCAGCCACC CGGAGGTCCG CGACGCCCAG GCGGCCCTGC AGTCGACCCA CCGCCTGGTC
GGCATCGCCA AGCGCCTGGG CAAGCGCATC CACGTGCTGC ATGTCACCAC CCACCAGGAG
ATCGACTTCC TGGCCAAGCA CAAGGACGTC GCCAGCGTCG AGGTCACGCC CCAGCACCTG
ACGCTGGTGG CGCCGGAAGC CTATGAGCGG CTGAAGGGCT TCGCCCAGAT GAACCCGCCG
ATCCGCTCGG CCGAGCACGT GGCCGGCGTC TGGCGCGGCG TCGAGACCGG CATCGCCGAC
GTGCTGGGCT CCGACCACGC CCCCCACACT CGCGAGGAAA AGGCCCGGCC CTATCCGGCC
TCGCCCTCGG GCATGCCGGG CGTGCAGACG CTGGTTCCCA TCATGCTGAC CCATGTGGTG
GACGGCCGGC TGTCGCTGGA GCGCTTCGTC GACCTGACCA GCCATGGGGT GAACCGCGTG
TTCGGCCTGG CCGACAAGGG CCGGATCGCC GAGGGCTTCG ACGCCGACTT CACCATCGTT
GACATGAAGG CCCGGCGAAC CATCACCCAC GACTGGATGG CCACCCGCTC GGGCTGGACC
CCCTTCGACG GCTTCGAGGC CAAGGCCTGG CCCGTGGCGA CCATCGTTCG CGGGACCGTG
GTGATGCGGG ACGACGAGAT CGTGGCGGAA GGGACGGGCG CGCCGGTGCG GTTCCTGGAG
ACGCTGGCGG GGTAG
 
Protein sequence
MTQTFDLIVR GGEVVNHAGR GFTDIGVRNG KIVAIGDLGQ ASAGEIFDAS GLTVLPGVID 
SQVHFREPGL EWKEDLESGS RGAALGGVVA VFEMPNTEPT TTDPDALADK LARAKGRMHT
DHAFYVGGTH ENAAFLGELE RLPGCCGIKV FMGASTGTLL VQDDEGVEAV LRSVNRRAAF
HSEDEYRLAE RRGLARPGDW TSHPEVRDAQ AALQSTHRLV GIAKRLGKRI HVLHVTTHQE
IDFLAKHKDV ASVEVTPQHL TLVAPEAYER LKGFAQMNPP IRSAEHVAGV WRGVETGIAD
VLGSDHAPHT REEKARPYPA SPSGMPGVQT LVPIMLTHVV DGRLSLERFV DLTSHGVNRV
FGLADKGRIA EGFDADFTIV DMKARRTITH DWMATRSGWT PFDGFEAKAW PVATIVRGTV
VMRDDEIVAE GTGAPVRFLE TLAG