Gene Caul_0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0388 
Symbol 
ID5897662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp426729 
End bp427793 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content71% 
IMG OID641560873 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001682023 
Protein GI167644360 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTC ACGACATCGC CGCCCGCGCC CTGCACGTCC TGGACCCCGA GGACGCCCAC 
GGCTGGGCGA TCAAGGGCCT GAAGATGGGA TTGGGTCCGC GCCAGTCCGA CGTCGACGAT
CCGATCCTGT CGCTGACCGT CGCCGGCCTG CCGCTGTCCA ACTGCGTGGG CCTGGCCGCC
GGCTTCGACA AGAACGCCGA GGTCCCCGCC GCCATGTCGC GGGCCGGCTT CGGCTTCGTC
GAGTGCGGTA CGGTCACGCC CCTGGCCCAG GCCGGCAATC CGCGCCCGCG CCTGTTCCGG
CTGACCCAGG ACCAGGCGGT GATCAATCGC ATGGGCTTCA ATAACGAGGG CCTGGAGCCG
TTCGCGGCCC GCCTGTCGGC TCTGAAGGCC CGACGCACGC GCGGGATCGT CGGCGCCAAT
ATCGGGGCCA ACAAGGACGC GACCGACCGC ATCGCCGACT ATGTCACGGG CCTTACCCGC
CTGTGGGGCC TGTCGGACTA TTTCACCGTC AATATCTCCT CGCCCAACAC GCCGGGGCTG
CGCGCCCTGC AGACCAAGGC GGCGCTGGAG GAACTGCTGG GCCGCCTCGC CGAGGCGCGC
GGCCTGCTGA AGGCTGCCGG CACGGTCGAC TATCCGATCT TCCTGAAGGT CGCGCCAGAC
CTGGAGGACG GGGAGGTCGA GGCCATCGTC GAGACGGTCA AGAGCGCCGG CCTGAACGGG
ATCATCGTCA GCAACACCAC GATCGCCCGC CCCGCCGACC TGGCCTCCCC CCACGCCGCC
GAGAGCGGCG GTCTTTCGGG CAAGCCGCTG CTGGCCGCTT CCACCGCCAT GCTGGCCCGT
TTCCACGCCG CCAACAACGG ACACCTAGCC TTGATCGGGG CGGGCGGGAT CGCCAGCGGC
GCCGACGCTC TGGCCAAGAT CCGGGCCGGC GCCTGCGCCG TGCAGCTCTA TTCAGCCCTC
GTCTACGGCG GGCCGGGCCT GGTCCAGCGG ATCAAGTCGG ACCTGGCCGC CCGCCTGCGC
GCCGAGGGCT TCGCCTCGGT CACTGACGCG ATCGGCGCGG CATGA
 
Protein sequence
MSLHDIAARA LHVLDPEDAH GWAIKGLKMG LGPRQSDVDD PILSLTVAGL PLSNCVGLAA 
GFDKNAEVPA AMSRAGFGFV ECGTVTPLAQ AGNPRPRLFR LTQDQAVINR MGFNNEGLEP
FAARLSALKA RRTRGIVGAN IGANKDATDR IADYVTGLTR LWGLSDYFTV NISSPNTPGL
RALQTKAALE ELLGRLAEAR GLLKAAGTVD YPIFLKVAPD LEDGEVEAIV ETVKSAGLNG
IIVSNTTIAR PADLASPHAA ESGGLSGKPL LAASTAMLAR FHAANNGHLA LIGAGGIASG
ADALAKIRAG ACAVQLYSAL VYGGPGLVQR IKSDLAARLR AEGFASVTDA IGAA