Gene Caul_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2043 
Symbol 
ID5899498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2184131 
End bp2185108 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content69% 
IMG OID641562532 
ProducttRNA-dihydrouridine synthase A 
Protein accessionYP_001683669 
Protein GI167646006 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00742] tRNA dihydrouridine synthase A 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00353768 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.913937 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGACT GGAGCGATCG GCATTGCCGG TCGCTGCACC GTGTGCTGTC GCGCCGGGCG 
TTGCTGTATT CGGAGATGGT GACCAGCGGG GCGGTGCTGC ATGGCGACTG CGAGAAGTTG
CTGGGGTTCG ACGCCGGTCA GCATCCGGTG GCCCTGCAAC TGGGCGGCTC GGAGCCGGCC
GATCTGGCCG CCGCCGCCCG GATTGGCGAG GACTTTGGCT ACGACGAGAT CAATCTGAAT
GTCGGCTGCC CGTCGGACCG CGTGCAGAGC GGCCGTTTCG GGGCCTGCCT GATGCGCGAA
CCGAAGCTGG TGGCCGACTG CATGGCGGCG ATGACCGCGG CGGTGTCCGT ACCGGTCACC
GTCAAGTGCC GGATCGGGGT CGACGACCAG GACCCGGAGC AGAGCCTGTT CGAGCTGGTC
GATCTGTCGG CCCAGGCCGG CGTAACCCAC TTCGTGGTTC ACGCGCGCAA GGCCTGGCTG
AAGGGCTTGT CGCCAAAGGA AAACCGCGAC GTGCCGCCGC TGGACTATCC GTTGGTCCAT
CGGCTGAAGG CCGAGCGGCC CGCCCTGACC ATCGTCATCA ACGGCGGGAT TCCGGATCTC
GACGCCTCGC TGGTCCAGCT GGCGCATGTC GATGGGGTGA TGCTGGGTCG GGCGGCCTAT
CACGAGCCTG GCCTGCTGGG TCAGGTCGAT CGCCGGGTGT TCGGCGAGGG CCGCGATGTC
GACGCCTTCG AGGCGGTCGA GCTCTACAAG TCCTATATGG CCAGTCAGTT GGCGGCCGGC
GTGCACCTGA CGGCGATGAG CCGGCACATG CTGGGCCTGT TCCACGGCAT GCCGGGCGCG
CGGGCTTGGC GCCGCATCCT CACGGTCGAG GGCGTCGCGG CGGGGGCGGG GCTGGAGGTT
GTCGATCGCG CCTTGGCCGC CGTCCGCCAG GCTGTCGATG CGCGCGAGGC GCGGGCGCTG
GAGGCGGTCG CGAGCTAA
 
Protein sequence
MMDWSDRHCR SLHRVLSRRA LLYSEMVTSG AVLHGDCEKL LGFDAGQHPV ALQLGGSEPA 
DLAAAARIGE DFGYDEINLN VGCPSDRVQS GRFGACLMRE PKLVADCMAA MTAAVSVPVT
VKCRIGVDDQ DPEQSLFELV DLSAQAGVTH FVVHARKAWL KGLSPKENRD VPPLDYPLVH
RLKAERPALT IVINGGIPDL DASLVQLAHV DGVMLGRAAY HEPGLLGQVD RRVFGEGRDV
DAFEAVELYK SYMASQLAAG VHLTAMSRHM LGLFHGMPGA RAWRRILTVE GVAAGAGLEV
VDRALAAVRQ AVDAREARAL EAVAS