Gene Caul_1658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1658 
Symbol 
ID5899113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1737630 
End bp1738619 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content68% 
IMG OID641562147 
ProductRluA family pseudouridine synthase 
Protein accessionYP_001683285 
Protein GI167645622 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.615207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0770781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGAAG TACGCACGCT GTTCGTGGAC GCTGGCGAGG ACGGGGTCCG CCTCGATCGC 
TGGTTCAAGC GTCGCTGGCC CCACCTCAAC CACATCCAGC TCAACAAGCT GTTCCGCTCG
GGCCAGGTGC GGGTCGATGG TTCGCGCGCC AAGGCCGACA CCAAGCTGGC GGCCGGCAGC
CAGATCCGCG TGCCCCCGCT GCCCGACGCC CCGGATCCGG ACGAAAAACA GAAGCTGAGC
CCGCGCGACA TCGCCTTCGC CAAGTCGCTG GTGCTGTACG AGGACGAGGA AGTTCTGGCG
CTGAACAAGC CGGCCGGCCT GGCCGTGCAG GGCGGCACCA AGACCACCCA CCACATCGAC
AAGCTGCTCA GCGCCTGGGG CGAGGGCGTC AACCGGCCCA AGCTGGTCCA CCGCCTGGAC
CGCGACACCT CCGGCGTGTT GCTGCTGGGC AAGACTCCCG CCGCGGCCGC CCGCCTGTCG
GGCTCGTTCG CCAAGCGCAA GGCGCAGAAG ACCTACTGGG CGATCGTCGC CGGCAACCCG
CACCCGACAG AGGGCGTGAT CGAGCTGCAC CTGGCCAAGC GCGGGGTGGG CGACCGCGAA
CTGGTCGTGC CGGCAGAACC CAAGGATCCT GACGGCCAGC CGGCCGAGAC CGAGTTCGTC
TCGATCAGCC GCGCGGGTCC ACGCGTCACC TGGATGGCCC TGCGCCCGCA CACCGGCCGC
ACGCACCAGC TGCGCGCCCA CATGAAGGCC ATCGGCCACC CGATCCTCGG CGATCCCAAG
TACAGCGACG ACAAGGCCTT GCAGCTTTCG GAAGGCCTGA AGCTGCAGTT GCACGCCCGC
TCGATCGTGC TGCCGCACCC CTCGGTCGGC ACCCTGGCCA TCCAGGCGCC GCTCAGCCCC
GAGATGAAGG CTGGCTTCGC CAAGTTCGGC TTCTCGGAGG ACGAGGCGGA ATACGACCCG
TTCGCTCGCC GCCAAACCAA GCGTAGATAA
 
Protein sequence
MREVRTLFVD AGEDGVRLDR WFKRRWPHLN HIQLNKLFRS GQVRVDGSRA KADTKLAAGS 
QIRVPPLPDA PDPDEKQKLS PRDIAFAKSL VLYEDEEVLA LNKPAGLAVQ GGTKTTHHID
KLLSAWGEGV NRPKLVHRLD RDTSGVLLLG KTPAAAARLS GSFAKRKAQK TYWAIVAGNP
HPTEGVIELH LAKRGVGDRE LVVPAEPKDP DGQPAETEFV SISRAGPRVT WMALRPHTGR
THQLRAHMKA IGHPILGDPK YSDDKALQLS EGLKLQLHAR SIVLPHPSVG TLAIQAPLSP
EMKAGFAKFG FSEDEAEYDP FARRQTKRR