Gene Caul_0345 details


Gene Information

Locus tag: Caul_0345
Symbol:
ID: 5897619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 386647
End bp: 387702
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 73%
IMG OID: 641560830
Product: RluA family pseudouridine synthase
Protein accession: YP_001681980
Protein GI: 167644317
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
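
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the only reading that reproduces the listed gene length) and that the CDS ends in a stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(386647, 387702)
print(length)                     # 1056
print(protein_length_aa(length))  # 351
```

Both values agree with the Gene Length and Protein Length fields of the record.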


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATCCAGC CCCTCGACCC GCCCGCGACG CTCGCGGATG ACGAGATCGA CGACATCGAC 
GCTCCCGAAA CGGGCGCCGG CGGCGACATC GTGCGGATCG AACTGGGCGC CGACCTGGCC
GGCCAGCGCC TGGACAAGGC CCTGGCGACC GCCGCGCCGG AGCTGTCTCG CGCCCGCCTC
CAGGCCCTGA TCGCGGCGGG CCAGGTGTCG CTGGTCGTCG AGGGCGCCGC GCCGCGCGCG
ATGCCCGACG GCAAGGCCAA GGCCCCGGCC GGGCTCTACG AGGTGGTCGT GCCGCCGCCG
ACCGCCGCCG AGCCGCTGCC CGAGAACATC CCGCTGAGCG TGCTCTACGA GGACGCCCAC
CTGATCGTCA TCGACAAGCC GGCCGGCATG GCCGCCCACC CGGCCCCAGG GTGCGAGACC
GGCACCCTGG TCAACGCCCT GCTGTTCCAC TGCGGGGCCA GCCTGTCGGG AATCGGCGGC
GTGGCCCGGC CCGGCATCGT CCACCGCCTC GACAAGGAGA CGTCCGGGGT GATGGTGGCC
GCCAAGACCG ACGCCGCCCA CCAGGGCCTA TCGGCCCTGT TCGCCAAGCA CGACATCGAC
CGCATGTATC TGGCCCTGAC CCGCGGCGCG CCCCATCCGG TGGTCGGCAC GATCATCACC
CAGCTGGGCC GCTCGCCGGG CGACCGCAAG AAGATGGCGG TGCTGAAGTC CGGCGGTCGC
GAGGCGATCA CCCACTACCG CGTCGAGAAG AGCTTCGGTC CGCCGGACAA GCCCCTGGCC
TCGCGCGTCG CCTGCCGGCT GGAAACCGGC CGCACCCACC AGATCCGCGT CCACATGGCC
AGCAAGGGCA GCCCCTGCCT CGGCGACCCG GTCTACGGGG CCGGCGCCCC GGCCGCGCCG
GTCAAGGCGG CCCTGACGGA GATCGGCTTC TCGCGCCAGG CTCTACACGC CGCCGTGCTG
GGCTTCGTCC ACCCGATCAC CCGCGAACTT CTGCGCTTCG AAACGCCCCT GCCGCCCGAC
ATGGCGGCGC TGGAGGCGGC CCTTGAAGCC CTGTGA
 
Protein sequence
MIQPLDPPAT LADDEIDDID APETGAGGDI VRIELGADLA GQRLDKALAT AAPELSRARL 
QALIAAGQVS LVVEGAAPRA MPDGKAKAPA GLYEVVVPPP TAAEPLPENI PLSVLYEDAH
LIVIDKPAGM AAHPAPGCET GTLVNALLFH CGASLSGIGG VARPGIVHRL DKETSGVMVA
AKTDAAHQGL SALFAKHDID RMYLALTRGA PHPVVGTIIT QLGRSPGDRK KMAVLKSGGR
EAITHYRVEK SFGPPDKPLA SRVACRLETG RTHQIRVHMA SKGSPCLGDP VYGAGAPAAP
VKAALTEIGF SRQALHAAVL GFVHPITREL LRFETPLPPD MAALEAALEA L
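
The gene and protein sequences above can be related with a short translation routine. This is a sketch only, assuming the residue assignments of NCBI translation table 11 (which match the standard code) and its listed initiator codons, so that the leading TTG is read as methionine. `translate_cds` and `gc_percent` are hypothetical helper names, and only the first and last rows of the gene sequence are used here as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Initiator codons of translation table 11 (assumed from the NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_row = "TTGATCCAGC CCCTCGACCC GCCCGCGACG CTCGCGGATG ACGAGATCGA CGACATCGAC"
last_row = "ATGGCGGCGC TGGAGGCGGC CCTTGAAGCC CTGTGA"
print(translate_cds(first_row))  # MIQPLDPPATLADDEIDDID
print(translate_cds(last_row))   # MAALEAALEAL
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the complete protein and the GC content field to be checked the same way.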