Gene Caul_4639 details

Gene Information

Locus tag: Caul_4639
Symbol:
ID: 5902101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5016808
End bp: 5018313
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 71%
IMG OID: 641565158
Product: RNA-binding S4 domain-containing protein
Protein accession: YP_001686257
Protein GI: 167648594
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
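
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(5016808, 5018313)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the 1506 bp gene length and 501 aa protein length reported in the record.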


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA AATACCAGCC CCACCTGCAT GACAACGCCA TTCCCGGTCA AGACGACGAC 
GGGGAAGGCG CGCGGGTCGC CAAGATGCTG GCCCGGGCCG GCGTGGCCTC GCGGCGCGCG
GTCGAACGGC TGATCGAGGA CGGCCGGGTC GCCCTGAACG GCGAGGTTCT GACCACCCCG
GCCATCAAGG TCCGCCCCGG CGACATCCTG ACCGTCGACG GCAAGATGAT CGACGAGCCC
GAGGCGACGC GCGTCTTCCG CTACCACAAG CCCTCCGGGC TGATGACCAC CCACAACGAT
CCCAAGCAGC GCCCGACGGT GTTCCAGGCC CTGCCGCGCG ACCTGCCGCG CCTGATCTCG
GTCGGACGAC TGGACCTCAA CTCCGAAGGC CTGCTGCTGC TGACCAACGA CGGGGCTCTG
TCACGGGCCC TGGAGATGCC GCAGAACGCC TGGGTGCGCC GCTATCGGGC CCGCGCGTTC
GGCGACACCA CCCAGGCCAA GCTGGACAAG CTGAAGGACG GCTGCACCGT CGAGGGCGTC
CGATACGGCC CGATCGAGGC GCGGCTCGAC AAGGCCCAGG AAAAGGCCGG CGGCGGCAAG
AACATCTGGA TCACCCTGAC CCTCAGCGAG GGCAAGAACC GCGAAGTGCG GCGGGTGCTG
GAATCCATCG GCCTGAAGGT CAACCGCCTG ATCCGCCTGT CCTACGGCCC GTTCGCGCTC
GGAACCCTGC TGCCGGGCCA GGTCGAGGAG GTCGGTCCCC GGGTGATCCG CGAGCTGCTG
GAAGGCATCG TCGCCGAAGA GAACATGCCC AAGGGCGACA AGCCGCAATT CATCGGCGTG
GCCGATCCGC TGAAGGCCGT CGGCACCGCG GGCGGCGGCG ACATGCAGCG GCGCGGCGTG
CCGCGCACCA ACAAGCTGAC CCAGGTCTCG ATCATCACGC CCGAGGAGCC GGTCGAGGAA
GAGAAGTTCG TCCGCAAGCC GGGCTGGGCC AAGCCCAAGA AGAAGCCGGC GATCGTCGGT
CGCGAGCCCG TGCGCACAGC CAAGAAGTCG ATCGAGAGCA AGATGATCGG CCCAAAGCCC
CTGTCCTACC GCGACGCCGC CGCCAAGCGG GTCCGCGACA AGGGCATGGC CGACAAGAAC
GCGGCCGACA AGCGGGCGGC GAGCGGCAAG CCGGCGCGGC CCGACAAGCC TGCCGGCCAG
CCCTCCGGCG GGCACACGTC CAAGCCGAAG CTCGGCGCGC TGCGGGCCAA TGGCTACAAG
CCGCTGACCG AGGGTCCGGC AAGGTCGGCG GGCAAGCCCG GCGGCGCGCG TCCGGGCGGC
AAGCCCAGCA CGGTCAAGGC CGCCGGCGCG GGCAAGCCTG GCGAACCGGG CAAGGTGTGG
TCCAAGCCCG GCATGGCAAA GTCGGCCGGG CCTCGCCCCG ACGGCCCCAA GGGTCCGCCG
CGCGCCGGCG GCAAGCCGGG CGGCCCACGT CCGGGCGGTT CGAGCGCGCC GCGCGGCAAG
CGATAG
 
Protein sequence
MTEKYQPHLH DNAIPGQDDD GEGARVAKML ARAGVASRRA VERLIEDGRV ALNGEVLTTP 
AIKVRPGDIL TVDGKMIDEP EATRVFRYHK PSGLMTTHND PKQRPTVFQA LPRDLPRLIS
VGRLDLNSEG LLLLTNDGAL SRALEMPQNA WVRRYRARAF GDTTQAKLDK LKDGCTVEGV
RYGPIEARLD KAQEKAGGGK NIWITLTLSE GKNREVRRVL ESIGLKVNRL IRLSYGPFAL
GTLLPGQVEE VGPRVIRELL EGIVAEENMP KGDKPQFIGV ADPLKAVGTA GGGDMQRRGV
PRTNKLTQVS IITPEEPVEE EKFVRKPGWA KPKKKPAIVG REPVRTAKKS IESKMIGPKP
LSYRDAAAKR VRDKGMADKN AADKRAASGK PARPDKPAGQ PSGGHTSKPK LGALRANGYK
PLTEGPARSA GKPGGARPGG KPSTVKAAGA GKPGEPGKVW SKPGMAKSAG PRPDGPKGPP
RAGGKPGGPR PGGSSAPRGK R
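
The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch of how such a block might be cleaned, checked for GC content, and translated is shown below. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11 also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record). All function names are hypothetical, and translation simply stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon (not included)."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = clean_sequence(
    "ATGACCGAGA AATACCAGCC CCACCTGCAT GACAACGCCA TTCCCGGTCA AGACGACGAC"
)
print(translate(first_line))
```

Applied to the first displayed line of the gene sequence, this yields the first twenty residues of the protein sequence, MTEKYQPHLH DNAIPGQDDD once regrouped in tens. Passing the whole cleaned gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.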