Gene Caul_4779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4779 
Symbol 
ID5902241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5160291 
End bp5161571 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content74% 
IMG OID641565299 
ProductFmu (Sun) domain-containing protein 
Protein accessionYP_001686397 
Protein GI167648734 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.849075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.940845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCAGG AACTCAATGA CGGCCTCCCG GCACGGGAAG GCGCCCTCGC CCTCATCGAC 
GCGGCCCTGT CGCGGCGCGG CGGGCTCGAC GAGGCCGCCT CGGCCAACGC TTTTCGCTTT
CTCGAACCGC GGGAGCGCGC CTTCGCGCGC GCCCTGGCCA TGGCCACCTT GCGTCATCTG
GGACCCATCG ACCGCGCCCT GGCCGGCAAG CTGGCCAAGG AACCACCGCC CCGCGTGCGC
AACCTGCTGC GCCTGGGCGC GACCCAGGCC TTCTTCCTGG AGGTGCCCGC CTTCGCCGCC
GTCGCCACCA GCGTCGAACT GGCCGGCGCC AGCAAGACCA GCCGCCCGTT CAAGGGCCTG
GTCAACGCCG TGCTGCGCGG CCTGCTGCGC GACGGCGCCC TGTCCGACGC TTCGGAACAC
CTGGCTCCGC CGTGGCTCTA CGCCCGCTGG GTTAGCGCCT ATGGCAAGGA GACCGCCGAC
GCGGTCGCCG CCCAGATCGG CTTCGAGCCG GCCACCGACC TTTCCTTGAA GCCCGACTTC
GACGCCACGG CGCTGGCCGC CGAGCTGGAG GGCGAGATCC TGCCCGGCGG CACGCTGCGC
ACCGAGCGGC GCGGCGACGT CTCGGCCTGG CCGGCCTTCG ACGACGGCGT CTGGTGGATC
CAGGACGCCG CCGCCGCCAT CCCCGCCCGC CTGCTGAACC TCAAGCCCGG CGAAACGGCG
CTCGACCTCT GCGCCGCGCC CGGCGGCAAG ACGATGCAGA TGGTCGCGGC CGGGGCCCAG
GTCGTCGCCA TCGACCGCTC GCCCGCCCGG CTGGGCCGCG TCACCGAGAA CCTGGCCCGC
ATGTCCATGC AGGCCGAGGT GATCGCCGCC GACGCCGGAG CCTGGGACGA TGCGCGCACC
TTCGACGCGG TGCTGCTGGA CGCCCCCTGC TCGGCCACCG GCACCTTCCG CCGCCACCCC
GACGTGTTGT GGGCCGCCCG CCCCGGCGAC GTCGCCAGCC TGGCCGGCGT GCAGAGCAAG
CTGCTCGACA GCGCGGCGGG CCGACTCAAG CCCGGTGGCC GTCTGGTCTA TTGCGTCTGC
TCGCTGGAGC CCGAAGAGGG CGAGGCCCAG GTCGAGGCGT TCCTCGCCCG CCGCCCGGAC
ATGGCGCTGG ATCCGATCAC CTCGGAGGAA GGCGGCGCTC CGGCCGCCAG CCTGACGCCG
CGCGGCACGC TGCGCATCCT GCCCCACCAC CGCGAGGGCG GACTGGACGG CTTCTTCGCG
GCGCGGTTCG TGAAGCTCTA A
 
Protein sequence
MTQELNDGLP AREGALALID AALSRRGGLD EAASANAFRF LEPRERAFAR ALAMATLRHL 
GPIDRALAGK LAKEPPPRVR NLLRLGATQA FFLEVPAFAA VATSVELAGA SKTSRPFKGL
VNAVLRGLLR DGALSDASEH LAPPWLYARW VSAYGKETAD AVAAQIGFEP ATDLSLKPDF
DATALAAELE GEILPGGTLR TERRGDVSAW PAFDDGVWWI QDAAAAIPAR LLNLKPGETA
LDLCAAPGGK TMQMVAAGAQ VVAIDRSPAR LGRVTENLAR MSMQAEVIAA DAGAWDDART
FDAVLLDAPC SATGTFRRHP DVLWAARPGD VASLAGVQSK LLDSAAGRLK PGGRLVYCVC
SLEPEEGEAQ VEAFLARRPD MALDPITSEE GGAPAASLTP RGTLRILPHH REGGLDGFFA
ARFVKL