Gene Caul_3702 details


Gene Information

Locus tag: Caul_3702
Symbol:
ID: 5901158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3996464
End bp: 3997906
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 69%
IMG OID: 641564213
Product: tRNA (uracil-5-)-methyltransferase Gid
Protein accession: YP_001685327
Protein GI: 167647664
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1206] NAD(FAD)-utilizing enzyme possibly involved in translation
TIGRFAM ID: [TIGR00137] tRNA:m(5)U-54 methyltransferase
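
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3996464
END_BP = 3997906
GENE_LENGTH_BP = 1443
PROTEIN_LENGTH_AA = 480


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions both values agree with the record: the span is 1443 bp, and 1443 / 3 = 481 codons, which leaves 480 residues once the stop codon is excluded.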


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACTT CCCCCAAACC CGTCGCCCCC ATCCACGTCA TCGGGGGCGG CCTGGCCGGG 
TCCGAGGCCG CCTGGCAGAT CGCCCAGGCC GGCGTCCCGG TCGTCCTGCA CGAGATGCGT
CGCGACCTGC CCTCAAGCAG CGGAGCGGTA GGGCCAACAA AAGTGCGCAC CGACGCCCAT
CAGACCGACG GCCTGGCCGA GATGGTCTGC TCCAATTCGT TCCGGTCCGA CGACTGGCAG
TTCAACGCCG TCGGCCTGCT GCATGCCGAG ATGCGCAAGC TGGACTCGTT GATCCTGTCG
GCCGCCGACC AGCACCAGGT GCCGGCCGGC GGCGCCCTGG CCGTCGACCG CGACGGCTTC
TCGGCCGAGG TCACCCGCCG TATCGAAGCG CATCCGCTGA TCACCATCGA ACGCGAGGAG
GTCGCGGGCT TGCCGCCGGA AGACTGGGAC AGCGTGGTGG TGGCCACCGG CCCCCTCACC
TCGCCCGCCC TGGCCGACGC GATCCTCGAG CTCAGCGGCG AAGGCCAGCT CAGCTTCTTC
GACGCCATCG CCCCGATCAT CCACGTCGAG TCGATCGACA TGGACATCGC CTGGCGCCAG
TCGCGCTACG ACAAGGAAGG CCCCGGCGGA GACGCGGCCG CCTACATCAA CTGCCCGATG
AACAAGGCGC AATACGAGGC CTTCATCGAC GCCCTGCTCG AAGGCCCCAA GGCCGAGTTC
AAGGACTGGG AGCACGTGCC CTATTTCGAC GGCTGCCTGC CGATCGAAGT CATGGCCGAG
CGTGGCCGCG AGACCCTGCG CCACGGCCCG ATGAAGCCGG TCGGCCTGAC CAACCCGCGT
GACCCGACCG TGAAAGCCTA CGCCATCGTG CAACTGCGCC AGGACAACGC CCTGGGCACG
CTGTGGAACA TGGTCGGCTT CCAGACCAAG CTGAAGCACG GCGCCCAGGC CGAGGTCTTC
CGGATGATCC CGGGCCTGCA AAACGCCCAG TTCGCGCGGC TGGGCGGCCT GCACCGCAAC
ACCTTCATCA ACAGCCCGCG CCTGCTGGAC CGGTCCCTGC GAATGAAGGT CGCGCCGCGC
CTGCGCTTCG CGGGTCAGAT GACCGGGGTC GAGGGCTATG TCGAAAGCGC CGCCACGGGC
CTGCTGGCCG GCCGCTTCGC CGCCGCCGAG CGCCTAGGCA AGACGCTGGA CGCCCCGCCG
CCGACCACCG CCCTGGGCGC TCTGGTCGAC CACGTGACCG GCGGCCACAT CGAGGGCGAA
GCGCTGGGCA AGACCAGCTT CCAGCCGATG AACATCAACT ATGGCCTGCT GCCGCCGACG
GAGACCCCCA AGGTCGGGGA CGACGGCGTC AAGATCCCGA TGAAGGAACG CGGCCGGGCC
AAGAAGCGGC TGATGAGCCT CCGGGCGCTG GCGGATCTGG ATCAGTGGAT GGCAGGGGCC
TGA
 
Protein sequence
MSTSPKPVAP IHVIGGGLAG SEAAWQIAQA GVPVVLHEMR RDLPSSSGAV GPTKVRTDAH 
QTDGLAEMVC SNSFRSDDWQ FNAVGLLHAE MRKLDSLILS AADQHQVPAG GALAVDRDGF
SAEVTRRIEA HPLITIEREE VAGLPPEDWD SVVVATGPLT SPALADAILE LSGEGQLSFF
DAIAPIIHVE SIDMDIAWRQ SRYDKEGPGG DAAAYINCPM NKAQYEAFID ALLEGPKAEF
KDWEHVPYFD GCLPIEVMAE RGRETLRHGP MKPVGLTNPR DPTVKAYAIV QLRQDNALGT
LWNMVGFQTK LKHGAQAEVF RMIPGLQNAQ FARLGGLHRN TFINSPRLLD RSLRMKVAPR
LRFAGQMTGV EGYVESAATG LLAGRFAAAE RLGKTLDAPP PTTALGALVD HVTGGHIEGE
ALGKTSFQPM NINYGLLPPT ETPKVGDDGV KIPMKERGRA KKRLMSLRAL ADLDQWMAGA
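
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the method used to generate the record: it assumes the sequence is pasted in the 10-base blocks shown here, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop.

    Codons containing unexpected characters become 'X'; a trailing
    partial codon is ignored.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = (
        "ATGAGCACTT CCCCCAAACC CGTCGCCCCC ATCCACGTCA TCGGGGGCGG CCTGGCCGGG"
    )
    print(translate(first_line))
    print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content field of the record. The first 60 bases translate to `MSTSPKPVAPIHVIGGGLAG`, matching the start of the protein sequence.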