Gene Caul_4850 details


Gene Information

Locus tag: Caul_4850
Symbol:
ID: 5902312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5245466
End bp: 5246557
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 72%
IMG OID: 641565370
Product: aminoglycoside phosphotransferase
Protein accession: YP_001686468
Protein GI: 167648805
COG category: [R] General function prediction only
COG ID: [COG3178] Predicted phosphotransferase related to Ser/Thr protein kinases
TIGRFAM ID:
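
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of this length, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5245466, 5246557)
print(length)                  # 1092
print(protein_length(length))  # 363
```

Both values agree with the Gene Length and Protein Length fields of this record.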


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.191997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCTCTG ATCGCGAAAC CCTCAAGACG GCCTTTCTGA CGGCCAACGG CTTCGGCGAC 
GCCCGCCGCG AAGCCCTGAG CGGCGACGCC TCGACCCGGA GCTACGAGCG CCTATATCGC
GGCGACGAAC GCTTCATCTT CATGGACCAG CCGCCGGCCC TGGAGAGCGT GGTCTGTCCG
CCGGGCGCCA GCGACGCCGA GCGCCTGGCC CTGGGCTACA ACGCCGCCGC CCGCCTGGCC
GCCGGCTCGG TCGCCGCCTT CGTGGCCACG GCCGCCTATC TGCGCGGGCG CGGCCTGTCG
GCCCCAGCCA TCCTGGCCCG TGACATCGCG GCGGGCCTGG CGGTGCTGGA AGACCTGGGC
GACGGCCTCT ACGCCACGCT GATCGCCGAC GGCCAGGACG AGACCCCGCT CTACGAGGCC
GCCGTCGACG TCCAGGTGGC CCTGCACGGC GAGACCCCGC CGGACGTCCT CACCGCCGAA
GGCGGCGTGG CCTGGCCGCT GCTGACCTAT GATGCGCTGG CCCTGAAGAT CGCCACCGAC
ACCTTCCTGG AGTTCTGGCC GAAGTTCTCG GGCCTGGCGC CATTCAGCGA CGCCGCCGTG
GCCGACTGGG ACGCCCTGTG GGCGCCGGTC TGGGTGCGCG GCGAGGCCGG CGCCAGCGTC
TTCACCCACC GCGACTATCA CGCCCAGAAC CTGCTGTGGC TGCCCGAGCG CGACGGCGTG
GCCCGCGTGG GCCTGCTGGA CTTCCAGGAT GCCCTGCGCG CCCACCCGGC CTGGGATCTG
ACCCACCTGC TGCAGGACGC CCGCCGCGAC GTCTCGCCGG AGTTGGAACA GGCCATGCTC
GACCGCTACC TGACCGCACG GCCCTTGATG GACCGCGAAG CCTTCATCGC CGACTACCGC
GCCCTGGCCG CCTCCAACGC CGCGCGGATC CTAGGCCGGG TGTTCGCCCG CCAGGCCCTG
CTGGGTCGGC CGCAGTACGA GGCCTACATG CCGCGCACCT GGCGCTATCT GGAGCGCAAT
CTCCAGGACC CGGCGATGGC GGGGCTGAAG GCCTGGTTCG ACCGGTACGT GCCGTCGGCG
TTCCGCCGAT GA
 
Protein sequence
MSSDRETLKT AFLTANGFGD ARREALSGDA STRSYERLYR GDERFIFMDQ PPALESVVCP 
PGASDAERLA LGYNAAARLA AGSVAAFVAT AAYLRGRGLS APAILARDIA AGLAVLEDLG
DGLYATLIAD GQDETPLYEA AVDVQVALHG ETPPDVLTAE GGVAWPLLTY DALALKIATD
TFLEFWPKFS GLAPFSDAAV ADWDALWAPV WVRGEAGASV FTHRDYHAQN LLWLPERDGV
ARVGLLDFQD ALRAHPAWDL THLLQDARRD VSPELEQAML DRYLTARPLM DREAFIADYR
ALAASNAARI LGRVFARQAL LGRPQYEAYM PRTWRYLERN LQDPAMAGLK AWFDRYVPSA
FRR
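
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, where TTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch of that translation, not the database's own pipeline; `translate` and the constant names are hypothetical, and the codon table is built from the standard NCBI base ordering (T, C, A, G).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
head = "TTGAGCTCTG ATCGCGAAAC CCTCAAGACG GCCTTTCTGA CGGCCAACGG CTTCGGCGAC"
tail = "TTCCGCCGAT GA"
print(translate(head))  # MSSDRETLKTAFLTANGFGD
print(translate(tail))  # FRR
```

The two outputs match the first 20 residues and the last 3 residues of the protein sequence in this record.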