Gene Caul_5398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5398 
Symbol 
ID5897175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp110878 
End bp112494 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content63% 
IMG OID641550688 
Producthypothetical protein 
Protein accessionYP_001672174 
Protein GI167621666 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.61189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCC TGACACTCTC GCAGAACGAC TTTCGCCGCC TTGCTACGGT GCTCCGTACG 
TTGGCCCCTT CTGCCGCCCG GCCCGGTCTC AGAAAGCTAG GGTTGTTGCG CGGAGGGGTT
ACCGACGACA CGAGCGACGC GACATATGAG GGCTTTGTTA CGGCGCTTGA GGCGCTCGGG
CCCACATTTG TTAAGCTTGG GCAGATCATG GCGACCCGCT CGGATCTGCT CCCCCCCGAG
TTGACCGAAA GGCTCGGCAG GCTGCACGAT AGCGTCGCTC CGGTTCCCTT CGAACAGATA
CGTGAACAGC TGGCAATCGA TCTTGGCGGA ACGCCCGAAG ATATCTTCGC TGAATTTTTC
CCCACCCCAA TCGCCGCCGC TTCAATAGCG CAGGTCTACG CCGCGCGGCT CACGACTGGT
GAAGAAGTCG TCTTGAAAGT TCGCAGACCC GGTATCGGGG CTATCATGCG CGCGGACCTG
CGGCTGCTCG CGGCCGCAGC GCGTGTCGTC GAGCGGCGCG CGCCGGCGCT TCGTCGCCAT
CAGCCCGTAC GTGTCGTCTC CGAACTGAGT GAAGCTCTCC TGGAGGAGAT CGATTTTCGA
ATCGAGGCGC GTAATCAGTC TGAGGTGCGC AAACGAGCGA CGGTCTTCCA TGTGCCCATA
GTCCATGAGC GGTTTACCAC TGAAAGGACC ATGGTCAGCA CCCGCATCCG CGGTCGCAGC
CCTAACGCAA TGTTTCGCAA CGAGGAGCCT GCGTTGGCGC GTGCGTTGGC GCCCGAAGCG
GCCCGGGGCC TACTGGGCAT GATCCTGTTG GACGGGGTGT TCCACGCTGA CCCTCATCCC
GGAAATGTTC TGGTGACGCC AGACGGTCGT CTGGCGCTAT TGGATTTCGG CTCGGTCGGA
CGCCTCTCGG TCAAGCGGCG AGAGCAGGTG CTAGTGGTCC TCGGCTCCCT GGTGGATGCC
GATGTCGCCG CAGTGTCCGA TGTATTAATG GAATGGGCTG GCCGCTCCGG ACCGCCTCCG
ACCGGCCTTG AGCAAGGTGT CGAGCGCTTC TTTCTGCGCT ATGCAGCCGA CTCCGGGCGG
CCGATCCGGC TTGCGGAGGC TATTGGCGAA TTCCTCTCCA TCGCCCGCGA CAACGCGCTG
ACCTTGCCGC CTGACCTGTT ACTCCTGTTG CGGGCTCTCG GCATCGCGGA GGGACTGGCG
AGGACGCTCG ACCCCGAGCT GGATGTTGTC GACGCGATTG CGCCGGTGGT GGTCAAGGCC
TTCGCGTCAA GGTTTGCGCC CAAAGCCCTA GCCACCCGCG CCTTCCGTGC ATTCAAGGAA
CTCGATCAGA TTCTGGCGAC CGGGCCAGAA GCGATAAGAC GCGGCATCGG CCGCCTCAGC
CGAGACGGCC TCGCCATACG CGTCTCAAGT TCAGAATTAG CGGACTTGCC TCTTGCCGTG
AAGCGGGCCG GCGAAAGCCT TTCTCTCGCG GTAGTCGTCG CGGCGCTCGT CGTCGCCGCC
GCCATCGTAA CGGTCGCGAA CGGCGGCCAA GTCGCCGCGC CCGGACGAAC CGCGATCCTT
GTCGTTTCAG GGGTTGGTTT GGTGGCTTTC GTGGTCCGCT TGCTTAGGCG TGACTGA
 
Protein sequence
MPPLTLSQND FRRLATVLRT LAPSAARPGL RKLGLLRGGV TDDTSDATYE GFVTALEALG 
PTFVKLGQIM ATRSDLLPPE LTERLGRLHD SVAPVPFEQI REQLAIDLGG TPEDIFAEFF
PTPIAAASIA QVYAARLTTG EEVVLKVRRP GIGAIMRADL RLLAAAARVV ERRAPALRRH
QPVRVVSELS EALLEEIDFR IEARNQSEVR KRATVFHVPI VHERFTTERT MVSTRIRGRS
PNAMFRNEEP ALARALAPEA ARGLLGMILL DGVFHADPHP GNVLVTPDGR LALLDFGSVG
RLSVKRREQV LVVLGSLVDA DVAAVSDVLM EWAGRSGPPP TGLEQGVERF FLRYAADSGR
PIRLAEAIGE FLSIARDNAL TLPPDLLLLL RALGIAEGLA RTLDPELDVV DAIAPVVVKA
FASRFAPKAL ATRAFRAFKE LDQILATGPE AIRRGIGRLS RDGLAIRVSS SELADLPLAV
KRAGESLSLA VVVAALVVAA AIVTVANGGQ VAAPGRTAIL VVSGVGLVAF VVRLLRRD