Gene Caul_2917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2917 
Symbol 
ID5900372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3162211 
End bp3163698 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content69% 
IMG OID641563414 
Producthypothetical protein 
Protein accessionYP_001684542 
Protein GI167646879 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1545] Predicted nucleic-acid-binding protein containing a Zn-ribbon
[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.381583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0360554 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGGCA TCGCCGCCTG GGGCGCCTAT GCGCCGCGCC TTCGCCTCAG CCGCAAGGCG 
GTCACCCAGG CCAATGCCTG GGTCGCGCCG AACCTGCGGG CCAAGGCCAA GGGCGAGCGC
TCCATGGCCA ACTGGGACGA GGACGCTCTG ACCATGGCGG TCGAGGCGGC CCGCGACGCC
TTGGGGCCAG GCGACGACCG CTCCGCCATC GACGCCCTCT ATTTCGCCTC CACCACGGCG
CCGTTCACCG ATCGCCAGAA CGCGGGAATC GTCGCTGGCG CCCTGACGCT GGAGAAGGCC
ATCGCCTCGG CCGACATCAC CGGTTCGCAG CGTTGCGGCC TGGCCGCCCT GGGCCAGGCG
CTAGTGGCGG TGCAAGGCGG AGCGGCCAAG CGCGCCCTGG TGGCGACCGG CGAGCACCGC
CGGGCGCGAG CGGCGTCCGC TCAGGAGCTC GACAATGGCG ACGGCGCGGC GGCCTTCGTG
GTCACGGCCG AGGCCGGGGC GGCCGAATTC CTCGGCCGGG GCAGCGTCAC CGACGACTTC
GTCGATCACT TCCGTGGGGC CGACAGCGAC TTCGACTATC AATGGGAGGA GCGCTGGATC
CGCGACGAGG GCATCGTCAA GCTGGTCCCT CCGGCGATCC GGCAAGCGCT GGAAGTTTCT
GGCCTAAAGG CTTCGGACGT CACGCATTTC TGTTTCCCCT CCACCTTCTC CGGGATGGCC
GCCAGCCTGG CCAAGACGAT CGGGATCGCC CCCGAGGCGG TGCGCGACAA CCTGGCCTTG
ACGCTGGGCG AAGCCGGCTG CGCGCACGGC CCGCTGATGC TGGCCCACGC CCTGGAGCAA
GCCAGCCCCG GCGACGTCAT CCTGGTCGCC CAGTTCGGCC AGGGCGCCGA GGCGCTGGTG
TTCCGGGTGA CCGAAGCGGC GGCCAGCACC CGCCCGGCGC GCGGCGTCAC CGGCGCTCTG
GCTGATCGAA AAGACGAAGA CAACTATCTG AAGTTTCTCA CTTTCAATGG CCTGGTCGAA
TGGGACAAGG GCATGCGGGC CGAGAAGGAC AACAAGACGG CCCTGACCAC CCTCTATCGC
AACCAGGACA TGATCCTGGG CCTGGTCGGC GGTCGCTGCC GCGAGACCGG CGTCGTGCAG
TTCCCCCGCA CGCGCATCTC GGTCGCACCC AACAATCCGG GCGTCGACAC CCAGGAACCA
TACCGCTTCG CCGAGCGAAA GGCCTCGGTG CTGAGCTATT CGGCCGACTT CCTGACCTTC
TCGATGAGCC CGCCCAACCA CTACGGCATG ATCGTCTTCG AGGGCGGCGG GCGGATCATG
ATGGACATCA CCGACGTCGA GCAAGGCGAG GTCGACAGCG GCCTGCCGGT CAAGATGGTG
TTCCGCATCA AGGACGTCGA TGAGAAGCGC GGCTTCGTCC GCTACTTCTG GAAGGCCGCG
CCGGACCGGG TGGCGATGGC CGCCAAGACC GCCCTGGCGG CGGAATAG
 
Protein sequence
MVGIAAWGAY APRLRLSRKA VTQANAWVAP NLRAKAKGER SMANWDEDAL TMAVEAARDA 
LGPGDDRSAI DALYFASTTA PFTDRQNAGI VAGALTLEKA IASADITGSQ RCGLAALGQA
LVAVQGGAAK RALVATGEHR RARAASAQEL DNGDGAAAFV VTAEAGAAEF LGRGSVTDDF
VDHFRGADSD FDYQWEERWI RDEGIVKLVP PAIRQALEVS GLKASDVTHF CFPSTFSGMA
ASLAKTIGIA PEAVRDNLAL TLGEAGCAHG PLMLAHALEQ ASPGDVILVA QFGQGAEALV
FRVTEAAAST RPARGVTGAL ADRKDEDNYL KFLTFNGLVE WDKGMRAEKD NKTALTTLYR
NQDMILGLVG GRCRETGVVQ FPRTRISVAP NNPGVDTQEP YRFAERKASV LSYSADFLTF
SMSPPNHYGM IVFEGGGRIM MDITDVEQGE VDSGLPVKMV FRIKDVDEKR GFVRYFWKAA
PDRVAMAAKT ALAAE