Gene Caul_3240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3240 
Symbol 
ID5900695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3499991 
End bp3501448 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content70% 
IMG OID641563745 
Producthypothetical protein 
Protein accessionYP_001684865 
Protein GI167647202 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1545] Predicted nucleic-acid-binding protein containing a Zn-ribbon
[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.57921 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.496079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGG CCACGGGCAT CGTTTCGTTC GGCGCCTATG TCCCGCGCCT TCGTTTGCAG 
CGCGCGGCCA TGGCCCAGGC CACCGCCTGG TTCAATCCGG CCCTGGCGGG GCTTGGTCGC
GGCGAGCGCG CCATCGCCAA CTGGGACGAG GACGCCGTGA CGATGGCGGT CGAGGCGGCG
CGCGACTGCC TTGGCGACCG CGACCGGTCT GACCTTGGTC GGGTGATCCT GGCCTCCACC
ACCTTGCCGT TCGCCGATCG CCAGAACGCC GGAATCGTCA AGGAGGCCCT GGCGCTGGAC
GATGAGGTGG CCGCTCTGGA CGTCACCGGT TCGCAGCGCT CGGGCGCCTC GGCCCTGATC
GCCGCCCTGG AGATGGCGAC CGCCGCGCCC GTTCTCTGCA TGGCCTCCGA TCGCCGGCTG
GCGCGACCGG GCTCGGCCGC GGAATTCCAC AATGGCGACG CCGCGGCCGC GATGCTGGTC
GGCCGCGACG CGGTGATCGC CGAGTTCCTG GGCGCGCACA GCGTCACGGT GGACTTCGTC
GACCACTATC GCGCCGCCGG CCAAGACCAT GACTACGAGT GGGAAACACG CTGGATCCGC
GACGAGGGCT ACGCCAAGCT GATCCCGGCG GCGATCACTG GCGCCCTGCA CAAGCTGGGG
CTCGAGGCCA GCGCGGTCGA TGTCCTGATC ACAGCCGTCC CAGCCGCTGG CGTCGATCGC
CTGGTGGCCG CGGCCGCGGG AGTCAGGCCC GAAGCCGTGT GCGAGCCCCT GCATGATCGG
CTGGGGTTCG CGGGCGCGGC CCAACCGCTC GTCCTGCTGG CCCAAGCCCT GGCGACGGCC
AGGCCCGGCA TGCTGATCCT GGTCGCGGCG TTCGGCCAGG GCGTCGATGT CCTGGCGTTC
CGGACGACGG AGCAGATCAC CAGGCGCAAG GCGTGCCTCG GTGTCGATGG TTGGCTGGCG
CGTCGGCGCC CGGAGTCCAA CTACGTCAAG CACCTGTCGT TTACTGGCGA GGTGGCCCTG
GACGGCGGCA TGCGCGCGGA ACTGGACCTC AAGACGCCGC CGACCATGCT CTATCGTGAC
CGGCGCACCA TCCTGTCGCT GATGGGCGGA CGCTGCCGTG TGACCGGCGC CGTCCAGTAC
CCCAAGACCG ACATATCCGT TTCACCCAAC GCCCGCCTCG TCGGCACCCA GGACGACTAT
CGCCTGGCCG ACCTGCGGGC GCGGGTCGTC ACCTTCACCG CCGACCACCT GGCTTTCAGC
CCCGATCCGC CCGGTTGCTA CGGCATGATC GACTTCGACG GCGGGGGCCG GATGATGGTC
GACATGGTCG ACCTGGACGA GGACGGCCTC AAGGTCGGCG ACCCGGTGCG GATGATGTTC
CGCCTCAAGC GCGACGATGT CCGGGGCTTC AAGCACTACT TCTGGAAGGC CGCGCCGGAC
TACCGGCCGG CGAACTGA
 
Protein sequence
MTMATGIVSF GAYVPRLRLQ RAAMAQATAW FNPALAGLGR GERAIANWDE DAVTMAVEAA 
RDCLGDRDRS DLGRVILAST TLPFADRQNA GIVKEALALD DEVAALDVTG SQRSGASALI
AALEMATAAP VLCMASDRRL ARPGSAAEFH NGDAAAAMLV GRDAVIAEFL GAHSVTVDFV
DHYRAAGQDH DYEWETRWIR DEGYAKLIPA AITGALHKLG LEASAVDVLI TAVPAAGVDR
LVAAAAGVRP EAVCEPLHDR LGFAGAAQPL VLLAQALATA RPGMLILVAA FGQGVDVLAF
RTTEQITRRK ACLGVDGWLA RRRPESNYVK HLSFTGEVAL DGGMRAELDL KTPPTMLYRD
RRTILSLMGG RCRVTGAVQY PKTDISVSPN ARLVGTQDDY RLADLRARVV TFTADHLAFS
PDPPGCYGMI DFDGGGRMMV DMVDLDEDGL KVGDPVRMMF RLKRDDVRGF KHYFWKAAPD
YRPAN