Gene Caul_2184 details

Gene Information

Locus tag: Caul_2184
Symbol:
ID: 5899639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2374745
End bp: 2377075
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 71%
IMG OID: 641562675
Product: DEAD/DEAH box helicase domain-containing protein
Protein accession: YP_001683810
Protein GI: 167646147
COG category: [J] Translation, ribosomal structure and biogenesis; [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0513] Superfamily II DNA and RNA helicases
TIGRFAM ID:
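
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function name `check_gene_lengths` is hypothetical.

```python
def check_gene_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if the coordinates and lengths of a CDS record agree.

    Assumptions (not stated in the record itself):
    - start_bp and end_bp are 1-based and inclusive;
    - the CDS is unspliced and ends with exactly one stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    whole_codons = gene_len_bp % 3 == 0   # a CDS should be a multiple of 3
    codons = gene_len_bp // 3             # includes the terminal stop codon
    return span == gene_len_bp and whole_codons and codons - 1 == protein_len_aa


# Values taken from the Gene Information section of this record.
print(check_gene_lengths(2374745, 2377075, 2331, 776))  # prints True
```

Here 2377075 - 2374745 + 1 = 2331 bp, and 2331 / 3 = 777 codons, which leaves 776 amino acids once the stop codon is excluded.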


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.383881
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTTCC CCGCCTCCCA CCCCGCTCTT GAGCGGGCTC TTGCCGCTCA AGGCTATCTC 
GAACCCACCC CCGTCCAAGC CGCCGTGCTG GAAGACGAGG CCATCGGCCG CGACCTGCTG
GTCAGCGCCC AAACCGGCTC GGGCAAGACC GTCGCCTTCG GCCTGGCCGC CGCCACGACC
CTGCTCGGCG ACGCCGAGAA GTTCGCCAAG GCCGGCCTGC CGATGTGCCT GGTGATCGCC
CCGACCCGCG AGTTGGCCAT CCAGGTCAAT CGCGAACTGG CCTGGCTCTA TGCCGACGCC
GGCGCCGTGG TCGTCAACTG CGTGGGGGGC ATGGACGCCC GTCGCGAGCA GCGCGCCCTG
AACTTCGGCG CCCATATCGT CGTCGGCACC CCCGGCCGCC TGCGCGACCA CATCGAGCGC
GGCCACCTGG ACCTGTCCGA GCTGAAGGTC GCCGTCCTCG ACGAGGCCGA CGAGATGCTC
GACATGGGCT TCCGCGAAGA CCTGGAATTC ATCCTCGACG CCGCGCCGAG CGAACGCCGC
ACCCTGCTGT TCTCGGCCAC CCTGGCGCGC GAGATCGTGC AACTGGCCAA GAGCTACCAG
AACGACGCCC TGCGCATCGA CACGGTCGGC CGCAACGAAC CGCACCGCGA CATCGAATAC
CGCGCCGTCC GCGTCGCCCC CAACGAGGTC GAGCACGCGG TCGTCAACCT GCTGCGCTAT
TTCGAAAGCC CCGGCGCCCT GGTGTTCTGC AACACCCGTG AAAGCGTCCG CGCCCTGCAC
AGCAAGCTGC GCGAGCGCGG CTTCGCCGTG GTCGGCCTGT CGGGCGAACT GAGCCAGCGT
GAACGCGCCG ACGCCCTGCA GGCGCTGCGC GACGGCCACG CCCGCGTCTG CGTCGCCACC
GACGTCGCCG CGCGCGGTCT CGACCTGCCG GATCTCGGCC TGGTCATCCA CGCCGAACTG
CCGGTCAACA AGGCGACGCT TCTGCACCGT TCGGGCCGCA CCGGCCGGGC GGGCAAGAAG
GGCGTCAGCG CCCTGGTCGT CCCCTACACC CGTCGCCGCA AGGCCGAGCA GCTGCTGATG
GCCGCCGGCG TCGAAGGCCA CTGGGGCGGC GCCCCGACCG CCGACGAGAT CCGCCTCAAG
GACACCGAGC GCCTGCTCGA CGATCCGATC TTCGACGAGG CCGTAGTCGA GGAAGACACC
GCCCTGGCCG AAGCGATGCT GGCCAAGCGC ACGCCGCTGG AGATCGCCGC CGCCCTGATC
CGCACCCGCC GCGCCAAGCT GCCGGCGCCG GAAGAGATCT ATGACGACCC GCGCCACGGC
GCCGATCCGG GTCCGCGTTC GCGCGACGCC GGCCCGCGCT CGAACGACGC CGGCGACCGC
TTCGCCGACC GGGGTCCGCG CGACTTTGGC GGCGACCGTC CCGAACGCGA ACCGCGGTCC
GACATGGCCA ACTCGGCCTG GTTCCGCCTC AACACCGGCC GTCGCAACAA CGCCGACCCC
AAGTGGCTGA TCCCGCTGAT CTGCCGCCTG GGCCACATCA CCAAGAAGGA CATCGGCTCG
ATCCGGATCT TCGACTACGA CACCAAGTTC GAGATCTCGG CCGAGGCGGA AGTGAAGTTC
GGGGCCGCCG TCCAGGCCAC CGTGCGCGAC GACGTGACCA TCACCCCGAC CACGGCCCCG
GCCGCCCGGG ACGCCTCGGG TCCGCGCAAG GACTACGCGC CCCGTCCGCC GCGCGACGAC
AACGACGCGC CGCGCCGCGA ATATGCGCCA CGGCCGCCCC GCGACGACGC CGGTCCGCGT
GGCCCGCGCG CCAATGCCCC CGGCTCGCGC GACAGCAGCC CGCGTCCGTC GTCCTACAGC
GCCGACGGCG TCAGCAACGC CAAGCCCTAC GCCCGCGGCG AAGAACCCAA GCGTGACTAC
AAGCCGCGTG ACGATGGTCC CAAGCGTGAC TTCAAGCCGC GTCCGGCCGC TCGTGACGAC
GCTCCGCGCG ACTTCAAGCC CCGCGCCCCG CGCGACGAAG CCCCCAAGGC CTATTCGCCG
CTGAACGAAG GCCCGGCCCC ACGCCGCGCC CCGCGTGAAG CGCGCGAGAG CGTCCCCTAC
GATCCGGAAG CCAAGCCGGC CAAGGCGCCT TACGTCGCCC GTCCCCGCGA CGGCGCGCCG
GGCGGCAAGC CCTATGCCGG CAAGAGCAAT GATGGTCCGG CCAAGCCGTT CAAGGGCCCC
AAGCCCTTCA AGGCCAAGGG CGCGTTTGGC GACAAGCCGG CGTTCGGCGG TCCCAAGCCC
GCCTTCGGCA AGAAGCCGGC CGGCGGCAAG GCGCCGTTCA AGAAGAAGTA G
 
Protein sequence
MPFPASHPAL ERALAAQGYL EPTPVQAAVL EDEAIGRDLL VSAQTGSGKT VAFGLAAATT 
LLGDAEKFAK AGLPMCLVIA PTRELAIQVN RELAWLYADA GAVVVNCVGG MDARREQRAL
NFGAHIVVGT PGRLRDHIER GHLDLSELKV AVLDEADEML DMGFREDLEF ILDAAPSERR
TLLFSATLAR EIVQLAKSYQ NDALRIDTVG RNEPHRDIEY RAVRVAPNEV EHAVVNLLRY
FESPGALVFC NTRESVRALH SKLRERGFAV VGLSGELSQR ERADALQALR DGHARVCVAT
DVAARGLDLP DLGLVIHAEL PVNKATLLHR SGRTGRAGKK GVSALVVPYT RRRKAEQLLM
AAGVEGHWGG APTADEIRLK DTERLLDDPI FDEAVVEEDT ALAEAMLAKR TPLEIAAALI
RTRRAKLPAP EEIYDDPRHG ADPGPRSRDA GPRSNDAGDR FADRGPRDFG GDRPEREPRS
DMANSAWFRL NTGRRNNADP KWLIPLICRL GHITKKDIGS IRIFDYDTKF EISAEAEVKF
GAAVQATVRD DVTITPTTAP AARDASGPRK DYAPRPPRDD NDAPRREYAP RPPRDDAGPR
GPRANAPGSR DSSPRPSSYS ADGVSNAKPY ARGEEPKRDY KPRDDGPKRD FKPRPAARDD
APRDFKPRAP RDEAPKAYSP LNEGPAPRRA PREARESVPY DPEAKPAKAP YVARPRDGAP
GGKPYAGKSN DGPAKPFKGP KPFKAKGAFG DKPAFGGPKP AFGKKPAGGK APFKKK
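
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any length or composition check. The following is a minimal sketch of that step, assuming plain-text input like the blocks in this record; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(block):
    """Join a whitespace-formatted sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned nucleotide string."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGCCCTTCC CCGCCTCCCA CCCCGCTCTT GAGCGGGCTC TTGCCGCTCA AGGCTATCTC"
seq = clean_sequence(first_line)
print(len(seq))                    # prints 60
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Applied to the full gene sequence, the cleaned length could be compared with the listed gene length of 2331 bp and the GC fraction with the listed GC content of 71%.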