Gene Caul_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0041 
Symbol 
ID5897753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp48198 
End bp50348 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content64% 
IMG OID641560524 
Productpolynucleotide phosphorylase/polyadenylase 
Protein accessionYP_001681677 
Protein GI167644014 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) 
TIGRFAM ID[TIGR03591] polyribonucleotide nucleotidyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.382985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.134416 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAAA TCATGCGCAA GACGATCGAG TGGGGCGGCA AGACGCTGGT CCTCGAAACC 
GGTCGCATCG CCCGTCAGGC CGACGGCGCC GTTCTGGCCA CCATGGGCGA AACCGTCGTC
CTGGCCACCG CCGTGTTCGC CAAGAAGGCC AAGCCGGGTC AGGACTTCTT CCCCCTGACC
GTCAACTACA TCGAGAAGAC CTACGCCGCG GGCAAGATCC CGGGCGGCTT CTTCAAGCGC
GAAGGTCGTC CGTCGGAAAA GGAAACCCTG GTTTCCCGCC TGATCGACCG CCCGATCCGC
CCGCTGTTCG TCAAGGGCTT CAAGAACGAA GTCCAGGTCG TCTGCACCGT GCTGCAGCAC
GACCTTGAGA ACGATCCCGA CATCGTCGCC ATGTGCGCCG CCTCGGCCGC CCTGGTCATT
TCCGGCGCCC CGTTCATGGG CCCGATCGGC GGCTGCCGCG TGGGTTACGT CAACGACGAG
TACATCCTGA ACCCGACGCT CGACGAGCTG AAGGAAAGCA AGATGGACCT GGTCGTGGCC
GGCACCGCCG ACGCCGTGAT GATGGTCGAA TCCGAAATCC AGGAACTCTC GGAAGAGATC
GTCCTGGGCG GCGTCAGCTT CGCCCACAAG TCGATGCAGG CCGTGATCAA CGGCATCATC
GAGCTGGCCG AACACGCCGC CAAGGAGCCC TTCGACTTCC AGCCGGAAGA CACCGACGCC
CTGAAGGCCG AAGTGAAGAA GGCCGTCGGC GCCGACCTGG CCGACGCCTA CACCATCCGC
GCCAAGGGCG ACCGTCACGC CGCCCTGTCA GCCGCCAAGT CCAAGGCCGT CGACACCTTC
GCCAAGAGCG ATGCGAACCC GGCCGGCATC GATCCGCTGA AGCTGATCTC GGTGTTCAAG
GAACTGGAAG CCGACATCGT TCGTCGCTCC ATCCTCGACA CCGGCATCCG CATCGACGGC
CGCACCGTCG ACACCGTGCG CCCGATCCTG GGTGAAGTCG GCATCCTGCC GCGCACCCAC
GGCTCGGCCC TGTTCACCCG CGGCGAAACC CAAGCCATCG TCGTGGCCAC CCTGGGCACC
GGCGACGACG AGCAGTTCAT CGACGCCCTG GAAGGCACCT ACAAGGAAGC CTTCCTGCTG
CACTACAACT TCCCTCCCTT CTCGGTCGGC GAGACCGGTC GGATGGGCAG CCCCGGCCGC
CGCGAAATCG GCCACGGCAA GCTGGCCTGG CGCGCCCTGC GCCCGATGCT GCCGGCCAAG
GAAGACTTCC CCTACACCAT CCGCCTGGTC TCCGAGATCA CCGAGTCGAA CGGCTCGTCC
TCGATGGCCA CGGTCTGCGG CGCCTCGCTG GCTATGATGG ACGCGGGCGT TCCGCTGATC
CGTCCGGTCT CGGGTATCGC CATGGGCCTG ATCCTCGAAA AGGACGGCTT CGCCGTGCTG
TCCGACATCC TGGGTGACGA AGATCACCTG GGCGACATGG ACTTCAAGGT GGCCGGCACC
AGCCAGGGTC TGACCTCGCT GCAGATGGAC ATCAAGATCG CCGGCATCAC CGAAGAGATC
ATGAAGCAAG CGCTGGCCCA GGCTAAGGGC GGTCGCGAGC ACATCCTCGG CGAGATGAAC
AAGGCGATGG ATGCGCCGCG CGAAGAAGTC GGCGACTACG CGCCGAAGAT CGAAACCATC
ACCATCCCGA CCGACAAGAT CCGGGAAGTG ATCGGCACCG GCGGCAAGGT GATCCGCGAG
ATCGTCGCCA CCACCGGCGC CAAGGTCGAC ATCAACGACG AAGGCACGGT CAAGGTCTCG
GCCTCGGACG GCGCCAAGAT CAAGGCCGCG ATCGACTGGA TCAAGTCGAT CACGCAAGAA
GCTGAAGTCG GCGCGATCTA CGACGGCAAG GTCGTGAAGG TCGTCGATTT CGGCGCCTTC
GTGAACTTCT TCGGCGCCAA GGACGGCCTG GTCCACGTCA GCCAGATCAG CAACGAACGG
GTCGCCAAGC CCTCGGACGT GCTGAAGGAA GGCCAGATCG TCAAAGTGAA GCTTCTCGGC
TTCGACGATC GCGGCAAGAC CAAGCTGTCG ATGAAGGTCG TCGACCAGGA AACCGGCGAA
GACCTGTCCA AGAAGGAAGC CGTGAGCCCG GAGGAAGCCG TCAACACCTA A
 
Protein sequence
MFEIMRKTIE WGGKTLVLET GRIARQADGA VLATMGETVV LATAVFAKKA KPGQDFFPLT 
VNYIEKTYAA GKIPGGFFKR EGRPSEKETL VSRLIDRPIR PLFVKGFKNE VQVVCTVLQH
DLENDPDIVA MCAASAALVI SGAPFMGPIG GCRVGYVNDE YILNPTLDEL KESKMDLVVA
GTADAVMMVE SEIQELSEEI VLGGVSFAHK SMQAVINGII ELAEHAAKEP FDFQPEDTDA
LKAEVKKAVG ADLADAYTIR AKGDRHAALS AAKSKAVDTF AKSDANPAGI DPLKLISVFK
ELEADIVRRS ILDTGIRIDG RTVDTVRPIL GEVGILPRTH GSALFTRGET QAIVVATLGT
GDDEQFIDAL EGTYKEAFLL HYNFPPFSVG ETGRMGSPGR REIGHGKLAW RALRPMLPAK
EDFPYTIRLV SEITESNGSS SMATVCGASL AMMDAGVPLI RPVSGIAMGL ILEKDGFAVL
SDILGDEDHL GDMDFKVAGT SQGLTSLQMD IKIAGITEEI MKQALAQAKG GREHILGEMN
KAMDAPREEV GDYAPKIETI TIPTDKIREV IGTGGKVIRE IVATTGAKVD INDEGTVKVS
ASDGAKIKAA IDWIKSITQE AEVGAIYDGK VVKVVDFGAF VNFFGAKDGL VHVSQISNER
VAKPSDVLKE GQIVKVKLLG FDDRGKTKLS MKVVDQETGE DLSKKEAVSP EEAVNT