Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0041 |
Symbol | |
ID | 5897753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 48198 |
End bp | 50348 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641560524 |
Product | polynucleotide phosphorylase/polyadenylase |
Protein accession | YP_001681677 |
Protein GI | 167644014 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) |
TIGRFAM ID | [TIGR03591] polyribonucleotide nucleotidyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.382985 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.134416 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGAAA TCATGCGCAA GACGATCGAG TGGGGCGGCA AGACGCTGGT CCTCGAAACC GGTCGCATCG CCCGTCAGGC CGACGGCGCC GTTCTGGCCA CCATGGGCGA AACCGTCGTC CTGGCCACCG CCGTGTTCGC CAAGAAGGCC AAGCCGGGTC AGGACTTCTT CCCCCTGACC GTCAACTACA TCGAGAAGAC CTACGCCGCG GGCAAGATCC CGGGCGGCTT CTTCAAGCGC GAAGGTCGTC CGTCGGAAAA GGAAACCCTG GTTTCCCGCC TGATCGACCG CCCGATCCGC CCGCTGTTCG TCAAGGGCTT CAAGAACGAA GTCCAGGTCG TCTGCACCGT GCTGCAGCAC GACCTTGAGA ACGATCCCGA CATCGTCGCC ATGTGCGCCG CCTCGGCCGC CCTGGTCATT TCCGGCGCCC CGTTCATGGG CCCGATCGGC GGCTGCCGCG TGGGTTACGT CAACGACGAG TACATCCTGA ACCCGACGCT CGACGAGCTG AAGGAAAGCA AGATGGACCT GGTCGTGGCC GGCACCGCCG ACGCCGTGAT GATGGTCGAA TCCGAAATCC AGGAACTCTC GGAAGAGATC GTCCTGGGCG GCGTCAGCTT CGCCCACAAG TCGATGCAGG CCGTGATCAA CGGCATCATC GAGCTGGCCG AACACGCCGC CAAGGAGCCC TTCGACTTCC AGCCGGAAGA CACCGACGCC CTGAAGGCCG AAGTGAAGAA GGCCGTCGGC GCCGACCTGG CCGACGCCTA CACCATCCGC GCCAAGGGCG ACCGTCACGC CGCCCTGTCA GCCGCCAAGT CCAAGGCCGT CGACACCTTC GCCAAGAGCG ATGCGAACCC GGCCGGCATC GATCCGCTGA AGCTGATCTC GGTGTTCAAG GAACTGGAAG CCGACATCGT TCGTCGCTCC ATCCTCGACA CCGGCATCCG CATCGACGGC CGCACCGTCG ACACCGTGCG CCCGATCCTG GGTGAAGTCG GCATCCTGCC GCGCACCCAC GGCTCGGCCC TGTTCACCCG CGGCGAAACC CAAGCCATCG TCGTGGCCAC CCTGGGCACC GGCGACGACG AGCAGTTCAT CGACGCCCTG GAAGGCACCT ACAAGGAAGC CTTCCTGCTG CACTACAACT TCCCTCCCTT CTCGGTCGGC GAGACCGGTC GGATGGGCAG CCCCGGCCGC CGCGAAATCG GCCACGGCAA GCTGGCCTGG CGCGCCCTGC GCCCGATGCT GCCGGCCAAG GAAGACTTCC CCTACACCAT CCGCCTGGTC TCCGAGATCA CCGAGTCGAA CGGCTCGTCC TCGATGGCCA CGGTCTGCGG CGCCTCGCTG GCTATGATGG ACGCGGGCGT TCCGCTGATC CGTCCGGTCT CGGGTATCGC CATGGGCCTG ATCCTCGAAA AGGACGGCTT CGCCGTGCTG TCCGACATCC TGGGTGACGA AGATCACCTG GGCGACATGG ACTTCAAGGT GGCCGGCACC AGCCAGGGTC TGACCTCGCT GCAGATGGAC ATCAAGATCG CCGGCATCAC CGAAGAGATC ATGAAGCAAG CGCTGGCCCA GGCTAAGGGC GGTCGCGAGC ACATCCTCGG CGAGATGAAC AAGGCGATGG ATGCGCCGCG CGAAGAAGTC GGCGACTACG CGCCGAAGAT CGAAACCATC ACCATCCCGA CCGACAAGAT CCGGGAAGTG ATCGGCACCG GCGGCAAGGT GATCCGCGAG ATCGTCGCCA CCACCGGCGC CAAGGTCGAC ATCAACGACG AAGGCACGGT CAAGGTCTCG GCCTCGGACG GCGCCAAGAT CAAGGCCGCG ATCGACTGGA TCAAGTCGAT CACGCAAGAA GCTGAAGTCG GCGCGATCTA CGACGGCAAG GTCGTGAAGG TCGTCGATTT CGGCGCCTTC GTGAACTTCT TCGGCGCCAA GGACGGCCTG GTCCACGTCA GCCAGATCAG CAACGAACGG GTCGCCAAGC CCTCGGACGT GCTGAAGGAA GGCCAGATCG TCAAAGTGAA GCTTCTCGGC TTCGACGATC GCGGCAAGAC CAAGCTGTCG ATGAAGGTCG TCGACCAGGA AACCGGCGAA GACCTGTCCA AGAAGGAAGC CGTGAGCCCG GAGGAAGCCG TCAACACCTA A
|
Protein sequence | MFEIMRKTIE WGGKTLVLET GRIARQADGA VLATMGETVV LATAVFAKKA KPGQDFFPLT VNYIEKTYAA GKIPGGFFKR EGRPSEKETL VSRLIDRPIR PLFVKGFKNE VQVVCTVLQH DLENDPDIVA MCAASAALVI SGAPFMGPIG GCRVGYVNDE YILNPTLDEL KESKMDLVVA GTADAVMMVE SEIQELSEEI VLGGVSFAHK SMQAVINGII ELAEHAAKEP FDFQPEDTDA LKAEVKKAVG ADLADAYTIR AKGDRHAALS AAKSKAVDTF AKSDANPAGI DPLKLISVFK ELEADIVRRS ILDTGIRIDG RTVDTVRPIL GEVGILPRTH GSALFTRGET QAIVVATLGT GDDEQFIDAL EGTYKEAFLL HYNFPPFSVG ETGRMGSPGR REIGHGKLAW RALRPMLPAK EDFPYTIRLV SEITESNGSS SMATVCGASL AMMDAGVPLI RPVSGIAMGL ILEKDGFAVL SDILGDEDHL GDMDFKVAGT SQGLTSLQMD IKIAGITEEI MKQALAQAKG GREHILGEMN KAMDAPREEV GDYAPKIETI TIPTDKIREV IGTGGKVIRE IVATTGAKVD INDEGTVKVS ASDGAKIKAA IDWIKSITQE AEVGAIYDGK VVKVVDFGAF VNFFGAKDGL VHVSQISNER VAKPSDVLKE GQIVKVKLLG FDDRGKTKLS MKVVDQETGE DLSKKEAVSP EEAVNT
|
| |