Gene Information |
Locus tag | Caul_0009 |
Symbol | |
ID | 5897721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 13433 |
End bp | 16258 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641560492 |
Product | PII uridylyl-transferase |
Protein accession | YP_001681645 |
Protein GI | 167643982 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2844] UTP:GlnB (protein PII) uridylyltransferase |
TIGRFAM ID | [TIGR01693] [Protein-PII] uridylyltransferase |
|
|
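The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the source record.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len, gene_len // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(13433, 16258)
print(gene_len, protein_len)  # 2826 941
```

The result agrees with the Gene Length (2826 bp) and Protein Length (941 aa) rows.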
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGTC GCCTTCGCCC CACCCCGCTG GAACATGTCG TCGACGGGTA CGCCCTGCGC GCGCGCCTGT CGGCCGCCGC CCTCGACCTG ATCGGCGACG AGGCCGCCCA GCGCGCCCGC GCCATCGAGA TCCTCAAGCA GGCGCTGTTT CGCGGCCGGA TGATCGCCAA GGAGCGGCTG GAAAACGGGG CCGGCGGGCT GGAGACGGCC CGGCTGCTCA GCGGCGTCAC CGACGAGGTG ATCACCGCCC TCTACGACTT CACCACCGTC CACGTGTTCC GGGCCCGCAA CCCGACCGAG GGCGAGCGCC TGTGCCTGCT GGCCGTCGGC GGCTACGGCC GGCGCACCCT GGCGCCGTTC AGCGACATCG ATCTGCTGTT CCTGCGGCCC TACAAGCAGA CCCCGCACGC CGAGAGCGTG ATCGAGTTCA TGCTCTATGC GCTGTGGGAC CTGGGCTTCA AGGTCGGCCA CGCCTCGCGC ACCATCGAGG AGTGCGTGCG GCTCTCCAAG GAAGACTTCA CGATCCGCAC GACGATCCTG GAGGCCCGCC GGCTGACCGG CGACGAGCGC CTGGCCGCCG AGCTGAAGAG GCGCTTCCAG GACGATGTGA TGAAGGGCAC CGGCGCCCAG TTCGTGGCCG CCAAGCTGAA GGAGCGCGAC GACCGCCAGG CCCGGGCCGG GGCCAGCCGC TACATGGTCG AGCCCAACGT CAAGGAGGGC AAGGGCGGCC TGCGCGACCT GCACACCCTG ATGTGGATCG CCGAATATCT GCACCCGGTC GACCGGCCCG AGGACGTCTT CCTGCTGGAG GTGTTCGACC GCCGCGAGGC CAAGGGCTTC ATCCGCGCCT TCGACTTCCT GCACGCGGTG CGCGCCCACC TGCATTTCGC CACCGGCCGG CCGGAAGAGC GCCTGACCTT CGACCTGCAG CCCGAGATCG CCCGCCGCAT GGGCTATGGC GACCGCGGCG ACGCCCCGGC GGTCGAGCGC TTCATGCGCC GCTACTTCCT GGTCGCCAAG GAGGTCGGAG CCCTGACCCG CGCCTTCTCG GCCAAGCTGG AGGCCGAGCA CTTCAAGCAC GAGCCCAAGG GCATCTCCCG CTTCCTGCCG GGCGGCGGCA AGCCCAAGCG CAAGGCGCTG GACGTCGCCG GCTTCTACGA GGACGGCGGC CGGCTCAATA TCGACGGCCC CGAGGTGTTC GAGCGCGATC CGGTCAACCT GATCCGGCTG TTCAAGACCG CCGACGAGCG CGACCTGGAC CTGCATCCCG ACGCCTTCAC CTCGGTGACC CGCAACCTGC ACCTGATCAC CTCGAAGGTG CGCCGCGACC CCAACGCCAC CAAGGCCTTC CTCGAGCTGC TGGCCTACGG CAAGCGCTCC TACCGCACCC TGACCCTGAT GAACGACGCG GGGGTGCTGG GCCGGTTCGT CCCGGAATTT GGCCGCATCG TCGCCCAGAT GCAGTTCAAC ATGTACCACT CCTACACGGT GGACGAGCAC ACCCTGCGGG CCGTGGGCGT CATCGGCGAC ATGGCCGCCG GCCGCCTGGT CGACGACCAT CCGCTGGCCG TCTCGATCCT GCCGCTGATC GAGGACCGCG AGGCCCTGTT CCTGGCCATG CTGCTGCACG ACACCGGCAA GGGCGGGGTG GGCGGCCAGG AGAAGGCCGG GGCCCGCAGC GCCCGCAGCG CCTGCGAGCG CCTGGGCGTC GACCGGCTGA AGGTCGAGCT GGTGGCCTGG CTGGTCGAGA ACCACCTGGT GATGAGCGAC TTCGCCCAGA AGCGCGACGT GGCCGATCCT 
GGCACGGTCG CCGCCTTCGC CCGCATCGTC GAGACCCCCG AGCGCCTGCG CCTGCTGCTG GTGATCACCG TCGCCGATAT CCGCGCCGTT GGGCCGGGCG TCTGGAACGG CTGGAAGGGC CAGCTTCTCC GAGAGCTTTA CAACGCCACC GAGGCCGTCT TCCGGGGCGG GCGCGGCAGC GACGCCGCCG CCAGCGTCCA GCGCCATCAG GAAGCCGCCG CCGAGGCCGC GCGCGAGGCC CTGGTCGAGG CCGATCCCGC CGCCAAGGGC TGGGCCCAGG CCATGGAGGC GGCCTATTTC GGGGCCTTCT CGCTGCAGGA CCTGCAGGAC CACGCGGCCC TGGCCCGTCG CGCCGCCATC CAGGGCGGGG CCGCCGCCGA GGGCCGCGTG CCGGTGGGCG CCAACGCCGC CGAGATCGTC ATCGCCGCCA AGGACCGGCG GGGGCTGTTC GCCGACCTCG CCCTGGCCAT CTCCTCCCTG GGGGGAAATG TGGTCGGCGC CCGGGTCTTC ACCTCGCGCC AGGGCCAGGC CCTGGACGTC TTCCATGTGC AGGACGTGAC CGGCGCGGCC CTGGGGTGCG AGAACCCGCG CGTCCTGCGC CGCCTGGCCG ACGCCCTGGA GGCGGCCGGA CGCGGCGAAC CCCTGGTCAT CGAGCCCCGT CGCGGCGGCG AACAGTCGCG TACCGCCGCC TTCTCGATCG CCCCGACCGT GGTGATCGAC AACGAGGCCT CCAACGAGGC CACCGTCGTC GAGGCCTCGG GCCGCGACCG TCCCGGCCTG CTGCAGGCCC TGGCCCGCAC CCTGGCCGAC AACGGCCTGT CCATCCAGTC GGCCCACATC GACGGCTACG GCGAGCGGGC GGTCGACGCG TTCTACGTGC AGACGTCCGA GGGCGGGAAG GTCGCCGACG CCAAGAAGGT CACGGCCCTG AAGGCGGATC TGCTGGCGGC GTTGGAGCAG AACGAGGCCG GGGCGCCGAA CACGCGGCCG GGGCTGAAGC GGGCGCGGGC GAGCGTGGCG CGGTAG
|
Protein sequence | MPRRLRPTPL EHVVDGYALR ARLSAAALDL IGDEAAQRAR AIEILKQALF RGRMIAKERL ENGAGGLETA RLLSGVTDEV ITALYDFTTV HVFRARNPTE GERLCLLAVG GYGRRTLAPF SDIDLLFLRP YKQTPHAESV IEFMLYALWD LGFKVGHASR TIEECVRLSK EDFTIRTTIL EARRLTGDER LAAELKRRFQ DDVMKGTGAQ FVAAKLKERD DRQARAGASR YMVEPNVKEG KGGLRDLHTL MWIAEYLHPV DRPEDVFLLE VFDRREAKGF IRAFDFLHAV RAHLHFATGR PEERLTFDLQ PEIARRMGYG DRGDAPAVER FMRRYFLVAK EVGALTRAFS AKLEAEHFKH EPKGISRFLP GGGKPKRKAL DVAGFYEDGG RLNIDGPEVF ERDPVNLIRL FKTADERDLD LHPDAFTSVT RNLHLITSKV RRDPNATKAF LELLAYGKRS YRTLTLMNDA GVLGRFVPEF GRIVAQMQFN MYHSYTVDEH TLRAVGVIGD MAAGRLVDDH PLAVSILPLI EDREALFLAM LLHDTGKGGV GGQEKAGARS ARSACERLGV DRLKVELVAW LVENHLVMSD FAQKRDVADP GTVAAFARIV ETPERLRLLL VITVADIRAV GPGVWNGWKG QLLRELYNAT EAVFRGGRGS DAAASVQRHQ EAAAEAAREA LVEADPAAKG WAQAMEAAYF GAFSLQDLQD HAALARRAAI QGGAAAEGRV PVGANAAEIV IAAKDRRGLF ADLALAISSL GGNVVGARVF TSRQGQALDV FHVQDVTGAA LGCENPRVLR RLADALEAAG RGEPLVIEPR RGGEQSRTAA FSIAPTVVID NEASNEATVV EASGRDRPGL LQALARTLAD NGLSIQSAHI DGYGERAVDA FYVQTSEGGK VADAKKVTAL KADLLAALEQ NEAGAPNTRP GLKRARASVA R
|
| |
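The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted in as shown (blocks of 10 bases separated by whitespace), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code for internal positions. Alternative start codons such as GTG or TTG are not handled, which does not matter here because this gene starts with ATG. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order. Table 11 matches the standard code for
# elongation codons; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence, between 0 and 1."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
head = "ATGCCCCGTC GCCTTCGCCC CACCCCGCTG"
print(translate(head))  # MPRRLRPTPL
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence (after removing the spaces from both) would confirm that the two rows match, and `round(100 * gc_fraction(full_sequence))` can be compared with the GC content row.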