Gene Caul_0009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0009 
Symbol 
ID5897721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp13433 
End bp16258 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content72% 
IMG OID641560492 
ProductPII uridylyl-transferase 
Protein accessionYP_001681645 
Protein GI167643982 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGTC GCCTTCGCCC CACCCCGCTG GAACATGTCG TCGACGGGTA CGCCCTGCGC 
GCGCGCCTGT CGGCCGCCGC CCTCGACCTG ATCGGCGACG AGGCCGCCCA GCGCGCCCGC
GCCATCGAGA TCCTCAAGCA GGCGCTGTTT CGCGGCCGGA TGATCGCCAA GGAGCGGCTG
GAAAACGGGG CCGGCGGGCT GGAGACGGCC CGGCTGCTCA GCGGCGTCAC CGACGAGGTG
ATCACCGCCC TCTACGACTT CACCACCGTC CACGTGTTCC GGGCCCGCAA CCCGACCGAG
GGCGAGCGCC TGTGCCTGCT GGCCGTCGGC GGCTACGGCC GGCGCACCCT GGCGCCGTTC
AGCGACATCG ATCTGCTGTT CCTGCGGCCC TACAAGCAGA CCCCGCACGC CGAGAGCGTG
ATCGAGTTCA TGCTCTATGC GCTGTGGGAC CTGGGCTTCA AGGTCGGCCA CGCCTCGCGC
ACCATCGAGG AGTGCGTGCG GCTCTCCAAG GAAGACTTCA CGATCCGCAC GACGATCCTG
GAGGCCCGCC GGCTGACCGG CGACGAGCGC CTGGCCGCCG AGCTGAAGAG GCGCTTCCAG
GACGATGTGA TGAAGGGCAC CGGCGCCCAG TTCGTGGCCG CCAAGCTGAA GGAGCGCGAC
GACCGCCAGG CCCGGGCCGG GGCCAGCCGC TACATGGTCG AGCCCAACGT CAAGGAGGGC
AAGGGCGGCC TGCGCGACCT GCACACCCTG ATGTGGATCG CCGAATATCT GCACCCGGTC
GACCGGCCCG AGGACGTCTT CCTGCTGGAG GTGTTCGACC GCCGCGAGGC CAAGGGCTTC
ATCCGCGCCT TCGACTTCCT GCACGCGGTG CGCGCCCACC TGCATTTCGC CACCGGCCGG
CCGGAAGAGC GCCTGACCTT CGACCTGCAG CCCGAGATCG CCCGCCGCAT GGGCTATGGC
GACCGCGGCG ACGCCCCGGC GGTCGAGCGC TTCATGCGCC GCTACTTCCT GGTCGCCAAG
GAGGTCGGAG CCCTGACCCG CGCCTTCTCG GCCAAGCTGG AGGCCGAGCA CTTCAAGCAC
GAGCCCAAGG GCATCTCCCG CTTCCTGCCG GGCGGCGGCA AGCCCAAGCG CAAGGCGCTG
GACGTCGCCG GCTTCTACGA GGACGGCGGC CGGCTCAATA TCGACGGCCC CGAGGTGTTC
GAGCGCGATC CGGTCAACCT GATCCGGCTG TTCAAGACCG CCGACGAGCG CGACCTGGAC
CTGCATCCCG ACGCCTTCAC CTCGGTGACC CGCAACCTGC ACCTGATCAC CTCGAAGGTG
CGCCGCGACC CCAACGCCAC CAAGGCCTTC CTCGAGCTGC TGGCCTACGG CAAGCGCTCC
TACCGCACCC TGACCCTGAT GAACGACGCG GGGGTGCTGG GCCGGTTCGT CCCGGAATTT
GGCCGCATCG TCGCCCAGAT GCAGTTCAAC ATGTACCACT CCTACACGGT GGACGAGCAC
ACCCTGCGGG CCGTGGGCGT CATCGGCGAC ATGGCCGCCG GCCGCCTGGT CGACGACCAT
CCGCTGGCCG TCTCGATCCT GCCGCTGATC GAGGACCGCG AGGCCCTGTT CCTGGCCATG
CTGCTGCACG ACACCGGCAA GGGCGGGGTG GGCGGCCAGG AGAAGGCCGG GGCCCGCAGC
GCCCGCAGCG CCTGCGAGCG CCTGGGCGTC GACCGGCTGA AGGTCGAGCT GGTGGCCTGG
CTGGTCGAGA ACCACCTGGT GATGAGCGAC TTCGCCCAGA AGCGCGACGT GGCCGATCCT
GGCACGGTCG CCGCCTTCGC CCGCATCGTC GAGACCCCCG AGCGCCTGCG CCTGCTGCTG
GTGATCACCG TCGCCGATAT CCGCGCCGTT GGGCCGGGCG TCTGGAACGG CTGGAAGGGC
CAGCTTCTCC GAGAGCTTTA CAACGCCACC GAGGCCGTCT TCCGGGGCGG GCGCGGCAGC
GACGCCGCCG CCAGCGTCCA GCGCCATCAG GAAGCCGCCG CCGAGGCCGC GCGCGAGGCC
CTGGTCGAGG CCGATCCCGC CGCCAAGGGC TGGGCCCAGG CCATGGAGGC GGCCTATTTC
GGGGCCTTCT CGCTGCAGGA CCTGCAGGAC CACGCGGCCC TGGCCCGTCG CGCCGCCATC
CAGGGCGGGG CCGCCGCCGA GGGCCGCGTG CCGGTGGGCG CCAACGCCGC CGAGATCGTC
ATCGCCGCCA AGGACCGGCG GGGGCTGTTC GCCGACCTCG CCCTGGCCAT CTCCTCCCTG
GGGGGAAATG TGGTCGGCGC CCGGGTCTTC ACCTCGCGCC AGGGCCAGGC CCTGGACGTC
TTCCATGTGC AGGACGTGAC CGGCGCGGCC CTGGGGTGCG AGAACCCGCG CGTCCTGCGC
CGCCTGGCCG ACGCCCTGGA GGCGGCCGGA CGCGGCGAAC CCCTGGTCAT CGAGCCCCGT
CGCGGCGGCG AACAGTCGCG TACCGCCGCC TTCTCGATCG CCCCGACCGT GGTGATCGAC
AACGAGGCCT CCAACGAGGC CACCGTCGTC GAGGCCTCGG GCCGCGACCG TCCCGGCCTG
CTGCAGGCCC TGGCCCGCAC CCTGGCCGAC AACGGCCTGT CCATCCAGTC GGCCCACATC
GACGGCTACG GCGAGCGGGC GGTCGACGCG TTCTACGTGC AGACGTCCGA GGGCGGGAAG
GTCGCCGACG CCAAGAAGGT CACGGCCCTG AAGGCGGATC TGCTGGCGGC GTTGGAGCAG
AACGAGGCCG GGGCGCCGAA CACGCGGCCG GGGCTGAAGC GGGCGCGGGC GAGCGTGGCG
CGGTAG
 
Protein sequence
MPRRLRPTPL EHVVDGYALR ARLSAAALDL IGDEAAQRAR AIEILKQALF RGRMIAKERL 
ENGAGGLETA RLLSGVTDEV ITALYDFTTV HVFRARNPTE GERLCLLAVG GYGRRTLAPF
SDIDLLFLRP YKQTPHAESV IEFMLYALWD LGFKVGHASR TIEECVRLSK EDFTIRTTIL
EARRLTGDER LAAELKRRFQ DDVMKGTGAQ FVAAKLKERD DRQARAGASR YMVEPNVKEG
KGGLRDLHTL MWIAEYLHPV DRPEDVFLLE VFDRREAKGF IRAFDFLHAV RAHLHFATGR
PEERLTFDLQ PEIARRMGYG DRGDAPAVER FMRRYFLVAK EVGALTRAFS AKLEAEHFKH
EPKGISRFLP GGGKPKRKAL DVAGFYEDGG RLNIDGPEVF ERDPVNLIRL FKTADERDLD
LHPDAFTSVT RNLHLITSKV RRDPNATKAF LELLAYGKRS YRTLTLMNDA GVLGRFVPEF
GRIVAQMQFN MYHSYTVDEH TLRAVGVIGD MAAGRLVDDH PLAVSILPLI EDREALFLAM
LLHDTGKGGV GGQEKAGARS ARSACERLGV DRLKVELVAW LVENHLVMSD FAQKRDVADP
GTVAAFARIV ETPERLRLLL VITVADIRAV GPGVWNGWKG QLLRELYNAT EAVFRGGRGS
DAAASVQRHQ EAAAEAAREA LVEADPAAKG WAQAMEAAYF GAFSLQDLQD HAALARRAAI
QGGAAAEGRV PVGANAAEIV IAAKDRRGLF ADLALAISSL GGNVVGARVF TSRQGQALDV
FHVQDVTGAA LGCENPRVLR RLADALEAAG RGEPLVIEPR RGGEQSRTAA FSIAPTVVID
NEASNEATVV EASGRDRPGL LQALARTLAD NGLSIQSAHI DGYGERAVDA FYVQTSEGGK
VADAKKVTAL KADLLAALEQ NEAGAPNTRP GLKRARASVA R