Gene Caul_2992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2992 
Symbol 
ID5900447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3251084 
End bp3252988 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content67% 
IMG OID641563489 
Productsulfotransferase 
Protein accessionYP_001684617 
Protein GI167646954 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.49268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACATCG GCCGCGACGC TTTATTGCTA GAGGCGAGAG ACCTGCGCTC GCGCCAACGC 
CTGCCAGACG CCTTGGCCGC CCTCGCGCGG CTGGAGACGC TGCATCCCCG GTTCAGCCGT
CTGCATCAGG AGCGCGGCCA CTGTCATGTC CTGCTCGGGG ATGCGCCGGC GGCCATCGAC
GCGCTGCGCG AGGCGGTGCG TCTCAACCCG ACGCTACCGG CGAGCTGGGA CATGCTGGAG
CAGCTTTACC GCATGCAGGG CGCCACCGCC CAGGCGGTCA ATGCGGCGCG GCATCTGGCC
ACGCTGAGAC AGCTGCCGTC GGCCGTGGTG GCGGCCAACG GCCTGTTCGC CGATGGCGAC
CTGTCGCCGG CGGAGGAGAT CCTTCGGGAC TACCTGAGCC GGGATGGCGA CAATGTGGGC
GCCCTGCGTC TGCTGGCGCG GATCCGCATG CAGAGCGACG CGCTCGACGA GGCCGAGGCG
TTGCTGAAAT CCGTGATCGA GCGCGCGCCG GACTACCACG CCGCGCGCCT GGACTACGCC
ATGGTGCTCT TGCAGCGGCA AAAGCCTCTG GAGGCGCGGC GGGAGGCCGA ACATTTGCTC
GCGCACGACC CGGACAACCG CGACTACCTC AAGCAGTACG GCGCGGCCTG CGTCAGCCTG
GGCGACCACG AGCCGGTGAT CGATCTCTAT GAAAGGTTGC TGGCGGGGCG ACCTCAGTCG
GGCGCCGAGG TCGCCGACCT GCGGCTTTGG CGGGCGAACG CCCTGAAGAT CACTGGCCGG
CGGCAGGAGG CCATCGCGGA CTATCGCGCC GCCCTGGCGG CGCGGCCGGA CCATGGCGTT
GCCTGGTTCA GCCTCGCCAA CCTCAAGACC TACCGCTTCA CCGACGATGA CGTCTCGCGA
ATGCAGGTGG CCGAGGCCCA GCCGGGCATC CAGACCATGG ACCGGGTCTA TATGGCCTTC
GCGTTGGGCA AGGCGCTGGA GGACCGGGGC GACTACGCGG CCTCGTGGCG GCGCTACGAG
CGCGGCAATG CGGTGCGGCG CGCGGCCGGC CGCTATCGCC CGGAGATCGC GGAAGCCTGC
GCGCTCCGAT TGAAGCAAAT CTTCACCGCC GATTTCTTCG CCGAACGCGC CGGCTGGGGC
GTGGACGATC CGGCGCCCAT CTTCATCCTG GGCCTACCCC GTTCGGGCTC GACCTTGATC
GAGCAGATCC TGGCCTCCCA TTCCCGCGTG GAGGGCACGC AGGAACTGAC CGAGATCGGC
CGATATGCCG GCGAACTCTG CGGTCGCGAT CCGGATTGCG GTTTGCCACT GGACCCCGAG
GCGTTGTCGC GCCTTAAGGC GGAGGATGTC CGAGCGCTCG GCGAACGCTT CCTGGCCGAA
ACCCGCGCCT ATCGTCGGCT GGGCAGACCG TCCTTCATCG ACAAGATGCC AAATAACTTC
TGGCACATCG GGCTGATCCA CCTGATCCTG CCTCGCGCGA CAATCATCGA TGTGCGGCGC
GAACCGATGG CCTGCTGCTT CAGCAATCTC AAGCAGTTGT TCGGCACGAC CAACCAGGAA
TTCACCTACG GCGTCGACGA CATCGCCCGC TACTACCGCA CCTATCTCGA CGTCATGCGG
CACTGGGGCG ATGTGTTGCC GGAGAGGGTT CTGAAGGTCC GGTACGAGGA CGTGGTCGAG
GATCTCGAAG GCGGCGTGCG GCGTATGCTG GAGCACTGCA AACTGCCCTT CGAGCCGGCC
TGCCTGACCT TCCACGAGAC CAAGCGCAGC GTGCGCACGC CCAGTTCCGA GCAGGTGCGC
CAGCCCATCG GTCGCGAGGG GCTCACGCAA TGGGAGCACT ACGCGCCTTG GCTCAACGAC
CTGCGGGACG CGCTGGGCGA CGCCATGACC GGCTACAGGG ACTGA
 
Protein sequence
MNIGRDALLL EARDLRSRQR LPDALAALAR LETLHPRFSR LHQERGHCHV LLGDAPAAID 
ALREAVRLNP TLPASWDMLE QLYRMQGATA QAVNAARHLA TLRQLPSAVV AANGLFADGD
LSPAEEILRD YLSRDGDNVG ALRLLARIRM QSDALDEAEA LLKSVIERAP DYHAARLDYA
MVLLQRQKPL EARREAEHLL AHDPDNRDYL KQYGAACVSL GDHEPVIDLY ERLLAGRPQS
GAEVADLRLW RANALKITGR RQEAIADYRA ALAARPDHGV AWFSLANLKT YRFTDDDVSR
MQVAEAQPGI QTMDRVYMAF ALGKALEDRG DYAASWRRYE RGNAVRRAAG RYRPEIAEAC
ALRLKQIFTA DFFAERAGWG VDDPAPIFIL GLPRSGSTLI EQILASHSRV EGTQELTEIG
RYAGELCGRD PDCGLPLDPE ALSRLKAEDV RALGERFLAE TRAYRRLGRP SFIDKMPNNF
WHIGLIHLIL PRATIIDVRR EPMACCFSNL KQLFGTTNQE FTYGVDDIAR YYRTYLDVMR
HWGDVLPERV LKVRYEDVVE DLEGGVRRML EHCKLPFEPA CLTFHETKRS VRTPSSEQVR
QPIGREGLTQ WEHYAPWLND LRDALGDAMT GYRD