Gene Caul_3399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3399 
Symbol 
ID5900854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3671375 
End bp3672463 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content73% 
IMG OID641563905 
Productpeptidase M48 Ste24p 
Protein accessionYP_001685024 
Protein GI167647361 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.132097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGCGC TTCTCTATGA CGGTCTCACC GCCCGGCCCT GGCCGGCGCG GGTGAGCCTG 
GACGGCGACC GCGTGGTCGC CGTCAGCGAA GCGCTCGACG CGGCGGGCGC GCCCTGTGCT
CGCCTAGACT GGCCCCTGGC CAAGGTCCGC GAGCAGGAGC GTGACGAGAC CACCTATCGC
CTGTCACTGG GCGGGGGCGA CGCACGGCTG ATCGTCGATA TCCAGGCCTG GCAGGTCCTG
ACCGGTCGCA CGCGCCGTCA CGTCGTCCGG CGTCGACGGG GCGGGGAGTG GCGGGTGATC
GGCGGTCTGG CGGTGGCCGG CCTCTCGGTC ATGGCCTTCG TCTTCATCGT CGTGCCGCTG
GCCGCCGAGC CGCTGGCGCG CTGGACGCCG CCGGGCCTGG AGGCGCGGTT CGGGCGCAGC
ATGCAGGCCC AGGTCCATCT CGCCATGCCG GTTTGCGAGG GCGATCCCCA GGGCGCGGCG
GCGCTGGCGA AGTTGGGGCA AGCCCTGTCG GAGGGCGCCG ACACGCGCTT CCCGATCCGG
GTGCAAGCGG TGCGCGCCCC GTTCGTCAAC GCCTTCGCCC TGCCCGGCGG GACGGTGATG
GTCACCGACG AACTGATCGC CCAGGCCCAT TCGCCCGACG AACTGGCCGG GGTGATCGCC
CACGAGATCG CCCATGTCGA GCGTCGCCAC GTCATGCAGG CGGCCTGGCG CTCGGCGGGC
GCGGGCCTGG TGCTGGACGC CGTGGTCGGC GGCGGCAGCG GCGCGGGGCA GCAGGCGATC
CTGCTGGCCA GCAGCCTGTC CAACCAGCGC TTCAGCCGCA AGCTGGAGGC CGAGGCCGAT
ACGCGGGGCA TGCAGCTGCT GGCGCGGCGC GACATCTCCA GCCAGGGCAT GGCCGACTTC
TTCGACCGGA TGGCCGACCG CAAGGCCGAC GCCCGGGTGC GTCAGGCCGC CGAATGGTTT
TCGTCGCATC CCGACATGGT CGCGCGCGCG GCGCTGGCCA AGGCCGCCGC CCGGCCTGGA
CGGCCGGCCC TGTCGGACGC CGATTGGCGG CGGGTCAAGG CCGTGTGCAA GGCTGGTCGC
AAGCGCTAG
 
Protein sequence
MQALLYDGLT ARPWPARVSL DGDRVVAVSE ALDAAGAPCA RLDWPLAKVR EQERDETTYR 
LSLGGGDARL IVDIQAWQVL TGRTRRHVVR RRRGGEWRVI GGLAVAGLSV MAFVFIVVPL
AAEPLARWTP PGLEARFGRS MQAQVHLAMP VCEGDPQGAA ALAKLGQALS EGADTRFPIR
VQAVRAPFVN AFALPGGTVM VTDELIAQAH SPDELAGVIA HEIAHVERRH VMQAAWRSAG
AGLVLDAVVG GGSGAGQQAI LLASSLSNQR FSRKLEAEAD TRGMQLLARR DISSQGMADF
FDRMADRKAD ARVRQAAEWF SSHPDMVARA ALAKAAARPG RPALSDADWR RVKAVCKAGR
KR