Gene Caul_3734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3734 
Symbol 
ID5901196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4046903 
End bp4049827 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content70% 
IMG OID641564257 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001685359 
Protein GI167647696 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.13373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCG TTTCCCGCCT GGCGCTCGTT TTCTCGGCCG GGCTGAGTCT TTCCGCCTGC 
GCGACCCTGT CACACTTGAA GCCCGGCGGC GGCGATCGTC CGGTCACCGC CAAGGCCCCG
GCTCCCGCCC AGGCTCCCCG CGCGCCCGTC GCGTCCAACG TCCCGACGAT CGACGCGGTG
AAGCCCGGCC AGTGGCCGCA GGCGGTGTCG GACGTCGCGC CCGACCCGCG CGTCCGCTTC
GGCGTCCTGC CCAACGGCCT GCGCTACGCG ATCCAGAAGA ACGCCACCCC GCCCGGCCAG
GCCGCCCTGC GCCTGTGGTT CGACGCCGGC TCGCTGGACG AGACCGACGA GCAGCAGGGC
CTGGCTCACT TTCTGGAGCA CATGGCCTTC AATGGCTCGA AGAACGTGCC CGAAGGCGAG
ATGACCAAGA TCCTCGAGCG CCACGGCCTG GCGTTCGGCG CCGACACCAA CGCCTCGACC
AATTTCGGCG CCACGACCTA TCAGCTCGAC CTGCCCAAGA CCGACGACGA CACAGTCGAC
AGCGCCATGA TGCTGCTGCG CGAAGCGGCG GGCGAGCTGA CCATCGCCCA GGACGCGGTG
GACCGCGAGC GCGGGGTGGT GCTGTCGGAG GAGCGCACCC GCGACAGCCC CGGCTACCGG
GTGTTCGTCA ACACCTTCGG CTTCCAGCTG GAGGGCCAGC GCCCGCCCAA GCGCCTGCCG
ATCGGCAAGA CCGAGATCCT CAAGACCGCC CCCGCCCAGC GGATCCGCGA CTTCTACCAG
GCCTGGTACC GGCCCGAGAA CGCGGTGTTC GTCGCGGTCG GCGACTTCGA CGTCGACGCC
ATGGAGGCCC GGATCAAGGC CCGGTTCGGC GACTGGAAGG GCCAGGGTCA GCCCGGCGTG
AAGCCGGATC TGGGTCCGGT GGCCAAGCGT GGCCTGACCG CCAAGGTGCT GGTCGAGCCG
GGCGCGCAGA CCTCGGTGCA GATGTCGTGG ATCGCGCCGC CCGACCTGGA ACTGGAGACC
CGGGCCAAGG ACGCCCAGGA ACTGGTCAAG GCCCTGGGTT TCGCCGTGCT GAACCGCCGG
CTGCAGGTGC TGACCCGGTC CGACGCGCCG CCGTTCATCG CCGCCGTGGC CCTGCAGAAC
GACCAGGAGC ACGCCGCCCA GATCACCACC CTGGCCGCCA CCGTCCAGCC TGGCGGCTGG
AAAGAGGCGC TGACCGTCTT CGACCAGGAG CAGCGCCGCG TCGTGCAATA CGGCGTGCGC
CAGGATGAGC TGGATCGCGA GATCGCCGCG ATGCGCGCGG GCTTCGTGGC CGCCGCGGCC
GGAGAGGCCA CCCAGCGCAC CACCGCCCTG GCCGGCGCCA TCGTCGGCAC GCTGGACGAC
AAGGAGATCG TCACCAGCCC GTCCCAGAAC CTGGCGGTGT TCGACGAGGC GACCAAGGGC
CTGACCGCCG ACAAGGTCTC GGCCGTGCTG AAGGCCCAGT TCGTCGGCCA GGGCCCGCTG
ATCACCATCC CCACCCCGAC GGCCATCGAG GGCGCGGAGA AGACCGTGAC GGAGGCCTAT
CTGGCCTCGG GCAAGACGCC GGTCGCCGCG CCCGCCGCGC CGGGAACCCT GAACTGGCCC
TATGCCAGCT TCGGCCCGAT CGGCAAGGTC GCCGAGCAGC GCGACGTCAC CGACCTGGAC
ACCGTCTTCG TGCGCTTCGC CAACGGCGTG CGGCTGACCG TCAAGCCGAC CAAGTTCCGC
GACGACCAGA TCCTGGTGAA GGCCCGGATC GGCCATGGCC TGCTCGACCT GCCGGCCAAC
ACCCAGAGCC CGATGTGGGC CGGCTCGGCC TATATCGAGG GCGGCCTCAA GCAGATCAGC
ACCCAGGACA TGGAACGCGT GCTGAACGGC AAGGTCTGGA ACGCCGGCCT GGGGGTCGAG
GATGACGCCT TCAGCTTGTC GGGTCGCACC CGCCCAGAGG ACTTCGGCAC GGAACTCCAG
GTCCTGGCCG CCTACGCCAC CGAAGGCGGC TGGCGCCCCG AGGCCTATAC GCGGATCAAG
ACCTACTACG GCACGATCCA CGATCAGCTG GAGTCCACCC CCAGCGGCGT CATGGGACGC
GACCTGGGCG GATTGCTGCA TGGCGGCGAT GGGCGCTGGA CCTTCCCGAC CCGCCAGCAG
ATCGCCGCCG CCTCGCTCGA CCAACTGAAG GCCTCGGTCT CCGGACCGCT GACCAGCGAC
TCCATCGAGG TGGTGATCGT CGGCGACATC ACGGTCGACA AGGCCATCGC CGGCGTGGCC
GAGACCTTCG GGGCGCTGCC CACCCGCGCC GACACGCCCG TCCCGGCCGG CGCCGCCAAC
GCGCCCTTCC CCGCCCCGTC GCCGACCCCG GTGGTGCGCA CCCACAAGGG CCGCGCCGAC
CAGGGCCAGC TGTTCATGGC GTGGAAGACC GACGACCTGT TCGCCAACCT GCAGCGGGCC
CGCGACACCC AGGTCCTGGC CCAGGTGATG CAACTGCGGC TGACCGACGA ACTGCGCGAG
AAGCAGGGCG CCACCTATTC GCCGTCGGCC TCGGCCGCGG CCAGCGTGGC GTTCAACCAC
TGGGGCTATC TGGCGGTCAG CGTCGAGACC CCGCCCGACA AGATTGACGG CGTCATGGCC
AGCATCCGCC AGATCGCCGC CGACCTACGC GACAAGCCGA TCAGCGAGGA CGAGCTGGAC
CGGGCCAAGA AGCCGCGCAT CGACCAGATC GAGAAGGCCC GCGAGACCAA CGAGTACTGG
CTGGGAACCC TGTCGGGCGC CCAGACCGAT CCGCGCCTGA TCGACGCCAC CCGCTCGGTG
ATCGCCGGCC TGCAACGCGT GTCGGCCGCC GACGTCCAGA AGGCGGCCAA GGATTTCCTC
GGCGACGACA AGTCCTGGAC GATGATCGTG CGGCCGGAGA AATAA
 
Protein sequence
MIRVSRLALV FSAGLSLSAC ATLSHLKPGG GDRPVTAKAP APAQAPRAPV ASNVPTIDAV 
KPGQWPQAVS DVAPDPRVRF GVLPNGLRYA IQKNATPPGQ AALRLWFDAG SLDETDEQQG
LAHFLEHMAF NGSKNVPEGE MTKILERHGL AFGADTNAST NFGATTYQLD LPKTDDDTVD
SAMMLLREAA GELTIAQDAV DRERGVVLSE ERTRDSPGYR VFVNTFGFQL EGQRPPKRLP
IGKTEILKTA PAQRIRDFYQ AWYRPENAVF VAVGDFDVDA MEARIKARFG DWKGQGQPGV
KPDLGPVAKR GLTAKVLVEP GAQTSVQMSW IAPPDLELET RAKDAQELVK ALGFAVLNRR
LQVLTRSDAP PFIAAVALQN DQEHAAQITT LAATVQPGGW KEALTVFDQE QRRVVQYGVR
QDELDREIAA MRAGFVAAAA GEATQRTTAL AGAIVGTLDD KEIVTSPSQN LAVFDEATKG
LTADKVSAVL KAQFVGQGPL ITIPTPTAIE GAEKTVTEAY LASGKTPVAA PAAPGTLNWP
YASFGPIGKV AEQRDVTDLD TVFVRFANGV RLTVKPTKFR DDQILVKARI GHGLLDLPAN
TQSPMWAGSA YIEGGLKQIS TQDMERVLNG KVWNAGLGVE DDAFSLSGRT RPEDFGTELQ
VLAAYATEGG WRPEAYTRIK TYYGTIHDQL ESTPSGVMGR DLGGLLHGGD GRWTFPTRQQ
IAAASLDQLK ASVSGPLTSD SIEVVIVGDI TVDKAIAGVA ETFGALPTRA DTPVPAGAAN
APFPAPSPTP VVRTHKGRAD QGQLFMAWKT DDLFANLQRA RDTQVLAQVM QLRLTDELRE
KQGATYSPSA SAAASVAFNH WGYLAVSVET PPDKIDGVMA SIRQIAADLR DKPISEDELD
RAKKPRIDQI EKARETNEYW LGTLSGAQTD PRLIDATRSV IAGLQRVSAA DVQKAAKDFL
GDDKSWTMIV RPEK