Gene Caul_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0107 
Symbol 
ID5897819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp116410 
End bp119274 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content70% 
IMG OID641560591 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001681743 
Protein GI167644080 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.224669 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGTC CCGCCGCCAA GGCCGCCCTC GTGGCGCTGG CCCTCTCGAC CACCGCCCTT 
TCGTCGATGA CCTTGGCGCC CTTGGCCCAC GCCCAGACGC AAGCCAAGCC GGCCGCCGCC
GGGGTCGCCG TGCCGCCGAT CGTCTACAAG GAACGGACCC TGGCCAACGG CATGAAGGTC
TACACCTCGC GGGACGCGAC CACGCCCAAC GTGACGGTGC AGGTCTGGTA CGGGGTCGGC
AGCAAGGACG ATCCGGAGGG CCGCTCGGGC TTCGCGCACC TGTTCGAACA CCTGATGTTC
AAGTCGACGC GCAACATGCC CAACGAGGCC TTCGACCGCC TGACCGAGGA CGTCGGCGGC
TTCAACAACG CCTCCACCTA CGACGACTTC ACCAACTATT ACGAGGTCGT GCCCGCCAAC
CACCTGCAGC GCCTGCTATG GGCCGAGGCT GAGCGCCTGG GCTCGCTGGT CATCAACGAC
GCAGTGTTCA AGTCCGAGCG CGACGTGGTC AAGGAAGAGC TGCGCCAGCG GGTTCTGGCC
AATCCCTATG GGCGGTTCTT CAACCTCTAC ATCACCCAGG CCTCGTTCGC GCAGCATCCC
TACAAGCGCC CGGGCATCGG CTCGATCGAG GAGCTGGACG CCGCCACGGT CGACGACGTG
CGCGCCTTCC ACGCCGCCTA TTACCGCCCC GACAACGCCG CCCTGATCGT GGTCGGCAAC
TATGACGAAG CCCAGCTCAA CGCCTGGATC GACCAGTACT TCGCGCCGCT GAAGACCCCG
GCCGGGGCGA TCAAGCCGGT GAGCGTGGTC GAGCCGCCCC GCGCCGGACC CAAGACCGTC
ACCACCTACG GCCCCAACGT GCCGCTGCCG GGCGTGGCCA TGACCTGGCT GGCTCCGGCC
GCCGCCGATC CGGACGCCCC GGCCCTGTCT GTGCTGGACG CCATCCTGTC GGCCGGCAAG
TCGTCGCGCC TCTACAACAG CCTGGTCTAC GACCAGCAGA TCGCTCAGCA GATCTTCTCC
TCGACCAGCA CCAACGCTCA GCCCGGCATC TTCTACGTCG GCGCGATCAT GGCGGGCGGC
AAGACCGTGG AGCAGGGCGA GGCCTCGCTG TCGGCCCAGG TCGCCAAGCT GCGCGACGCC
CCGCCGACGC CCGCCGAACT GGCCGAGGCC AAGGCCGGCC TGCTGGCCGA CGCCGTGCGC
GGTCGCGAGA CCATCGACGG CCGCGCCTTC GCCATCGGCT ACGCCCTGCG CACCGAGGGC
GACGCCCAGC GCGCCAACAC CGACCTGGCC GCCCTGCAGG CGGTCACCGC CGCCGACGTC
CAGCGCGTGG CCAGGAAGTA CCTGACCGAC AACGGCCGCA CGACGATCCG CTACCTGCCG
GAATCGTCCA AGCCCGCCGG CGCCAAGGAC CCGTTCGAGG CGCCCAAGCC CACCGCCTCG
GTCAGCTACG GCGGCCCGGT CACCACCCTG GCCCCCGAGG GCGAACGCAT CGCCCCGCCG
GCCCCGGGCA AGCCGGTGTC GCCGGTCCTG CCCTCGCCCG TCGAGAAGAC CCTGGCCAAC
GGCCTGCGGG TCATCGTCGC CAAGTCCAGC GACCTGCCGC TGATCACCGC CGACCTGACC
GTCAAAGGCG GCTCGGGAGC CGATCCCGCC GGCCTGGCGG GCGTCTCCAG CCTGACCGCC
GAGCTGCTGA CCGAGGGCAC CAAGACCCGC ACCGCCACGC AAGTCGCCGC CGCCACCGAG
GCCCTGGGCG CCAATCTGGA GGCCGGTTCG GGCTGGGAGG CCTCGTCCCT GACCCTCAGC
GTCATCGCCG ACAAGGCTCC GCAAGGCCTG GCGATCATGG CCGACGTGGC CGAGAACCCG
GCCTTCAAGG TCGAGGAGCT GGAGCGGGTG AAGACCGAGG CCCTGGACGG GCTGTCCGTC
GCGTTCCAGC GCCCGGGCAG CGTCGCCGGC TTCGTCGTGC CGACCGTGAT CTATGGCGGC
TCAGGCTTTG GCCACGTCTC CGGCGGCACA CCCGGCTCGC TGCCGAAGAT CCAGCGCGAC
GCCCTGGTCA AGACCCACGC CGCCCACTGG CGTCCGGACA ACGCCATCCT GGTGCTGACC
GGCGACCTGA CGCCGGAGCA GGGCTTCGCC TTGGCCGAGA AGGCGTTCGG CGGCTGGGCC
AAGCCGGCCG GTCCGCCGCC GGCCCCGGTC AAGGGTCCCG CCGGCTATGC GCCCAGGAAC
ATCGTCATCG ACCTGCCGGG CACCGGCCAG GCCGCGGTCG TGGTGACCAA GCCGGCGATC
CTGCGCGCCG ACCCGCGCTA CTATGCCGGC CTGGTGGCCA ACGGCGTGCT GGGCGGCGGC
TATTCCTCGC GCCTGAACCA GGAGATCCGC ATCAAGCGCG GCCTGTCCTA TGGCGCCGGC
TCCAGCCTGT CGCCGCGCGC GGCGATCGGC GGTTTCTCGG CCAATGTCCA GACCAAGAAC
GAGTCCGCCG CCCAGGTGGT GACCCTGATC AAGGGCGAAC TGAGCCGGCT GGGCGCCGAG
CCCACCTCGA CCGCCGAGCT GGCGGCCCGC AAGTCGGTGC TGGTCGGCGA CTTCGGCCGC
GACCTGGGCA CGTCGGGCGG CTTGGCCGAC ATCCTGGGCA ACCTGGCCGT CTACGGGGTC
CCACTGAACG AGATCCAAGC CTATACCGGC AAGGTCGAGG CGGTGACCGC CGCCGACGTC
CAGGACTTCT CCAAGGCGGT GCTCGACCCG GCCCAGACCA GCGTCATCGT GGTCGGCGAC
GCCAAGACCT TCGGCGACAC GGTCAAGGCG GTGCTGCCGG GCGCGACCGA GATCCCGATC
GATCAGTTGG ATCTCGACAG CCCCACCTTG AAGAAAGCCA AGTAG
 
Protein sequence
MIRPAAKAAL VALALSTTAL SSMTLAPLAH AQTQAKPAAA GVAVPPIVYK ERTLANGMKV 
YTSRDATTPN VTVQVWYGVG SKDDPEGRSG FAHLFEHLMF KSTRNMPNEA FDRLTEDVGG
FNNASTYDDF TNYYEVVPAN HLQRLLWAEA ERLGSLVIND AVFKSERDVV KEELRQRVLA
NPYGRFFNLY ITQASFAQHP YKRPGIGSIE ELDAATVDDV RAFHAAYYRP DNAALIVVGN
YDEAQLNAWI DQYFAPLKTP AGAIKPVSVV EPPRAGPKTV TTYGPNVPLP GVAMTWLAPA
AADPDAPALS VLDAILSAGK SSRLYNSLVY DQQIAQQIFS STSTNAQPGI FYVGAIMAGG
KTVEQGEASL SAQVAKLRDA PPTPAELAEA KAGLLADAVR GRETIDGRAF AIGYALRTEG
DAQRANTDLA ALQAVTAADV QRVARKYLTD NGRTTIRYLP ESSKPAGAKD PFEAPKPTAS
VSYGGPVTTL APEGERIAPP APGKPVSPVL PSPVEKTLAN GLRVIVAKSS DLPLITADLT
VKGGSGADPA GLAGVSSLTA ELLTEGTKTR TATQVAAATE ALGANLEAGS GWEASSLTLS
VIADKAPQGL AIMADVAENP AFKVEELERV KTEALDGLSV AFQRPGSVAG FVVPTVIYGG
SGFGHVSGGT PGSLPKIQRD ALVKTHAAHW RPDNAILVLT GDLTPEQGFA LAEKAFGGWA
KPAGPPPAPV KGPAGYAPRN IVIDLPGTGQ AAVVVTKPAI LRADPRYYAG LVANGVLGGG
YSSRLNQEIR IKRGLSYGAG SSLSPRAAIG GFSANVQTKN ESAAQVVTLI KGELSRLGAE
PTSTAELAAR KSVLVGDFGR DLGTSGGLAD ILGNLAVYGV PLNEIQAYTG KVEAVTAADV
QDFSKAVLDP AQTSVIVVGD AKTFGDTVKA VLPGATEIPI DQLDLDSPTL KKAK