Gene Caul_4531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4531 
Symbol 
ID5901992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4903265 
End bp4905094 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content68% 
IMG OID641565050 
Productpeptidase M14 carboxypeptidase A 
Protein accessionYP_001686149 
Protein GI167648486 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2866] Predicted carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.417367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAGA CATGTCTGGC GGCCCTGGCC GTCGCAATGA CGGCGGGCGT CACCATGGCG 
GAAACTCCAA AGGCTCCTTG GGACTCTGAA ATCCTTCCCC CAACCCTGGC CTGGCATGGC
GCCAGCGAAA AGCTGGTCGC CAAGCCGTCC GACCCGTGGA TCACGCCGTC TGAGCTGACG
GGCCTCACCG CCTCGCCCGA CTATGCCGAG ACCCGCGCCT GGCTGGAAAA GCTGGCCGCC
GCCTCGCCGC TGATCCGCAT GGAGAGCTTT GGCAAGTCGC CCGAGGGCCG CGACCTGCTG
GTGCTCTACG TCTCCAAGGA CGGCGCGACG TTCGATCCGA ACAAGCCCGT GCTGCTGGCC
CAGGCCGGCA TCCATCCAGG TGAGATCGAC GGCAAGGACG CCGGCATGAT GCTGCTGCGC
GACATCGCCT TCCACGGCAA GGACGCCCTG CTGGACAAGG TCAACCTGGT CTTCGTACCG
ATCTTCAGCG TCGACGGCCA CGAACGCGCG TCGCCCTACA GCCGCCCCAA CCAGCGCGGT
CCGGCCATCC AGGGCTGGCG GCACACGGCC CAGAACCTCA ATCTGAACCG CGACTACGGC
AAGCTCGACT CGCCGGAGAT GCGGGCGATG ATCGCCCTGA TCGATCGCGT CAAGCCCGAC
CTCTACATGG ACATCCACGT CACCGACGGC GAGGATTATC AGTACGACGT CACCTATGGC
TGGGCTGGCT ACAAGGGCGA CTTCGCGCGG TCCAAGGCGA CCAGCGCCTG GCTGGACAAC
ACCTACCTGC CGGCCACCGA GACCGGGCTG AAGGCGGCGG GGCACATTCC CGGCCAGCTT
GTCTTCGCGC TCGATGACCG GGACCCTAAG AAGGGCCTGG GCGCCTCGCC GGCCAGCTTG
CGCTACTCCA ACGGCTATGG CGACGCCGCG CGGATCGCCA CGGTTCTGGT CGAGAACCAT
TCCCTTAAGC CCTACAGGCA GCGCGTGCTG GGAACCTATG TGCTGCTGGA AGAAAGCCTG
AAGATCCTCG CCAAGGACGG CGCCGCCCTG CGCGCCGCCG AGCTTCAGGA CCGCGCCGCG
CGCCCCGCCG TGGTCGAGGC CAATTTCGGG ACCAGTTTCG GGGGCGGCGG CGACAAGCCG
CTACGCACCG TGAGCTTCCT GGGCGTGCAG TACGAGACCT ACGACTCCCC CGCCTCGGGC
GGCAAGGAGG TGCGCTGGCT GGGCAAGCCC GATCCCACGC CCTGGTCGCT GCCGCTCTAT
GCCGCCGAGC CGTCGCTGCA ACTGACCCCG GCCAAGGCCT ACTGGATCTC CTCGGCCAAG
CCGGACGTCA TCGAGCGCCT GCGCATCCAT GGCGTCCAGA TGGAGACCCT GGCCGCGCCC
AGGGCGGTGT CGGTCGACAT GATCCGCTTC ACCGCCCCCA AGCTCGCCCC CCGCGCCAGC
GAGGGCCATG TCGAGATCGC CGCCGGCCCG GTCACCCACG AGACCCGCTC GGTCACCTTC
CCCGCTGGCT CGGTGCGCGT TTCCACCGAC CAGCCGCTGG GCGAGATGGT GACCCTGCTG
CTGGAGCCGC AAAGCGACGA GAGCTTCTTC GCCTGGGGGA TGTTCCCCGA GGTGCTGCAG
CGGGTCGAAT ATATCGAGGG CTACGCCATC GCTCCGCTGG CCGAAAAGAT GCTGGCCGCC
GATCCCAAGC TGAAGGCGCA GTTCGAGGCC AGGCTGGCGG CGGATCCGAA GTTCGCGGCC
GATCCGGACG CGCGCCTGGC CTGGTTCTAC GCCCGCACGC CCTACTACGA CGACCACTAC
CGGCTCTATC CGGTCGGGCG GGAGCTGTAG
 
Protein sequence
MLKTCLAALA VAMTAGVTMA ETPKAPWDSE ILPPTLAWHG ASEKLVAKPS DPWITPSELT 
GLTASPDYAE TRAWLEKLAA ASPLIRMESF GKSPEGRDLL VLYVSKDGAT FDPNKPVLLA
QAGIHPGEID GKDAGMMLLR DIAFHGKDAL LDKVNLVFVP IFSVDGHERA SPYSRPNQRG
PAIQGWRHTA QNLNLNRDYG KLDSPEMRAM IALIDRVKPD LYMDIHVTDG EDYQYDVTYG
WAGYKGDFAR SKATSAWLDN TYLPATETGL KAAGHIPGQL VFALDDRDPK KGLGASPASL
RYSNGYGDAA RIATVLVENH SLKPYRQRVL GTYVLLEESL KILAKDGAAL RAAELQDRAA
RPAVVEANFG TSFGGGGDKP LRTVSFLGVQ YETYDSPASG GKEVRWLGKP DPTPWSLPLY
AAEPSLQLTP AKAYWISSAK PDVIERLRIH GVQMETLAAP RAVSVDMIRF TAPKLAPRAS
EGHVEIAAGP VTHETRSVTF PAGSVRVSTD QPLGEMVTLL LEPQSDESFF AWGMFPEVLQ
RVEYIEGYAI APLAEKMLAA DPKLKAQFEA RLAADPKFAA DPDARLAWFY ARTPYYDDHY
RLYPVGREL