Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4531 |
Symbol | |
ID | 5901992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4903265 |
End bp | 4905094 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641565050 |
Product | peptidase M14 carboxypeptidase A |
Protein accession | YP_001686149 |
Protein GI | 167648486 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2866] Predicted carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.417367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAGA CATGTCTGGC GGCCCTGGCC GTCGCAATGA CGGCGGGCGT CACCATGGCG GAAACTCCAA AGGCTCCTTG GGACTCTGAA ATCCTTCCCC CAACCCTGGC CTGGCATGGC GCCAGCGAAA AGCTGGTCGC CAAGCCGTCC GACCCGTGGA TCACGCCGTC TGAGCTGACG GGCCTCACCG CCTCGCCCGA CTATGCCGAG ACCCGCGCCT GGCTGGAAAA GCTGGCCGCC GCCTCGCCGC TGATCCGCAT GGAGAGCTTT GGCAAGTCGC CCGAGGGCCG CGACCTGCTG GTGCTCTACG TCTCCAAGGA CGGCGCGACG TTCGATCCGA ACAAGCCCGT GCTGCTGGCC CAGGCCGGCA TCCATCCAGG TGAGATCGAC GGCAAGGACG CCGGCATGAT GCTGCTGCGC GACATCGCCT TCCACGGCAA GGACGCCCTG CTGGACAAGG TCAACCTGGT CTTCGTACCG ATCTTCAGCG TCGACGGCCA CGAACGCGCG TCGCCCTACA GCCGCCCCAA CCAGCGCGGT CCGGCCATCC AGGGCTGGCG GCACACGGCC CAGAACCTCA ATCTGAACCG CGACTACGGC AAGCTCGACT CGCCGGAGAT GCGGGCGATG ATCGCCCTGA TCGATCGCGT CAAGCCCGAC CTCTACATGG ACATCCACGT CACCGACGGC GAGGATTATC AGTACGACGT CACCTATGGC TGGGCTGGCT ACAAGGGCGA CTTCGCGCGG TCCAAGGCGA CCAGCGCCTG GCTGGACAAC ACCTACCTGC CGGCCACCGA GACCGGGCTG AAGGCGGCGG GGCACATTCC CGGCCAGCTT GTCTTCGCGC TCGATGACCG GGACCCTAAG AAGGGCCTGG GCGCCTCGCC GGCCAGCTTG CGCTACTCCA ACGGCTATGG CGACGCCGCG CGGATCGCCA CGGTTCTGGT CGAGAACCAT TCCCTTAAGC CCTACAGGCA GCGCGTGCTG GGAACCTATG TGCTGCTGGA AGAAAGCCTG AAGATCCTCG CCAAGGACGG CGCCGCCCTG CGCGCCGCCG AGCTTCAGGA CCGCGCCGCG CGCCCCGCCG TGGTCGAGGC CAATTTCGGG ACCAGTTTCG GGGGCGGCGG CGACAAGCCG CTACGCACCG TGAGCTTCCT GGGCGTGCAG TACGAGACCT ACGACTCCCC CGCCTCGGGC GGCAAGGAGG TGCGCTGGCT GGGCAAGCCC GATCCCACGC CCTGGTCGCT GCCGCTCTAT GCCGCCGAGC CGTCGCTGCA ACTGACCCCG GCCAAGGCCT ACTGGATCTC CTCGGCCAAG CCGGACGTCA TCGAGCGCCT GCGCATCCAT GGCGTCCAGA TGGAGACCCT GGCCGCGCCC AGGGCGGTGT CGGTCGACAT GATCCGCTTC ACCGCCCCCA AGCTCGCCCC CCGCGCCAGC GAGGGCCATG TCGAGATCGC CGCCGGCCCG GTCACCCACG AGACCCGCTC GGTCACCTTC CCCGCTGGCT CGGTGCGCGT TTCCACCGAC CAGCCGCTGG GCGAGATGGT GACCCTGCTG CTGGAGCCGC AAAGCGACGA GAGCTTCTTC GCCTGGGGGA TGTTCCCCGA GGTGCTGCAG CGGGTCGAAT ATATCGAGGG CTACGCCATC GCTCCGCTGG CCGAAAAGAT GCTGGCCGCC GATCCCAAGC TGAAGGCGCA GTTCGAGGCC AGGCTGGCGG CGGATCCGAA GTTCGCGGCC GATCCGGACG CGCGCCTGGC CTGGTTCTAC GCCCGCACGC CCTACTACGA CGACCACTAC CGGCTCTATC CGGTCGGGCG GGAGCTGTAG
|
Protein sequence | MLKTCLAALA VAMTAGVTMA ETPKAPWDSE ILPPTLAWHG ASEKLVAKPS DPWITPSELT GLTASPDYAE TRAWLEKLAA ASPLIRMESF GKSPEGRDLL VLYVSKDGAT FDPNKPVLLA QAGIHPGEID GKDAGMMLLR DIAFHGKDAL LDKVNLVFVP IFSVDGHERA SPYSRPNQRG PAIQGWRHTA QNLNLNRDYG KLDSPEMRAM IALIDRVKPD LYMDIHVTDG EDYQYDVTYG WAGYKGDFAR SKATSAWLDN TYLPATETGL KAAGHIPGQL VFALDDRDPK KGLGASPASL RYSNGYGDAA RIATVLVENH SLKPYRQRVL GTYVLLEESL KILAKDGAAL RAAELQDRAA RPAVVEANFG TSFGGGGDKP LRTVSFLGVQ YETYDSPASG GKEVRWLGKP DPTPWSLPLY AAEPSLQLTP AKAYWISSAK PDVIERLRIH GVQMETLAAP RAVSVDMIRF TAPKLAPRAS EGHVEIAAGP VTHETRSVTF PAGSVRVSTD QPLGEMVTLL LEPQSDESFF AWGMFPEVLQ RVEYIEGYAI APLAEKMLAA DPKLKAQFEA RLAADPKFAA DPDARLAWFY ARTPYYDDHY RLYPVGREL
|
| |