Gene Information |
Locus tag | Caul_3875 |
Symbol | |
ID | 5901337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4194930 |
End bp | 4196279 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564397 |
Product | membrane dipeptidase |
Protein accession | YP_001685499 |
Protein GI | 167647836 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
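The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caul_3875.
length = gene_length(4194930, 4196279)
print(length, protein_length(length))  # prints "1350 449"
```

Both results match the Gene Length (1350 bp) and Protein Length (449 aa) fields listed in the record.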
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGGGT CGTTCTGCAG CGCCGGCGCG GCCCGAGGGC CGGAGGGCGC TTGCGCCCTT GACCGCACCC CTCCGATCCC CGAGCGTGGC CGCAACGCCA TTCGCTTCGG AGCCAAGATG TCCCGCCTGC TGACCGCCCT GCTCGGCTCC GCCGCCCTGT TCGCCACCGC CGCACACGCC ACCGACACGC CCAGGCCGGG CGAGGTCTCC AAGCAGGATA GGGTTCTGCA CGAAAAGGTC CTCACGCTCG ACACGCACCT GGACACCCCC GAGCACTTCG CCCGCCGGGG TTGGAGCATG ATGGACCGGC ATGTCGTCAC CGAGGACGGC ACCCAGGTCG ACCTGCCGCG AATGAACGCC GGCGGCCTAG ATGGGGGGTT CTTCGTGATC TATACCACCC AAGGGCCTCT GACCGCCGAG GGCTATCGCG GCGCGCGTGA CTTCGCGCTC GAACGCGCCA CCGAGATCCG CGAGATGGTC GCCGCCCATC CCGACAAGTT CGAGCTGGCC TACACCGCCG ACGACGCCGA GCGGATCAAC AAGGCCGGCA AGAAGTTCGT CTTCCAGAGC ATCGAGAACA GCTGGCCGAT GGGTGAGGAT CTCACCCTGA TGCGGACCTT CTACGCCACC GGCGTGCGGA TGGCCGGACC GGTCCACTTC CGCAACAACC AGTTCGCCGA CAGCTCGACC GACAAGCCGA TCTGGCACGG CTTCTCGCCG CTGGGCCTGC GCTGGCTGGC CGAGGCCAAC CGGCTGGGGA TCCTGATCGA CGTCAGCCAC GCCTCCGACG ATGTGGTCGA CCAGGCCGTG GTGCTGTCCA AGGTCCCGAT CATCGCCTCG CACTCCGGCG CCAAGGCGGT CTATGACGCC GCCCGCAATC TCGACGACGG GCGGCTGAAG AAGATCGCCG ACGCGGGCGG GGTGATCTGC ATCAACTCGG TCTATCTGAA GGCCACGCCC ACCAGCCCAG AGCGCAAGGC CGCGTTCGAG GCTCTGGGCA AGGCCCCCGA CAGCGAGACG GCGAGCGAAG CCGAGATCGT CGCCTTCATG AAGAAGAAGG TCGAGATCGA CGCCAAGTTC CCGCCGGTCC GCGCCTCGTT CGAGGACTTC ATGGCCAGCC TGACCCACAC CCTCAAGCTG GTCGGCCCCG AGCACGTCGG CATCGGCGCC GACTGGGACG GCGGCGGCGG CGTGATCGAC TTCGAGGACG TCGCCGACCT GCCCAAGGTC ACCGCCCGGC TGAAGGCGGC CGGCTACACC GACGCCGACG TGGCGGCGAT CTGGGGCGGC AACGTGCTGC GCGTGGTGAA GCAGGCGCAG GACTACGCGA AGGCGGCAGC GGCCAAGTAA
|
Protein sequence | MPGSFCSAGA ARGPEGACAL DRTPPIPERG RNAIRFGAKM SRLLTALLGS AALFATAAHA TDTPRPGEVS KQDRVLHEKV LTLDTHLDTP EHFARRGWSM MDRHVVTEDG TQVDLPRMNA GGLDGGFFVI YTTQGPLTAE GYRGARDFAL ERATEIREMV AAHPDKFELA YTADDAERIN KAGKKFVFQS IENSWPMGED LTLMRTFYAT GVRMAGPVHF RNNQFADSST DKPIWHGFSP LGLRWLAEAN RLGILIDVSH ASDDVVDQAV VLSKVPIIAS HSGAKAVYDA ARNLDDGRLK KIADAGGVIC INSVYLKATP TSPERKAAFE ALGKAPDSET ASEAEIVAFM KKKVEIDAKF PPVRASFEDF MASLTHTLKL VGPEHVGIGA DWDGGGGVID FEDVADLPKV TARLKAAGYT DADVAAIWGG NVLRVVKQAQ DYAKAAAAK
|
| |
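The sequences above are printed in space-separated groups of ten characters. As a hedged sketch of how such a field might be cleaned up and sanity-checked before further use, the helpers below strip the grouping whitespace, compute the G+C fraction, and test whether a nucleotide string looks like a complete CDS. The helper names are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at the end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First two groups of the gene sequence, used only to demonstrate the cleaning step.
fragment = clean_sequence("ATGCCGGGGT CGTTCTGCAG")
print(len(fragment), gc_fraction(fragment))
```

Applied to the full Gene sequence field, `gc_fraction` can be compared against the GC content value listed in the record, and `len(clean_sequence(...))` against the Gene Length field.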