Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2542 |
Symbol | |
ID | 5899997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2756596 |
End bp | 2757813 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641563033 |
Product | membrane dipeptidase |
Protein accession | YP_001684167 |
Protein GI | 167646504 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.51564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGTT CCCTGCTTCT GGCCGCCGTC TCGGTCTTGG CCTTCGCCAC CTCGTCCCAG GCCGCCGACA CCGCCGCCTC GGTCCGCAAG ATCCACGAGG GCCTGCTGAC CCTCGACACC CATCTGGACA CTCCAGCCAA TTTCGGCCGT CCGGGCTGGG ATATCCTGGA CCGGCATGAC GCCGCCAAGG ACGGCTCGCA GATCGATTAT CCCCGCATGG TCGAGGGTGG GCTGGATGGC GGCTTCTTCG CCATCTACAC GCCGCAGGGG CCGCGCACGC CCGAGGCCAC CCGCGCCGCC CGGGACGGCG CCTTGGTCCG CGGCGTCGAG ATCCGCGAGA TGGTGGCCAA GCACGGCGAC AAGTTCGCCC TGGCCCTGAA GGCCGACGAC GCCGCCAAGA TCGCCGCCAG CGGCAAGCGC GTCGTCTTCA TGAGCATCGA GAACAGCTAC CCGATCGACG GCGACGTCAC CCTGCTGTCC AGCTTCTACG CCCTGGGCGT GCGGATCAGC GGCCTGGCCC ACTTCAAGAA CAACGACATG GCCGACAGCT CGACCGACAA GCCCGAGTGG CATGGCCTCA GCCCGCTGGG CAAACAGTTC GTCACCGAGG CCAACCGGCT GGGCGTGGTG CTGGACGGCT CGCATTCGTC CGACGACGTG CTCGATCAAC TGATCGCCCT GTCCAAGACC CCGGTGATCC TGACCCATTC AGGCTGCAAG GCGGTGTTCG ACCATCCGCG CAATGTCGAC GACGCCCGCA TCAAGGCCCT GGCCGACAGC GGCGGGGTGA TCCAGGTCGA CGCCTATTCC AGCTATCTGA TCGACACGCC CAAGAACCCC GATCGCGAGG CCGCCATGGC CGCCCTGATG GCCAAGGTCG GGGCGCGGGC TAAGATGACC GAGGAGCAGC GCGCCGCCTT CATAGCCGAA CGCAACGCCA TCGACGCCAA GTGGCCGGTG ACCAAGGCGA CGTTCCAGGA CTTCATGAAC CACCTCAACC ACGCCCTGAA GGTGGCCGGC GTCGATCACG TGGGCGTCGG CATCGACTTC GACGGCGGCG GCGGCGTCAC CGGCCTGAAC GACGCCTCCG ACTACTGGAA GATCTCCCAG GCCCTGCTGG CCGAGGGCTA CACCCAGGCC GACCTGGAGA AGATCTGGAG CGGCAACGTC CTGCGTCTGC TGCGCGCCGC CGAGGCGGCC AAGGCGCCGG CGGGGTGA
|
Protein sequence | MTRSLLLAAV SVLAFATSSQ AADTAASVRK IHEGLLTLDT HLDTPANFGR PGWDILDRHD AAKDGSQIDY PRMVEGGLDG GFFAIYTPQG PRTPEATRAA RDGALVRGVE IREMVAKHGD KFALALKADD AAKIAASGKR VVFMSIENSY PIDGDVTLLS SFYALGVRIS GLAHFKNNDM ADSSTDKPEW HGLSPLGKQF VTEANRLGVV LDGSHSSDDV LDQLIALSKT PVILTHSGCK AVFDHPRNVD DARIKALADS GGVIQVDAYS SYLIDTPKNP DREAAMAALM AKVGARAKMT EEQRAAFIAE RNAIDAKWPV TKATFQDFMN HLNHALKVAG VDHVGVGIDF DGGGGVTGLN DASDYWKISQ ALLAEGYTQA DLEKIWSGNV LRLLRAAEAA KAPAG
|
| |