Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1121 |
Symbol | |
ID | 5898576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1189947 |
End bp | 1190969 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561603 |
Product | peptidase S51 dipeptidase E |
Protein accession | YP_001682749 |
Protein GI | 167645086 |
COG category | [P] Inorganic ion transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4242] Cyanophycinase and related exopeptidases |
TIGRFAM ID | [TIGR02069] cyanophycinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.575257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00178513 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTTTGC GTTGGATGTC GGCGGTCGCG ATGGCGGCGA CCATGATCAT GCCGGTGATG GCGCGGGCGG CCGATGCGCC GGCGGCGGGG CCCGACGCGA CCGGACCCGG CTACGAATAC TACGCCATCG GCGACGTGAA GGCCCCGACG CCAGGCAAGA CCGGCCCGCT GCTGGCCCTG ATGGGCGGCG GGGACTGGCC GCTGGAAGCC TTCCGCCAGT TCGTCCAGCA ATCGGGCGGC GGCCACATCG TCGTGCTGCG GGCGCGCGGC GGCCGTGAGC TGCAGGACGA GATCTACAAC GATGTCGGCG GCGTGCTCTC CGTCGAGACC CTGGTGATCC ACGACGAGGA CGCCGGCGAG GACCCCAAGC TGCTGGCGAT CATCGCCCAT GCCGACGGCG TCTTCTTCGG CGGCGGCGAC CAGTCCAACT ATGTGCGCGC CTGGAAGGGC ACGGCCCTGA ACAAGGCGCT GGACGCCCAC GTGAAGGCCG GCAAGCCGAT CGGCGGCACC AGCGCGGGCC TGGCGATCCT GGGCGGCTAT GTCTACGGCT GCCTGGATTC GATCAGCCTC ACCTCGCCCG ACGCGTTGAA GAACCCGACC GGACCCAGCG TCACCCTGGT GCGCGACTTC CTGCACCTGC CCTATCTGTC GCATGTGATC ACCGACACCC ACTTCGATAT CCGCGATCGT CAGGGGCGAC TGGTGACCTT CGTCGGCCGC CTGGCCCAGG AAGAGAACGA CCCGACCATC ACCGGCCTGG GCGTTGACCA GGACACCGCC ATGCTGGTCG ACGCCAACGG GATCGGCCGC TTCTACGGCA AGGGCTATGC CTGGCTGGTG CGGCCGATGG CCAAGCCCGC CACCATCGTC GCCGGACGGC CCTTGAACTA CGCGGCCTTC CCGATGGTCG GCATCGGCCC CGACAGCACC CTGGATTTCA AGACCTTCCA AGTGACCAAG CCGGCCTTCA CCTTGACGGC GCGGGTCAAG GACGGCGTCT TGATCCGCAG CGACCGCAAG TAA
|
Protein sequence | MALRWMSAVA MAATMIMPVM ARAADAPAAG PDATGPGYEY YAIGDVKAPT PGKTGPLLAL MGGGDWPLEA FRQFVQQSGG GHIVVLRARG GRELQDEIYN DVGGVLSVET LVIHDEDAGE DPKLLAIIAH ADGVFFGGGD QSNYVRAWKG TALNKALDAH VKAGKPIGGT SAGLAILGGY VYGCLDSISL TSPDALKNPT GPSVTLVRDF LHLPYLSHVI TDTHFDIRDR QGRLVTFVGR LAQEENDPTI TGLGVDQDTA MLVDANGIGR FYGKGYAWLV RPMAKPATIV AGRPLNYAAF PMVGIGPDST LDFKTFQVTK PAFTLTARVK DGVLIRSDRK
|
| |