Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1592 |
Symbol | |
ID | 5899047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1681310 |
End bp | 1683121 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641562080 |
Product | peptidase M24 |
Protein accession | YP_001683220 |
Protein GI | 167645557 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.400079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAGA CCTTCGATGA ATCCACCGAT CCGGGCTTCG GTCCCCGCCA CGTCCCCCTG ATCCGCGCCG CCATGGCGCG GCAGGGCCTG GACGGCTTCC TCGTGCCGCA CGAGGACGAA CACCAGAACG AGTATCTGCC GCCCGCCAAC GATCGCCTGG CCTGGGCCAG CGGCTTCACC GGCTCGGCGG GCGCCGGCGT GATCCTCAAG GACCGCGCCG CCGTGTTCGT CGATGGCCGC TACACCCTGC AGGTCCGCGA CCAGGTCGAC CAGGGCGTGT TCGAGATCCG CGACCTCGTC GAGGGCGGCG TCCCCGCCTA TCTGGAGACC GCCTCCAAGG GCGCGGTGAT CGGCTACGAC GCCCGCCTGC ACAGTCCCCA GGCGCTGGAC GGCCTGAAGG CCGCCGCCGC CAAGGCCGGC GCGGCGCTGA AGCCCGTCGC CGTCAACCCG ATCGACGAGG CCTGGGGCGC CGAACGTCCC GCTCAACCCG CCGCGCCGGT CGTGCCCCAG CCCGTGCAGT ACGCCGGCGA GGAATCGGCC TCCAAGCGCG CCCGCGTCGG CTCGGCCGTC GCGGCCTTGG GCGCCGACGC CGCGGTGATC ACCGCCCCGG CCTCGATCGC CTGGCTGTTC AACATCCGCG GCGGCGACGT CATCCGCTCG CCCCTGCCGC TGGCCCAGGC CGTGCTGCGA GCCGACGGCT CGGCCCGCCT GTTCCTCGAC CCGGCCAAGG TCACCGACGA GTTGCCCGCC TGGCTGGGCA ACCAGGTTTC GCTGGAGGCT CCCGAGGCCC TGGACGCCGC CCTGGCCGAA CTGGCGGGCA AGTCGGTGGT GGTCGATCCC GCCCAATCGT CGGCCTGGTA CTTCGATACG CTGGTCGCCG CCGGCGCCTC GGTGGTCCGC GCCATGGACC CCTGCACCCT GCCCCGCGCC TGCAAGAACC CCGTCGAGAT CGCCGGCACA ATCGAGGCCC ACAAGCGCGA CGGCGCGGCC CTGACCCGAT TCCTCCACTG GCTGGCCACC GAGGGTCAGG TCAATCCGCC CGACGAAAAG GAAGCCGTGG CCAAGCTGGA GGCGTTCCGC GAGGCGACCG GTCTGCTGAA GGACCTCAGC TTCGACACCA TCGGCGCGGC CAACGGCCAC GGCGCCCTGC CCCACTATCG CCCGACCGAG CGCGGCAACA TGCGCGCGAG GCTTGGCTCA TTGCTGCTGG TCGACAGCGG CGGCCAGTAC CTGGACGGCA CCACCGACGT CACCCGCACG GTCGCCATCG GCGAGCCGAC CGCCGAGATG GTCACCCGCA ACACCCTGGT CCTGAAGGGC CACCTGGCCA TCGCCCGCCT GCGCTTCCCG GCCGGCACCA CCGGCTCGGC CATCGACGCC TTCGCCCGCG CCGCCCTGTG GAGCCACGGC CTGGACTACG ACCACGGCAC CGGCCACGGC GTCGGCGTCT ATCTGGGCGT CCACGAGGGT CCGCACCGGA TCTCCAAGGC GCCCAACACG GTTTCCCTGC AGCCGGGGAT GATCGTTTCG AACGAGCCGG GCTACTACAA GGACGGCGAA TACGGCATCC GCATAGAGAA CCTCGAAGTC GTCATGCCGG CCGAGACGGT CGGAACCGGC GACCGCCCGA TGCACCGCTT CCAGGCCCTG ACCCTGGCCC CGATCGACCG GCGGCTGGTG GATAAGAGCC TGCTGTCGGC CGAGGAGATC GCCCAGTTCG ACGCCTATCA CGCGCGGGTC GCGGCGGAGA TCGGGCCGCG CGTGGAACCG GAAATCCGGG CCTGGCTGGA AGAAGTCTGC GCGCCGCTTT AG
|
Protein sequence | MRQTFDESTD PGFGPRHVPL IRAAMARQGL DGFLVPHEDE HQNEYLPPAN DRLAWASGFT GSAGAGVILK DRAAVFVDGR YTLQVRDQVD QGVFEIRDLV EGGVPAYLET ASKGAVIGYD ARLHSPQALD GLKAAAAKAG AALKPVAVNP IDEAWGAERP AQPAAPVVPQ PVQYAGEESA SKRARVGSAV AALGADAAVI TAPASIAWLF NIRGGDVIRS PLPLAQAVLR ADGSARLFLD PAKVTDELPA WLGNQVSLEA PEALDAALAE LAGKSVVVDP AQSSAWYFDT LVAAGASVVR AMDPCTLPRA CKNPVEIAGT IEAHKRDGAA LTRFLHWLAT EGQVNPPDEK EAVAKLEAFR EATGLLKDLS FDTIGAANGH GALPHYRPTE RGNMRARLGS LLLVDSGGQY LDGTTDVTRT VAIGEPTAEM VTRNTLVLKG HLAIARLRFP AGTTGSAIDA FARAALWSHG LDYDHGTGHG VGVYLGVHEG PHRISKAPNT VSLQPGMIVS NEPGYYKDGE YGIRIENLEV VMPAETVGTG DRPMHRFQAL TLAPIDRRLV DKSLLSAEEI AQFDAYHARV AAEIGPRVEP EIRAWLEEVC APL
|
| |