Gene Information |
Locus tag | Caul_4272 |
Symbol | |
ID | 5901733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4642819 |
End bp | 4644477 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564791 |
Product | peptidase M28 |
Protein accession | YP_001685891 |
Protein GI | 167648228 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
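The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(4642819, 4644477)
print(length, protein_length(length))  # prints: 1659 552
```

The result matches the Gene Length (1659 bp) and Protein Length (552 aa) fields of this record.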
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.43547 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTCT CCCGTCGCGC CGCCCTCGCC GGCGTCGCCC TGCTGTTGAC CGTCCAACCT GTTTTCGCCG CGCCCGTCGC CAAGGCCAAA CCCGCCGCCA CCAAGCCAGC CGCCAGCTTC ACGCCCTCAC CGGCCGCTAT CAAGGCGCAT ATGAGCTTCC TGGCCGACGA CCTGCTGGAA GGCCGCGAGT CCGGCACGCG CGGCTACGAC ATCGCCGCCA ACTACGTGGC TTCGCAATAT GCGGTGATGG GCGTCAAGCC GGCCGGCGAC AACGGCTCCT ACCTGCAGCA GGTGCCGCTG ACGGCCTATC GCTCGGTCAA CGAGGGCGGG GTGTCCTACA CCACCGCCGA CGGCAAGTCC GGCGCCCTGA CCTTCGGCGA GGACTACCTG CCGTCCGCCC AAGCCCGCCA GGCCGACACC TCGGTGACCG CCCCGCTGGT CTTCGTCGGC TACGGGATCA ACGCCCCCGA GCGCGGTCGC GACGACTATG CCGGCCTGGA CGTGAAGGGC AAGATCGTCG TCCTGCTGAC CGGCGCGCCC AGCGGCTTCC AGACCGAGAT CCGGGCCCAC TACAGCAACA CCAACGTCAA GCGGGCCGAG GCGGCCAGGC GCGGCGCGGT CGGGGTCATC ACCCTGCCGA CCACCAGCTC CGAGAAGCGC CGCCCATTCG AGCGCGGCGT AGCCAACTAC CAGGAATGGA AGATGACCTG GAGCGACGCC CGGGGCGTGG CCTATGTGCG CGGCGCCGAG GCGCCGGGCC TGGCCACCCT GAGCCTGAAG GGCGGCGCCA AACTGTTCAC CGGCGCCCCG GTCACCCTGG AAAGCGTGCT GGCCGAGGCC GAGACGCCGG AAGGTCTGGT CAAGGGCTTC GACCTGCCTG TCAACGTGAC GATCCAGCTG AAGACCGAGA TCGAGAAGCG CCGGAGCAGC AACGTGGTCG GCCTGATCGA AGGCTCGGAC CCGACCCTGA AGGCCCAGAC CATCATCCTC AGCGCCCACC TCGACCACCT GGGCATTCAC GGCAAGGACG CCGACAAGAT CAACAACGGC GCGCTGGACA ACGCCTCGGG CGTCGCGACG ATGCTGGAGG TGGCCCGAGG CTTCAAGGAA GCCAAGACCA AGCCCAAGCG CTCGATCGTC CTGCTGGCGG TCACGGCCGA GGAAAAGGGC CTGATCGGCT CGGAATATTT CGCCAACAAC CCGACCGTGC CGAAGGCCGG CATCGCCGCC GACGTCAATC TGGACATGCC GATCCTGCTG TACGACTTCC AGGACGTGAT CGCCTTCGGC GCCGACCGCT CGTCGATCGG CCCGGCCGTG GCCCGCGCCG CTGGCCGCGT CGGCATCGGC CTGTCGGCCG ACCCGCTGCC GGAAGAGGGC CTGTTCACCC GCTCGGACCA CTATCGCTTC GTCGAGCAGG GCGTGCCGTC GGTGTTCCTG ATGACCGGTT TCAAGAACGG CGGCGAAAAG GGCTTCAAGG ACTTCCTGGC CACGCATTAC CACAAGCCCA ACGACGACCT GAACCAGCCG ATCAACTACG AGGCCGGCGC CCGCTTCGCC CTGGTCAATT ACGAGATCGC CCGCGAGCTG GCCGACATGC CGGCCCGTCC GAGCTGGAAC AAGGGCGACT TCTTCGGGAC GCTGTTCGGG AAGAAGTAG
|
Protein sequence | MPFSRRAALA GVALLLTVQP VFAAPVAKAK PAATKPAASF TPSPAAIKAH MSFLADDLLE GRESGTRGYD IAANYVASQY AVMGVKPAGD NGSYLQQVPL TAYRSVNEGG VSYTTADGKS GALTFGEDYL PSAQARQADT SVTAPLVFVG YGINAPERGR DDYAGLDVKG KIVVLLTGAP SGFQTEIRAH YSNTNVKRAE AARRGAVGVI TLPTTSSEKR RPFERGVANY QEWKMTWSDA RGVAYVRGAE APGLATLSLK GGAKLFTGAP VTLESVLAEA ETPEGLVKGF DLPVNVTIQL KTEIEKRRSS NVVGLIEGSD PTLKAQTIIL SAHLDHLGIH GKDADKINNG ALDNASGVAT MLEVARGFKE AKTKPKRSIV LLAVTAEEKG LIGSEYFANN PTVPKAGIAA DVNLDMPILL YDFQDVIAFG ADRSSIGPAV ARAAGRVGIG LSADPLPEEG LFTRSDHYRF VEQGVPSVFL MTGFKNGGEK GFKDFLATHY HKPNDDLNQP INYEAGARFA LVNYEIAREL ADMPARPSWN KGDFFGTLFG KK
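The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC percentage comparable to the GC content field. It is an illustration only: the helper names are hypothetical, and the exact method the database used to derive its rounded 68% figure is not stated in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Usage with a short made-up fragment; paste the full Gene sequence
# field in its place to check the record itself.
fragment = clean_sequence("atgaaa cgcggc tag")
print(len(fragment), gc_percent(fragment), ends_with_stop(fragment))
```

In this record the gene sequence begins with ATG and ends with TAG, so `ends_with_stop` would be expected to return `True` for the full cleaned sequence.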