Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4017 |
Symbol | |
ID | 5901479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4352676 |
End bp | 4353737 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564538 |
Product | proline imino-peptidase |
Protein accession | YP_001685640 |
Protein GI | 167647977 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAT CTCTTGTTCT GGCCGTTGCG CTGGCTTTCG CCGGCGCGCC CCATCTAGTC CTGGCCGCCC AGCCGGCCGC GGTCGCCGTG CCGGCGAACG CGGCCATCAA TGAAGAGGGC TTCGTCCGCA TCGGGGGGAT CGAGCAGTGG GTCACCATCC ACGGCGAGGA CCGCTCCAAG CCGGTGATCC TGATGGCGCA CGGCGGTCCG GGCAATCCGA TGACCCCCTA CGCCCACCCC TATTTCCAGG CGTGGGAGAA GGACTTCGTC ATCGTCCAGT GGGACCAGCG CGGCGCGGGC ATGACCTATG GCCGCAACCC GCCGGCCCAG GGCGAGCACC TGAGCGTCGA GCGGCTGCGC GACGACGGCA TCGAGGTGGC GACCTATGCG GCCAAGCATC TGGGCAAGCC CAAGGTGATC CTGATGGGCG GCTCGTGGAG TTCGATCCTG GGGACCCACA TGGCCAAGGC GCGGCCCGAC CTGTTCTACG CCTATGTCGG CTCCTCGCAC CTGGCCCGTT CGGCCGACAA TCTGAAGGCC TCGTACGACC GGACGCTGGG CCTGGCGCGG GCCGGCGGCG ACCAGGACGC GATCGGCAAG CTGGAGGCCA TGGGGCCGCC GCCCTGGACC AACCCGCGAA ACTTCGGGAT CATCCGCCGC ATCACCCGCA AGTACGAGGC CGCACGCACT GATCCCGCGC CCGCCACGTG GATGGAGCCC AATCCCGTCT ACGCCACGGA GAAGGCCCTG GCCGACTACG AGGGCGGCGA GGACTATTCC TACATCGAGT TCGTGGGGAT GAACGGCGAG GGTATGTATT CGAAGACCGA TCTCTACGCC CTGGGGCCGC AATTCAAGCT GCCGGTGTTC GTGATCCTGG GCGAGCAGGA CCTGGTCTCG ACGCCCGAGG TCGCCCGCGC CTGGTTCGAT ACCTTGCAGG CGCCTGACAA GGCGTTCGTG CTGCTGCCGC GCACGGGGCA CGATCCGAAC CCGGCCATGG CGGCGGCGCA GCTGGAGATC TTGAAGACGC GGGTCTTGCC GCTGATCGGA AAGGGCGGCT GA
|
Protein sequence | MSKSLVLAVA LAFAGAPHLV LAAQPAAVAV PANAAINEEG FVRIGGIEQW VTIHGEDRSK PVILMAHGGP GNPMTPYAHP YFQAWEKDFV IVQWDQRGAG MTYGRNPPAQ GEHLSVERLR DDGIEVATYA AKHLGKPKVI LMGGSWSSIL GTHMAKARPD LFYAYVGSSH LARSADNLKA SYDRTLGLAR AGGDQDAIGK LEAMGPPPWT NPRNFGIIRR ITRKYEAART DPAPATWMEP NPVYATEKAL ADYEGGEDYS YIEFVGMNGE GMYSKTDLYA LGPQFKLPVF VILGEQDLVS TPEVARAWFD TLQAPDKAFV LLPRTGHDPN PAMAAAQLEI LKTRVLPLIG KGG
|
| |