Gene Information |
Locus tag | Caci_5030 |
Symbol | |
ID | 8336384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5763828 |
End bp | 5765402 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958129 |
Product | X-Pro dipeptidyl-peptidase domain protein |
Protein accession | YP_003115731 |
Protein GI | 256394167 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
|
|
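The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions agree with the values listed here. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 5763828
end_bp = 5765402

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in 3-bp codons. Assumption: the last codon is the stop
# codon, which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Leftover bases after splitting into codons: {remainder}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the listed Gene Length (1575 bp) and Protein Length (524 aa).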
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0204968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGTTCG ACATCCGCGT CGAATCAGGG CTCGAAGCGA CGATGCGCGA CGGGACGATC CTGCGCGCCG ACGCCTACCG CCCGATGGGG AGCGGACCGT GGCCCGTGCT GCTGGTCCGC ACCCCGTACG ACAAGCAGAA CGCAGAGGTG CTCTCACGCC TCGATCCGCA AGGCGCCGCC GGCCGCGGCT ATCTGGTCAT CGTCCAGGAC TGCCGCGGCC GGTTCGCCTC CGACGGTGTG TGGGAACCGC TGCTGCACGA CGGCCCAGAC GGCTACGACA CGATCGTCTG GGCCGCGAGC CTGCCCGCCT CGAACGGCCG GGTCGGAACG TACGGACCCA GCTACCTGGG CTATACCCAG CAGGCGGCGA GGGCGGCGCA ACCACCTGGG TTGTGCGCCA GCGTTCCGGC GTTCACCTGG TCGGATCCGA ACGACGGGCT GATGGCGCGC GGCGGCGCCT ACGAACTCGG GCTGATGACG CACTGGACTT TGTCGCTCGG GTTCGATGTC TTGGCACGCC GGTACGCCGA CGCTCCGCAG GAGTTGGCAT CCCGGCTCGC GGCGCTGAAC GGTGCGCTGG AGGACTTCCG GTCGCGGGTC GTCTGGGACT CGCCTGCAGA GGACCTGCCT GTGCTCCGGC GCCTGGGGCT GACGACGCCA AAGCCGACCA GTGCTCCGCA CCAGCGCTCG GCTGCGATAC CGACCCTGAC CATCGCAGGC TGGTTCGACT GCTTCCTCCA AGGGAGCCTC GACAACCACG TCGCAGCGAC AGCCAGCGGC GCGCCGACGG CACTGATCGT CGGTCCGTGG ACGCACGACG ACCAGAGCAG CCAGGGCGGC GCGTCCCTCA ACGCACGCGA ACTCGACTTC CTCGATCGGC ACCTCAGGCC CGACTCCAGC GTCCAGGCGC CCGAGTCACC CGAGTCACCC GAGTCACCCG TGCAGGTATT CGTGATGGGC ACTGACGAAT GGCGCCGCTT TCCGTCCTGG CCATCCCAGA GCACTGAGAG CTCCTGGTAT CTGCACCCTG ATGCATGCCT GGCGCCTCTC TTGCCGCCGA ATTCACCGCC GGACTCCTTC GACCACGACC CCGACGACCC CGTCCCCACC CTCGGGGGCG CCATCCTCCT CGGCCCCGAT TTCCCTTCCG GACAGTGCGA CCAGGCGCAG ATCGAGGAGC GCGACGACGT CCTGATCTAC ACCAGCGAAC CGATGAAAAC CTCGCTCGAA GTCATCGGCC GCGTCCGCGT AGAACTGTTC GCCACATCGA CGGCACCGAG CACCGACTGG ATCGCACGCC TCTGCGACGT CGACGAACAC GGCGTCTCCC GCAACATCAC CGACGGCATC CTCCGCGCGC CATCAGCCGA GCCCCAGCGT CAGCCTCAGA AACACACGAT CGACCTGTGG TCCACAGCCC ACGCATTCCT CCCCGGCCAC CGCATCCGCC TCCAGATCAC CTCTACCTGC TTCCCCCGCT GGGCCCGCAA CCCCGCCTCG TCCACCGCGC GCCAAACCGT GCACCACGGC AACGCAACAC CGTCACGGCT CATCCTCCCG AGGACGCCGG CTTAA
|
Protein sequence | MVFDIRVESG LEATMRDGTI LRADAYRPMG SGPWPVLLVR TPYDKQNAEV LSRLDPQGAA GRGYLVIVQD CRGRFASDGV WEPLLHDGPD GYDTIVWAAS LPASNGRVGT YGPSYLGYTQ QAARAAQPPG LCASVPAFTW SDPNDGLMAR GGAYELGLMT HWTLSLGFDV LARRYADAPQ ELASRLAALN GALEDFRSRV VWDSPAEDLP VLRRLGLTTP KPTSAPHQRS AAIPTLTIAG WFDCFLQGSL DNHVAATASG APTALIVGPW THDDQSSQGG ASLNARELDF LDRHLRPDSS VQAPESPESP ESPVQVFVMG TDEWRRFPSW PSQSTESSWY LHPDACLAPL LPPNSPPDSF DHDPDDPVPT LGGAILLGPD FPSGQCDQAQ IEERDDVLIY TSEPMKTSLE VIGRVRVELF ATSTAPSTDW IARLCDVDEH GVSRNITDGI LRAPSAEPQR QPQKHTIDLW STAHAFLPGH RIRLQITSTC FPRWARNPAS STARQTVHHG NATPSRLILP RTPA