Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4335 |
Symbol | |
ID | 8335689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4921960 |
End bp | 4924725 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957438 |
Product | phosphoesterase |
Protein accession | YP_003115040 |
Protein GI | 256393476 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.284385 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.290519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTAA CGCGCCGCAA GGTGCAGAAG GCGAGCGCTT TCCAGCGCGC CGCCGGTCCG TCCCGGCACC GCCTGTCCAA GGCGGCTGTC GTGGCCCTCG GTCTGGTCGT CACCAGTGCC GGGGTCGGAT CGGCTTCCAC CCGGCAGCTC GGCCACCAGC AGGTCGGCCA GCTGACCAAC AAGGGCGAGG TCATCGCCAG CGACCAGTAC ATCAACCCGA TCGGCTCGCG CCTGGTGGTG AACCAGGGCA AGATCATGGG CGCCACCGTG AGCCCGGACG GCACGCACGT GGCCGCCACG ATCACCGACG GCACCGGCGC GATGGTCATC ATCGACCTGC GCAGCTACCA GGTGCAGCAG GTGATCGGCA AGGCCGCCAC CGGCGTGAAC CTGGCGATCA GCGGAAACGA CGTCGGGCAG ACCTCGCCGA CGTACTCTCC CGACGGCAAG TCGGTGTGGG TGGGCCGGGC CAACGGCTAC ACCAAGTTCA CCGTGAAGCC CGACGGCACG CTGGCCACCC CGGTCGACGT CGCGATCCCG GCGGCCGGCT CGGCGCAGGC GCTGTCCGGC AAGGCGGTCT TCTCCGCCGA CGGCGCGACG GTGTACGGGG CGGTCAACGG CCAGAACCGG GTCGTGGCGA TGGACGCCGC CACCGGCGCG ATCACCCAGA GCTGGACCAC CGGCATCGCG CCGCGCGGCA TGGCGCTGGT CAACGGCAAG CTGTACGTGA GCAACGAGGG CGGCCGCACC GCCGTCGCCG GCGACACGAC GATGAACTCC TACGGCACCG CGGTGCCGGC GAACCCGGAG ACCGGCGCCT CGACGACCGG CACGGTCAGC GTCATCGACA CCGTGAACCC CACGGTGCCG GTCGGCAGCA TCACCGTCGG CACGCACCCG ACCGCCGTCT ACGCCTCGGG CGCCACGGTC TTCGTGGCGA ACACCGCCGC GAACTCCGTC TCGGTCATCG ACACCGCCAA GAACAAGGTG GTCCAGACGA TCGCGACCCA GCCCTGGACC GAGGCCTCGG TCGGCTACGA GCCCGACGCG ATCTCCCTGA CCCCCGACGG CCACCTGCTG GTCAGCCTGG GCCGGGCGAA CGCGGTCGCG GTGTACCGCT TCAAGTCCGC GCAGCAGCCG GTCAGCTACG TCGGCCTGCT GCCGACCGAC TACTTCCCGG CCGAGGTGGC CACGGTCGGC AACCAGATCG TGGTGGCCAA CACCCGCGGC GTGGACGCCC TGCGCCCGAC CGTCGCCGCC GGCCACGGCA CCCACGACAC CACCAGCAGC CTGACCCGCT TCAAGCTGCC GACCGACCAG CAGATCCGGG ACTACACCGG CAAGGTCTTC CAGTACAACG GCTGGACGCC CGATTCGGTG AAGGTGGCCT CCAAGCCCAA CCAGCAGGCG AAGAAGGCGG TCCCGATCCC GGCGCAGATC GGCGACCCCT CGACGATCAA GCACGTGTTC CTGATCGTCA AGGAGAACCG GACCTACGAC CAGGTCTACG GCGACATGCC GCAGGGCAAC GGCGACTCGA CGCTGACGCA GTACGGCGAG GCTGTCACGC CGAACCAGCA CGCCCTGGCC GAGCAGTTCG GTCTGTACGA CAACTTCTAC GACGTCGGCA CGAACTCCGC CGAGGGCCAC AACTGGCTGA TGCAGTCGGA CGACCCGGAG TACACCGAGT CCTCGGCCGG TGAGTACACC CGTAGCTACG ACACGGAGAA CGACGTCCTG GGCCACCAGG AGTCGGGCTT CATCTGGACC GGCGCGAAGG CGGCGGGCAA GAGCGTCAAG GACTACGGCG AGTTCCAGTC GATCGAGAAC AAGCCGGCCG GCGCGACCTG GCAGGACTAC TACTGCGACG CGCAGACCAT GTCCGCCACC GGCGCCCCGA GCCAGTACCC GATCCAGACC GGCTCGGCGA TCCCGTCGCT GAACGACGTC TCGGTCCCCG GCTTCCCGCT GTTCGACCTC TCGGTCCCGG ACGTGTACAA GGCGCAGGTC TGGAAGCAGG ACTTCGAGAA GAACGGCCCG GCCAACCTGA ACATGTTCTG GCTCTCCGAC GACCACACCG GCGGACCGCC GAGCCCGGCC GCCGAGGTCG CCGACAACGA CCTCGCGGTC GGCCAGATCG TCGACACCAT CTCGCACAGC CCGTACTGGA AGGACTCGGC CATCTTCGTG GTCGAGGACG ACTCCCAGGC CGGTCTGGAC CACGTCGACG GCCACCGCGC CCCGGTCCAG GTCATCAGCC CCTACGCCAA CCACGGCACC GTGGACTCCA CCTACTACTC GCAGATCATG ATGGTCCGCA CCATCGAGCA GATCCTCGGC ATCAAGCCCA TGAACCAGCT CGACTCCGCG GCCACCCCGA TGACCTCGGC CTTCACCACC AAGCCGAACC TCACCCCCTT CACCACCGTG CCGAACCAGA CCTCGCTGAC CCTGGGCCTG CCCACCCAGC CGGCCTGCGG CGCCAACGTC CCGGCGGGCC AGACCACCGC CTCCGTCGCC AAGGCCAGCG CCGCCGCCAC CGCGGTCCCG GCCTCGGAGA CCGCGGTCGC CGCCCAGTGG AAGTCCTGGG CCGCCCTGCA GCACCTGACC GGCCCGAACG CGATGCCGGA CTTCGCCAAC CCCCAGCTGA TGAACCGGTA CACGTACTAC CAGAACAACG GCTGGACCAA GGCTTACCCG GGCGACAAGA AGATCCTCGC GCCGAACGAT GTTCCGGGTG CCTTCCTGCC GGGTTCGGCT GACTAG
|
Protein sequence | MQVTRRKVQK ASAFQRAAGP SRHRLSKAAV VALGLVVTSA GVGSASTRQL GHQQVGQLTN KGEVIASDQY INPIGSRLVV NQGKIMGATV SPDGTHVAAT ITDGTGAMVI IDLRSYQVQQ VIGKAATGVN LAISGNDVGQ TSPTYSPDGK SVWVGRANGY TKFTVKPDGT LATPVDVAIP AAGSAQALSG KAVFSADGAT VYGAVNGQNR VVAMDAATGA ITQSWTTGIA PRGMALVNGK LYVSNEGGRT AVAGDTTMNS YGTAVPANPE TGASTTGTVS VIDTVNPTVP VGSITVGTHP TAVYASGATV FVANTAANSV SVIDTAKNKV VQTIATQPWT EASVGYEPDA ISLTPDGHLL VSLGRANAVA VYRFKSAQQP VSYVGLLPTD YFPAEVATVG NQIVVANTRG VDALRPTVAA GHGTHDTTSS LTRFKLPTDQ QIRDYTGKVF QYNGWTPDSV KVASKPNQQA KKAVPIPAQI GDPSTIKHVF LIVKENRTYD QVYGDMPQGN GDSTLTQYGE AVTPNQHALA EQFGLYDNFY DVGTNSAEGH NWLMQSDDPE YTESSAGEYT RSYDTENDVL GHQESGFIWT GAKAAGKSVK DYGEFQSIEN KPAGATWQDY YCDAQTMSAT GAPSQYPIQT GSAIPSLNDV SVPGFPLFDL SVPDVYKAQV WKQDFEKNGP ANLNMFWLSD DHTGGPPSPA AEVADNDLAV GQIVDTISHS PYWKDSAIFV VEDDSQAGLD HVDGHRAPVQ VISPYANHGT VDSTYYSQIM MVRTIEQILG IKPMNQLDSA ATPMTSAFTT KPNLTPFTTV PNQTSLTLGL PTQPACGANV PAGQTTASVA KASAAATAVP ASETAVAAQW KSWAALQHLT GPNAMPDFAN PQLMNRYTYY QNNGWTKAYP GDKKILAPND VPGAFLPGSA D
|
| |