Gene Information |
Locus tag | Caci_5662 |
Symbol | |
ID | 8337022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6530268 |
End bp | 6531317 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644958766 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 1 |
Protein accession | YP_003116362 |
Protein GI | 256394798 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
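
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Caci_5662.
length_bp = gene_length(6530268, 6531317)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1050 349
```

Under these assumptions the computed values agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields.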
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.342758 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAATG GCGGCATGGC GGCCGGCACC GCCGGGCACA ACGCTGCGGT GGACGACAAC CTGGGCCAGG GCGCCCCAGC CGATCACAGG CCAAGCGAGA ACACTGCGGC CAATCGCAGC GGAAAGACCG TGACCCACCG CAGCGCGAGC CAAAGCACTG CCGCCGATCA CGCCCCGAGC CAGCACGCCG CAGTCGATCG CAGCCTGAGT GAGACCGCCG CAGCCGACCG CAGCCCGAGC CAGAGCACCG TGACCGAAGT CCGAGCCGCC ACCGCCGGCG GCCCGAGTAT CAGCGTCCAG CCTCGCCGGA CGCGCGCGAC CGGCTATCTG GCCGTCGCCG GGGACTTGGC GCCTCGCGAC CGTCGCGCGT CTGATGTGGA GGCGATCCTC TGCGATCTCG ATGACACGCT GTATCCGCAG GCTGCGTGGC TCGATGGCGC GTGGAGTGCT GTGGCGGCGG CGGGTGCGCG GTGGGGCGTC GAGGAGCGGG CGTTTCTGGC GGCGCTGCGG GCTGATGCGG CGGTGGGGTC GGCGCGGGGC GGGATCATTG ATCGGGCGCT GGTGGATGTG GGGGTCGGGG GCGGGGCGGA GCTGGTTGCT GAGCTGCTCG CCGCGTTTCG GGCGTATCGG CCTGTGCGGC TGGAGCCGTA TCCGGGGGTG CGGGAGGCGT TGGTGCGGTT GCGGGTGGCG GGGGTGCGGC TCGCGGTGGT GACTGATGGG GATGTGGAGG TGCAGGCTTG GAAGGTGCGG GCTTTGGGGT TGTCCGCTTT TTTTGAGTGC GTGGTCGTCT CGGATGCGCT GGGGGGACGC GGGGTGCGCA AGCCGAGTGC GGTGCCGTTC TTGGCCGCGG TGGAGGGGTT GGGGGTGCGG CCTGAGCGGT GTGTTGTGGT GGGGGACCGT CCTGAGAAGG ATGTTATGGG AGCTCTGGGG GCTGATATCA GGGCTGTTCG GGTGAAAACG GGGGAATATC GGCAGGTTGC CGATGTGGCA GGGACCTGGC ATACGGCTGC TGATTTTCCG GCTGCCGTCG ACTGGTTGCT GCGGGAATGA
|
Protein sequence | MANGGMAAGT AGHNAAVDDN LGQGAPADHR PSENTAANRS GKTVTHRSAS QSTAADHAPS QHAAVDRSLS ETAAADRSPS QSTVTEVRAA TAGGPSISVQ PRRTRATGYL AVAGDLAPRD RRASDVEAIL CDLDDTLYPQ AAWLDGAWSA VAAAGARWGV EERAFLAALR ADAAVGSARG GIIDRALVDV GVGGGAELVA ELLAAFRAYR PVRLEPYPGV REALVRLRVA GVRLAVVTDG DVEVQAWKVR ALGLSAFFEC VVVSDALGGR GVRKPSAVPF LAAVEGLGVR PERCVVVGDR PEKDVMGALG ADIRAVRVKT GEYRQVADVA GTWHTAADFP AAVDWLLRE
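
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. As a rough sketch, the GC content field could be recomputed from the gene sequence as shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
first_blocks = "ATGGCGAATG GCGGCATGGC"
print(clean_sequence(first_blocks))  # ATGGCGAATGGCGGCATGGC
print(gc_fraction(first_blocks))     # 0.65
```

Passing the complete Gene sequence value to `gc_fraction` would give a figure to compare with the GC content field, which the record reports rounded to a whole percent.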