Gene Caci_5662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5662 
Symbol 
ID8337022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6530268 
End bp6531317 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content70% 
IMG OID644958766 
ProductHAD-superfamily hydrolase, subfamily IA, variant 1 
Protein accessionYP_003116362 
Protein GI256394798 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.342758 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAATG GCGGCATGGC GGCCGGCACC GCCGGGCACA ACGCTGCGGT GGACGACAAC 
CTGGGCCAGG GCGCCCCAGC CGATCACAGG CCAAGCGAGA ACACTGCGGC CAATCGCAGC
GGAAAGACCG TGACCCACCG CAGCGCGAGC CAAAGCACTG CCGCCGATCA CGCCCCGAGC
CAGCACGCCG CAGTCGATCG CAGCCTGAGT GAGACCGCCG CAGCCGACCG CAGCCCGAGC
CAGAGCACCG TGACCGAAGT CCGAGCCGCC ACCGCCGGCG GCCCGAGTAT CAGCGTCCAG
CCTCGCCGGA CGCGCGCGAC CGGCTATCTG GCCGTCGCCG GGGACTTGGC GCCTCGCGAC
CGTCGCGCGT CTGATGTGGA GGCGATCCTC TGCGATCTCG ATGACACGCT GTATCCGCAG
GCTGCGTGGC TCGATGGCGC GTGGAGTGCT GTGGCGGCGG CGGGTGCGCG GTGGGGCGTC
GAGGAGCGGG CGTTTCTGGC GGCGCTGCGG GCTGATGCGG CGGTGGGGTC GGCGCGGGGC
GGGATCATTG ATCGGGCGCT GGTGGATGTG GGGGTCGGGG GCGGGGCGGA GCTGGTTGCT
GAGCTGCTCG CCGCGTTTCG GGCGTATCGG CCTGTGCGGC TGGAGCCGTA TCCGGGGGTG
CGGGAGGCGT TGGTGCGGTT GCGGGTGGCG GGGGTGCGGC TCGCGGTGGT GACTGATGGG
GATGTGGAGG TGCAGGCTTG GAAGGTGCGG GCTTTGGGGT TGTCCGCTTT TTTTGAGTGC
GTGGTCGTCT CGGATGCGCT GGGGGGACGC GGGGTGCGCA AGCCGAGTGC GGTGCCGTTC
TTGGCCGCGG TGGAGGGGTT GGGGGTGCGG CCTGAGCGGT GTGTTGTGGT GGGGGACCGT
CCTGAGAAGG ATGTTATGGG AGCTCTGGGG GCTGATATCA GGGCTGTTCG GGTGAAAACG
GGGGAATATC GGCAGGTTGC CGATGTGGCA GGGACCTGGC ATACGGCTGC TGATTTTCCG
GCTGCCGTCG ACTGGTTGCT GCGGGAATGA
 
Protein sequence
MANGGMAAGT AGHNAAVDDN LGQGAPADHR PSENTAANRS GKTVTHRSAS QSTAADHAPS 
QHAAVDRSLS ETAAADRSPS QSTVTEVRAA TAGGPSISVQ PRRTRATGYL AVAGDLAPRD
RRASDVEAIL CDLDDTLYPQ AAWLDGAWSA VAAAGARWGV EERAFLAALR ADAAVGSARG
GIIDRALVDV GVGGGAELVA ELLAAFRAYR PVRLEPYPGV REALVRLRVA GVRLAVVTDG
DVEVQAWKVR ALGLSAFFEC VVVSDALGGR GVRKPSAVPF LAAVEGLGVR PERCVVVGDR
PEKDVMGALG ADIRAVRVKT GEYRQVADVA GTWHTAADFP AAVDWLLRE