Gene Caci_5439 details

Gene Information

Locus tag: Caci_5439
Symbol:
ID: 8336793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6263981
End bp: 6265015
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 75%
IMG OID: 644958537
Product: HAD-superfamily hydrolase, subfamily IIA
Protein accession: YP_003116139
Protein GI: 256394575
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
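
The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(6263981, 6265015)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```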


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCTG AGACCAACGA GGGCGCCGGC AACGGCGGAC CCGTGCCGTT CCTGACGTCC 
CAGAAGCCGC TCGCCGAGGC GTACGACACC GCGCTGCTGG ACCTGGACGG CGTGGTGTAC
CGCGGCGCCG ACGCGGTGCC GCACGCCGCC GAGGCGCTGC GCGCCGCGCA GGAGCACGGC
ATGCGGCGCA CCTACGTGAC CAACAACGCC TCGCGCACCC CGGAGGCCGT CGCCGAGCAC
CTGAACGAGC TCGGCGTCGC CGCGGCGGCG CACGAGGTCG TCACCTCGGC CCAAGCCGCC
GCGCGCATGG CGGTGGCCTG CGTCGGCGAG GGCGGCCGGG TCCTGGTGAT CGGCGGCGAC
GGACTGCGGG CGGCGGTGCG CGAGCTGGGG CTGAAGGCGG TGGCCGGCGC CGACGACATG
CCCGACATCG TGGTCCAGGG CTATTCGCCC GACCTCGGCT GGAAGGACCT GGCCGAGGCG
ACGTACGCGG TGCGCCGCGG CGTGCCGTGG ATCGCCACCA ACACCGACAC CACGGTCCCG
ACCGCGCGCG GTATCGCCCC GGGCAACGGC ACGCTGGTCG CCGCGGTCGG CGCCGCCTCG
GGCAAGACCC CGCAGGTCGC GGGCAAGCCG GAGCTGCCGC TGCACCGCGA GTCGATCCTG
CGCTCCGGCG CCACGCGGCC GCTGATCGTC GGCGACCGGC TGGACACCGA CATCGAGGGC
GCGGTCCGCG GGAACACCGA CAGCCTGCTG GTCTTCACCG GCGTGACCAC GGCGCGCGAC
CTGCTCGCCG CGCCGCCGGA CCGGCGCCCC AGCTACCTCG CCGAGGACCT GCGCGGGCTG
CTGACCGCGC ACGTCGCGCC GACCCGCGAC GGGGTGAACT TCGTCTCGGC GCGCTGGACC
GCCGCGGTGG TCTCCGAGCA GGTCGTGCTG CACGGGCACG GGGACAAGAT GGACGCGCTG
CGGGCGATGT GCGCCGCGGT GTGGGAGTAC GGCCGCGAGG TCGACGTCGA GGACGCGCTG
GCGAGTCTGG CTTAG
 
Protein sequence
MTAETNEGAG NGGPVPFLTS QKPLAEAYDT ALLDLDGVVY RGADAVPHAA EALRAAQEHG 
MRRTYVTNNA SRTPEAVAEH LNELGVAAAA HEVVTSAQAA ARMAVACVGE GGRVLVIGGD
GLRAAVRELG LKAVAGADDM PDIVVQGYSP DLGWKDLAEA TYAVRRGVPW IATNTDTTVP
TARGIAPGNG TLVAAVGAAS GKTPQVAGKP ELPLHRESIL RSGATRPLIV GDRLDTDIEG
AVRGNTDSLL VFTGVTTARD LLAAPPDRRP SYLAEDLRGL LTAHVAPTRD GVNFVSARWT
AAVVSEQVVL HGHGDKMDAL RAMCAAVWEY GREVDVEDAL ASLA
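
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. The codon table is built from the standard amino acid assignments, which table 11 shares with the standard code. START_CODONS lists only a common subset of the table 11 initiators, and translate_cds is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order (shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a common subset of table 11 initiation codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiating start codon as M and
    stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACCGCTG AGACCAACGA GGGCGCCGGC"))  # MTAETNEGAG
```

Passing the full gene sequence (all blocks joined) to translate_cds should reproduce the protein sequence listed above if the record is internally consistent.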