Gene Acry_1960 details


Gene Information

Locus tag: Acry_1960
Symbol:
ID: 5160852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 2150176
End bp: 2151018
Gene Length: 843 bp
Protein Length: 280 aa
Translation table: 11
GC content: 72%
IMG OID: 640553881
Product: HAD family hydrolase
Protein accession: YP_001235080
Protein GI: 148260953
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459
            [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
            [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
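
The start, end, gene length, and protein length fields in this record are related by simple arithmetic, which the short sketch below checks. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2150176, 2151018)
print(gene_len, protein_len)  # 843 280
```

Both values agree with the Gene Length and Protein Length fields listed in this record.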


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCCG ATACGATGAG CGTCGAAACC CTCGCCCATG ATCACGACGG CTTCATCGTC 
GATCTCTGGG GCGTGGTGCA TGACGGCGTG CGCCCCTATC CCGGCGTTCC CGCCTGCCTG
CGCCACCTGC GCGAGGCCGG CAAGCGCGTC GTCTTCCTCT CCAACGCGCC CCGCCGCACC
GCGCCGGTCG CCGCCGCCCT GGCCGCGATG GACATCGGCC CCGAGCTATA CGACGGCATC
ATGACCAGCG GCGAAGCCGT CCGCGCTGCG CTGGTCTCAC GCACCGAGCC CGATTTCGCG
GCCCTGGGCG ACCGTCTGTT CCATCTCGGC CCGCCGCGTG ACCGCAACCT GTTCGATGAT
CTCGGCCTCG CCGAGGCCGA CCGCCCCGGC GCCGCCGATT TCCTGCTCAA CACCGGCCCG
GACGATCTCG CGCCGCCCGA CGATCCCGCC GCCTTCGATC CGTTGCTGCG TGAGGCCCTC
GGGGCCGGGT TGCCGATGGT CTGCGCCAAC CCGGACCTGG AGGTGATTCG CGACGGGCGC
CGCATCATCT GTGCCGGCAC GCTCGCCCAG CGCTACGCCG CCTGGGGCGG GCGGGTGATC
TGGCGGGGCA AGCCCGATCC CGCCGTCTAT CGCCCGACCC TCGACCTGCT CGGCACCGAA
CCTGGCCGGA CCATCGCGTT CGGAGATTCG CTGCGCACCG ACATCGCCGG CGCGAAGGCG
GCCGGCATCG CCTCGGTGCT CGTGCTGTCC GGCATCCACG TCGCCACGCC GGCCGAGGCG
CGGGCCGATT GCGCGGCCGC CGGGCTCGAT CCGCGCGCCA TCATCGGCGG GTTCCGCTGG
TAA
 
Protein sequence
MTADTMSVET LAHDHDGFIV DLWGVVHDGV RPYPGVPACL RHLREAGKRV VFLSNAPRRT 
APVAAALAAM DIGPELYDGI MTSGEAVRAA LVSRTEPDFA ALGDRLFHLG PPRDRNLFDD
LGLAEADRPG AADFLLNTGP DDLAPPDDPA AFDPLLREAL GAGLPMVCAN PDLEVIRDGR
RIICAGTLAQ RYAAWGGRVI WRGKPDPAVY RPTLDLLGTE PGRTIAFGDS LRTDIAGAKA
AGIASVLVLS GIHVATPAEA RADCAAAGLD PRAIIGGFRW
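
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below is one way to strip that layout, compute GC content, and translate the coding sequence. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It is a minimal illustration, not the pipeline used to produce this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean(
    "ATGACCGCCG ATACGATGAG CGTCGAAACC CTCGCCCATG ATCACGACGG CTTCATCGTC"
)
print(translate(first_line))  # MTADTMSVETLAHDHDGFIV
```

Passing the complete gene sequence through `clean` and then `translate` or `gc_fraction` gives values that can be compared against the protein sequence and the GC content field listed in this record.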