Gene Cagg_2607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2607 
Symbol 
ID7267198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3192080 
End bp3192889 
Gene Length810 bp 
Protein Length269 aa 
Translation table11 
GC content59% 
IMG OID643567433 
ProductHAD-superfamily hydrolase, subfamily IIA 
Protein accessionYP_002463912 
Protein GI219849479 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01457] HAD-superfamily subfamily IIA hydrolase, TIGR01457
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.988604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGCAT TCTCTGCAAT CCGCGCCGTA CTGTTCGATA TGGACGGTGT GCTCTACCGG 
GGGCAAACGC CGCTGCCCGG CGTCGCCGAT CTGTGGCAGT TTTTGCACGA TCACCAGATC
GCCTTCGCCT GTGCGACTAA CAACGCTTCG ATGACGCCGC AGCAGTACGC GGCTAAGTTG
GCTGCCATGG GCATAGTGCT GCCGGCAGAT CGCGTGATTA CGTCGGCCCA AGCAACTGCC
CTGTATCTGC GTGATCACTA CCCGCCGGGT ACGCGCGTGT TTGTGGTCGG CATGCAGGGG
TTACGCGCAG CATTGTTTGC CGATGGTTAC TTTGTCGAGG ATGACGACGC TCCGGAATTG
GTTGTGCAGG GTGCCGATTT TACGCTCACC TACGAGCGGC TCAAACGGGC AACGCTACAT
ATCCGGCGTG GCGCCCGCTT CATCTCTACG AATCCCGACC GCACCTTTCC CAGCGAAGAG
GGTCTCATTC CCGGCGCCGG TGCAATTGCT GCCGCCCTCA CTGCTGCTAC CGATGTCTCA
CCGCTGGTGA TTGGCAAGCC GGCGCCAACG ATGTTTCTGA TCGGCGCTAA GATGTTAGAT
GCTCCTCCGT CCGCAACACT TGTGGTTGGT GATCGGCTTG ATACCGATAT TGCCGGTGCA
ATCGCCGCCG GCATGCCGTC GGTGTTGGTG TTGACCGGCG TCAGTACAGT TGAAGAAGCT
ACCACCGGCC CGATCCGGCC TGATCTGATC GTGGCTGATT TGCCTGAGTT GCTGGCCCGC
TGGGCCGATG AATTATCGGC GCAACTGTAA
 
Protein sequence
MIAFSAIRAV LFDMDGVLYR GQTPLPGVAD LWQFLHDHQI AFACATNNAS MTPQQYAAKL 
AAMGIVLPAD RVITSAQATA LYLRDHYPPG TRVFVVGMQG LRAALFADGY FVEDDDAPEL
VVQGADFTLT YERLKRATLH IRRGARFIST NPDRTFPSEE GLIPGAGAIA AALTAATDVS
PLVIGKPAPT MFLIGAKMLD APPSATLVVG DRLDTDIAGA IAAGMPSVLV LTGVSTVEEA
TTGPIRPDLI VADLPELLAR WADELSAQL