Gene Cagg_0433 details

Gene Information

Locus tag: Cagg_0433
Symbol:
ID: 7266601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 536724
End bp: 538238
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 55%
IMG OID: 643565300
Product: 4-alpha-glucanotransferase
Protein accession: YP_002461814
Protein GI: 219847381
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1640] 4-alpha-glucanotransferase
TIGRFAM ID: [TIGR00217] 4-alpha-glucanotransferase
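
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record states neither point explicitly, and the function names are hypothetical.

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
START_BP = 536724
END_BP = 538238


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1515
print(protein_length(length_bp))  # 504
```

Both results agree with the Gene Length and Protein Length fields of the record.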


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.459094
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATTTC AGCGCGCAAG TGGTATCTTG CTCCATCCCT CATCGCTGCC CGGACCATGG 
GGGATTGGCG ATTTAGGCCC GATGGCCTAC CGGTTTGTCG ACTTTTTGGT AGCAGCCGGG
CAATCACTCT GGCAGGTCTT GCCACTTGGC CCAACCGGTT ACGGAGACTC ACCTTACCAG
TGCTTTTCAG CCTTTGCCGG CAATCCACTT CTGATCAGTA TTGATGACCT CATCGAACAC
AATCTCTTAA CCGTCGACGA AGCCCGGGCC GCACTCGGTT ATCTTCCGGC AGAACGGGTC
GATTTCGGTT CACTTATTCC GGCCAAGCAA ACCCTACTTC GTCAGAGTTT TGAACGGTTT
CGTAGCAGCC GTGGCACACC GCTCCATCAA GCGTACACAC GATTTTGCCT CGAGAATGCA
GAGTGGCTGG CGGATTATGC CCTCTTTATG GCGATCAAAG AGGCGCAAGG CGGTGGCAGT
TGGCACAATT GGCCTCCCGA TCTGCGCGAC CGCAAGCCCG AGGCGTTGGC CCGCATTCGC
CGCGATCTTG CGGTGAAGAT TGACTTTCAC CAGTACGTGC AGTTCCTCTT CTTCCGCCAA
TGGCAATCTC TGAAGGAATA TGCCAATCAA CAAGGCGTGA TTATTATCGG CGATGCACCG
ATTTTTGTCG CCGACGATAG CGCTGATGTC TGGGCACATC GCGATCTCTT TTACGTTGAT
GCGCAGGGAA TGCCAACGGT GGTCGCCGGT GTGCCACCAG ATTACTTCAG CACCACCGGC
CAGCGCTGGG GGAACCCGCT CTATCGGTGG GATAAAATGG CCATGACCGG TTATCGTTGG
TGGGTAGCAC GGATGCGACA GGCACTTACG TTGTACGACG TGCTGCGTCT TGATCATTTT
CGCGGATTTG AAGCTTATTG GGAAGTGCCG GCCAGTGCGC CGACAGCAGT TGAGGGGCGG
TGGGTGAAAG GACCGGGCGC CGATCTGTTT CACGTCTTGC ATGCTGAGTT AGGTGATTTA
CCGATTATCG CCGAAGACCT TGGTCTGATT ACACCGGAAG TAGAAAAACT GCGGCTCGCA
TTCGGGCTAC CCGGCATGAA GGTGCTCCAT TTTGCCTTCG GTGATAATCC GAACAACCCT
TACTTGCCAC ACAACTACAC AACGAATTAT GTCGTCTATA CCGGCACCCA CGATAATGAC
ACAACGGTGG GCTGGTTCAA CACCCTCGAT CCGGCCGGCC GGGCCGCCGT TCTCACCTAT
CTTGGCCGTG ATGAACAGAC GGTTGATATT GCATGGGATC TGATGCGGCT GGGGATGATG
TCCGTTGCCA ATTACGTAAT TACGCCTTTG CAAGATGTGT TACGGCTCGG CTCGGAGGCA
CGGATGAATA TGCCGGGTCG CCTCGGCGGC AATTGGGCGT GGCGGTTTTC GGCTGATGCG
CTACAGGAAG AACTGGTGGC ACTGTTGCGA AAACTCACCT ACACCTATGG TCGGCTTCAA
CCGGCAAAAA GCTGA
 
Protein sequence
MQFQRASGIL LHPSSLPGPW GIGDLGPMAY RFVDFLVAAG QSLWQVLPLG PTGYGDSPYQ 
CFSAFAGNPL LISIDDLIEH NLLTVDEARA ALGYLPAERV DFGSLIPAKQ TLLRQSFERF
RSSRGTPLHQ AYTRFCLENA EWLADYALFM AIKEAQGGGS WHNWPPDLRD RKPEALARIR
RDLAVKIDFH QYVQFLFFRQ WQSLKEYANQ QGVIIIGDAP IFVADDSADV WAHRDLFYVD
AQGMPTVVAG VPPDYFSTTG QRWGNPLYRW DKMAMTGYRW WVARMRQALT LYDVLRLDHF
RGFEAYWEVP ASAPTAVEGR WVKGPGADLF HVLHAELGDL PIIAEDLGLI TPEVEKLRLA
FGLPGMKVLH FAFGDNPNNP YLPHNYTTNY VVYTGTHDND TTVGWFNTLD PAGRAAVLTY
LGRDEQTVDI AWDLMRLGMM SVANYVITPL QDVLRLGSEA RMNMPGRLGG NWAWRFSADA
LQEELVALLR KLTYTYGRLQ PAKS
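
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is an illustration only, not the method used to produce this record. It assumes the sequence blocks are read in frame from the first base, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# which NCBI translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T raise KeyError.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCAATTTC AGCGCGCAAG TGGTATCTTG CTCCATCCCT CATCGCTGCC CGGACCATGG"
last_row = "CCGGCAAAAA GCTGA"

print(translate(first_row))  # MQFQRASGILLHPSSLPGPW
print(translate(last_row))   # PAKS
```

The two outputs match the first 20 and the last 4 residues of the protein sequence. The last row starts in frame because the 25 rows before it hold 1500 bases, a multiple of 3. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.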