Gene Cagg_1076 details

Gene Information

Locus tag: Cagg_1076
Symbol:
ID: 7268528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1330309
End bp: 1332366
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 57%
IMG OID: 643565921
Product: alpha amylase domain-containing protein
Protein accession: YP_002462426
Protein GI: 219847993
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
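
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1330309
END_BP = 1332366
GENE_LENGTH_BP = 2058
PROTEIN_LENGTH_AA = 685


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the values listed in the record.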


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.819574
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTACG AACTCTACGA ACAGTCCGAT CAGATCGTTC TCCGTACCGA CGCCTACGAG 
CTAGGTTGGT CAACTGAGAA TGGCGCATTA GTGCGTCTCC AACAACACGA TGGTCCGAAT
GTGCTTGGCT TTGGCCCGGT GATCGCCGGG ATTGACATTG CGTTAGGTAG TGCAACGAAT
TGGATCACAG CCCAGACATT TGCCCGCTAC ATCTGGCACC GCTTGAGCAT GCAAGAACAC
CGACCGGTGA TGACGATCAT TATTGGGATT GGCCCACTCA AGCTATTCGA TCACTACCGC
ATTGATGGCG AGGTGATCGA ACGTTGGATC GAGATTGAGA ATGTGAGTGG TGACGACCTG
CGCCTATACG GCGTGCGGAT GATCGTACCA AACGTGCGGG TTGGTGATCC GAATCGATGT
ACCTTCGATG CACCGGGCAA TAGCGTTCGA CCACGAACAC CGCTAACGAT TGCAGCCGGC
CAGGATCGCA ACGTGCTTCC ACGCCGTTTC TTTGCCCCCG GATTGCGCGG TGGGAGCGCC
TTTGAACCGT CGCCTACGCA AGGGGCAGGG GTAATGGCGT TATACGATAA CGACGTACCG
CTCACATTCC TGTGTTGGTA TATCGGTGAT GATGAAGCTG CGTTACCTTA TGTGATGGGC
AACGGTACGG CGGTCTCGCT AGCTTATGAA GTAGCTGTTA CCGGTTGGTT ACGCTCTGAA
CAACGATTAA GAGTGGGCAC GCAATGGATC GGCTTACAGC ACCGATCCTG GCCGGCAGCG
CTTGCCGTAT ACCGCGCACG CACCTCATTG CCACAACCAC CGGCGGCATG GTTGCGCGAT
GCGATTCTCT ACGTAACCGA TCTGCGTCAA CACGGCGGAG CTGCCGGTCT TGCCGCACAA
TTGCCCGAAC TGAAGGCCCT TGGCATTGAC ACACTATGCA TACTGCCGTG GCATACCGTT
GGCGAACGTC CACACCTGAT CGGCGATCTT GAACGGATTG ATCCGGTCTG TGGCGATGAG
CCGGCCATTC GCCGCATGAT CGAAACCGCC CATCAGCATG GGATGCGGGT ATTGCTCGAT
GTGGCGATGC AAGGTTGTGC GCCCGACTCA CGCTACCTTA GCGAACACCC AGAATGGTTT
GTACGCGACG ATTCGGGGGC ATTTGTCATC GGTGTGCCGA CCGACGCGCC GGCAGCAGCT
CGCCATCCGG GTGTAGGCTT GCCCACGAAC GGCTATCACT TTGACTGGAC GCGCACCGAT
TGGCGGTTGT ACTGGCAGCA ATGGGTGATC GCGCAGGTCG ACCGGTTTGC ACTTGATGGC
TTACGGGTCA TCGCACCATA CCAGGCTGCC CCGGCCTGGA TCCGTCGACC ACCGTTGCGG
GCGAGTTCTG GTACAGCGCT GATGGTACAG GCATTGCGTG AGATTATCGC TGTACGTCCG
TCTCTTTCAC TCCTGTGTAC GCTCTCTGGC CCGCATTCAG CCCAATTCGC CGGCGGGTGG
TTTGATTACC CCAGTCATCA TATGCTCATC CATCTTGCTA TGCGTCGGAT CACCCCGGCT
GAGTGGTGTG CCTATCAAAC CGATTACGCC GATCTGTATC CGGCAGCCTA CCGGATCGGC
TTTCTGGAGA TGCATGATAC CGCCGATTGC AACCCGTTGG CCGACGGTTT ACGCGGATCG
CGTTTGGTGC AGGCGTTGTG GGCGGTGATG GTGTTTAGCG GGCTTACACC GGCGATATGG
AACGGACAGG AGCATGCCGA TCGCACCGTC TTGCCGCGTT TGCTCGCACT CTGGCGGCAA
GAACCGGCGT TGCGGCATGG GACGGTCACT TATCAAACGC TTACCACGGC ACCGGTCGAG
GTGTTGACGA TTCGTCGCAC GCTGGGTGAG CGTATCCTGA CCGGAGTAGT TAATTTCGGT
GCGCTCCCAT CACATTTGCT CGTTACCGAA CCGCTCGGCA ACGACCTACT CGGTATCTTT
CCGCACAGTC GTGAGCGGCG TGGTGTGGGT GAGGTTGTTC AGTTGGCGGC ATTTGGGGTG
TACTGTTTTG AGGGATGA
 
Protein sequence
MSYELYEQSD QIVLRTDAYE LGWSTENGAL VRLQQHDGPN VLGFGPVIAG IDIALGSATN 
WITAQTFARY IWHRLSMQEH RPVMTIIIGI GPLKLFDHYR IDGEVIERWI EIENVSGDDL
RLYGVRMIVP NVRVGDPNRC TFDAPGNSVR PRTPLTIAAG QDRNVLPRRF FAPGLRGGSA
FEPSPTQGAG VMALYDNDVP LTFLCWYIGD DEAALPYVMG NGTAVSLAYE VAVTGWLRSE
QRLRVGTQWI GLQHRSWPAA LAVYRARTSL PQPPAAWLRD AILYVTDLRQ HGGAAGLAAQ
LPELKALGID TLCILPWHTV GERPHLIGDL ERIDPVCGDE PAIRRMIETA HQHGMRVLLD
VAMQGCAPDS RYLSEHPEWF VRDDSGAFVI GVPTDAPAAA RHPGVGLPTN GYHFDWTRTD
WRLYWQQWVI AQVDRFALDG LRVIAPYQAA PAWIRRPPLR ASSGTALMVQ ALREIIAVRP
SLSLLCTLSG PHSAQFAGGW FDYPSHHMLI HLAMRRITPA EWCAYQTDYA DLYPAAYRIG
FLEMHDTADC NPLADGLRGS RLVQALWAVM VFSGLTPAIW NGQEHADRTV LPRLLALWRQ
EPALRHGTVT YQTLTTAPVE VLTIRRTLGE RILTGVVNFG ALPSHLLVTE PLGNDLLGIF
PHSRERRGVG EVVQLAAFGV YCFEG
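
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record. It assumes the sequence is given 5' to 3' on the coding strand, strips the 10-character block spacing used above, and translates with the standard codon assignments. To my knowledge translation table 11 uses the same assignments and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG. The names `clean` and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T become "X".
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGAGTTACG AACTCTACGA ACAGTCCGAT"))
```

Applied to the first three blocks of the gene sequence, the function yields the first block of the protein sequence (MSYELYEQSD). The final 18 bases (TACTGTTTTG AGGGATGA) are in frame and yield YCFEG followed by the TGA stop codon.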