Gene Cagg_2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2091 
Symbol 
ID7267598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2559512 
End bp2561545 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content55% 
IMG OID643566925 
Productglycogen branching enzyme 
Protein accessionYP_002463414 
Protein GI219848981 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAG GTGAGCAAAC CCGCCGCACC CGGCGTAAAC AAGCAACACC GGTCACCGAA 
GCGACGGCAC CGATGACCAC AGCAGAGGTT GAACATACGA CTACCGAAGG ACCTGAACCG
TCGGCAGCTA CCACGGCGAT GATGAAGAGC ATTCTGAGTG AGGATGACAT TTATCTCTTC
AATCAGGGCA CCCACTACCG GCTCTACAAC AAATTCGGCG CCCAGCCGGT GACGATTGAA
GGCGTACCCG GTACCTATTT CGCAGTGTGG GCACCGAATG CTGAATACGT CGCCGTTATT
GGCGACTGGA ACAATTGGGA CCCCGGCGCA CATCCGCTGC GCCAGCGAGG CTTTTCCGGT
GTGTGGGAAG GGTTCATTCC CCACATCGGC AAAGGCATGC GCTACAAATT CCATATTGCC
TCACGCTATT ATGGCTACCG CGAAGACAAG ACCGATCCCT TTGGCAGCTA TTTTGAAGTA
GCACCGCAAA CGGCGGCTAT CATCTGGGAT CGGGAGTATA CGTGGTCGGA TCAGCAGTGG
ATGAGCGAAC GCGGTCAGCG CCAGCGTCTC GATGCGCCGA TCTCGATCTA CGAGGTTCAC
CTCGGTTCAT GGCGACGTAA GCCGGAGGAG GATAACCGCC CTCTCACCTA CCGGGAACTG
GCCCACGAAC TGGTCGAGCA TGTGAAAGCA TGCGGCTTTA CTCACGTGGA GCTGCTACCG
GTGACCGAAC ACCCCTTCTA CGGCTCGTGG GGCTACCAAT CGACCGGGAT GTTCGCACCA
ACCAGCCGGT ATGGTACCCC GCAAGATTTT ATGTACTTTG TCGATTACCT GCACCAGCAC
GGTATTGGTG TGATCCTCGA CTGGGTACCG AGCCACTTTC CAACCGATGG TCACGGACTG
GCGTATTTCG ACGGCACGCA TCTCTACGAA CACGCCGATC CGCGCAAAGG TTATCATCCC
GACTGGGGGA GCTACATCTA CAATTACGGT CGCAACGAGG TGCGTAGCTT TCTGATCAGT
TCGGCCCTGT GCTGGCTCGA CAAGTTCCAT ATTGACGGTT TGCGCGTCGA TGCCGTTGCC
AGTATGCTCT ATCTCGACTA CTCGCGGCGA CCCGGTGAGT GGATTCCGAA TGAATACGGC
GGTAATGAAA ATCTGGAAGC GATTAGCTTC TTGCGTGAAC TGAACACCCA AATCTACAAG
TATTATCCCG ACGTGCAAAC CATTGCCGAA GAGAGCACAG CGTGGCCAAT GGTTTCGCGC
CCGGTCTATG TGGGTGGGTT AGGCTTTGGC TTCAAGTGGG ATATGGGGTG GATGCACGAC
ACACTGCAAT ACTTCCGCCG CGATCCCATC TACCGTCGCT TCCACCACAA CGAGCTGACC
TTCCGTGGCC TCTATATGTT TACCGAGAAC TACGTACTCC CGCTCTCGCA CGATGAAGTC
GTTCACGGCA AGGGGTCGCT GCTCGATAAA ATGGCCGGCG ATGTTTGGCA GAAGTTCGCC
AACTTACGCT TGCTCTACTC CTATATGTTT GCCCAACCCG GTAAGAAGCT GCTCTTCATG
GGCGGTGAGT TTGGGCAATG GCGGGAATGG TCGCACGATA CGAGCCTCGA TTGGCATTTG
CTGATGTTCC CTTCCCATCA AGGAATGCTT CGGCTCATCA GTGACCTCAA CCGACTTTAC
CGCAGTGAAC CGGCTTTGCA CGAACTTGAC TGTGATCCGA AGGGGTTCGA GTGGATTGAC
GCCAATGATG CCGATACCAG TGTGTATAGC TTTTTGCGCA AGAATCGGCA CGGCGAGACG
ATTTTGGTGG TGCTGAATGC AACACCGGTA GTCCGCGAAG ACTACCGTGT CGGCGTACCG
TTTGGTGGTT GGTGGCGAGA GTTACTGAAC AGCGATTCGG AATACTATTG GGGGAGTGGG
CAAGGAAATG CCGGTGGTGT GATGGCTGAA GAACTACCAT CGCATGGACG GCCATTCTCG
TTGCGCTTGC GCTTACCGCC GTTGGGAGCG TTGTACTTCA AGCATAGCGG ATAG
 
Protein sequence
MSEGEQTRRT RRKQATPVTE ATAPMTTAEV EHTTTEGPEP SAATTAMMKS ILSEDDIYLF 
NQGTHYRLYN KFGAQPVTIE GVPGTYFAVW APNAEYVAVI GDWNNWDPGA HPLRQRGFSG
VWEGFIPHIG KGMRYKFHIA SRYYGYREDK TDPFGSYFEV APQTAAIIWD REYTWSDQQW
MSERGQRQRL DAPISIYEVH LGSWRRKPEE DNRPLTYREL AHELVEHVKA CGFTHVELLP
VTEHPFYGSW GYQSTGMFAP TSRYGTPQDF MYFVDYLHQH GIGVILDWVP SHFPTDGHGL
AYFDGTHLYE HADPRKGYHP DWGSYIYNYG RNEVRSFLIS SALCWLDKFH IDGLRVDAVA
SMLYLDYSRR PGEWIPNEYG GNENLEAISF LRELNTQIYK YYPDVQTIAE ESTAWPMVSR
PVYVGGLGFG FKWDMGWMHD TLQYFRRDPI YRRFHHNELT FRGLYMFTEN YVLPLSHDEV
VHGKGSLLDK MAGDVWQKFA NLRLLYSYMF AQPGKKLLFM GGEFGQWREW SHDTSLDWHL
LMFPSHQGML RLISDLNRLY RSEPALHELD CDPKGFEWID ANDADTSVYS FLRKNRHGET
ILVVLNATPV VREDYRVGVP FGGWWRELLN SDSEYYWGSG QGNAGGVMAE ELPSHGRPFS
LRLRLPPLGA LYFKHSG