Gene Cagg_0040 details


Gene Information

Locus tag: Cagg_0040
Symbol:
ID: 7269037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 62561
End bp: 63841
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 57%
IMG OID: 643564913
Product: isocitrate lyase
Protein accession: YP_002461429
Protein GI: 219846996
COG category: [C] Energy production and conversion
COG ID: [COG2224] Isocitrate lyase
TIGRFAM ID: [TIGR01346] isocitrate lyase
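
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(62561, 63841)
print(length, protein_length_aa(length))  # prints "1281 426"
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields of the record.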


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACATGG ATCGTGCCGC ACAAATCAAA CAAATTGCAG ACAGTTGGAA CACACCTCGC 
TTTGCCGGTA TTGTGCGTCC GTACACTGCC GAAGATGTCT ATCGTTTGCG TGGTTCGGTA
CAGATCGAAT ACACTCTGGC GCGGATGGGT GCCGAGCGCT TGTGGGATCT GCTGCACACC
GAGCCGTATA TCAATGCCTT AGGCGCGCTG ACCGGTAATC AGGCGATGCA GCAGGTGAAG
GCCGGATTGA AGGCGATCTA CTTGAGCGGA TGGCAGGTTG CGGCTGACGC TAACCTCGCC
GGCCAAATGT ACCCTGACCA GAGCCTCTAT CCGGCAAATT CAGGCCCACA ATTGGTACGG
GCTATCAACA ACGCGCTACG ACGCGCCGAT CAGATTTACC ACAGTGAAGG ACGCAACGAT
ATTTACTGGT TTGCGCCGAT CGTTGCCGAT GCTGAGGCCG GGTTCGGTGG CCCGCTCAAT
GTCTTCGAGA TTATGAAGGC GTACATCGAA GCCGGTGCGG CGGGCGTACA CTTTGAAGAT
CAGCTTGCGT CCGAAAAGAA ATGTGGGCAT ATGGGTGGGA AAGTGTTGAT CCCAACCCAA
GCTGCGATCC GCAATTTGGT GGCTGCCCGT TTGGCCGCCG ATGTGATGGG GGTGCCGACC
CTTATTATCG CGCGTACCGA TGCTAATGCG GCAACCTTGC TGACGAGCGA TATTGATGAG
CGCGACCGGC CCTTCTGCAC CGGTGAGCGA ACCAGCGAAG GCTTCTATCG AGTACGGGCC
GGCCTTGATC AGGCAATTGC ACGCGGCTTA GCCTATGCAC CTTACGCCGA TATGATCTGG
TGCGAGACGA GCGAGCCAAA CCTCGAAGAG GCACGACGCT TCGCCGAGGC AATTCATGCT
CAATTCCCGG GCAAGCTGCT AGCGTACAAC TGCTCGCCTT CGTTCAACTG GAAGAAGAAG
CTCGACGATG CAACGATTGC TGCATTCCAG CGTGAGCTGG GCGCAATGGG CTACAAGTTC
CAGTTTGTGA CGCTGGCCGG CTTCCATACG CTTAACTATA GCATGTTTGA TTTGGCCCGG
AAGTATCGTG ATCACGGTAT GGCGGCGTAC AGTGAGTTGC AGCAAGCGGA GTTTGCCGCT
GAAGCGTTCG GCTACACAGC CACCCGCCAT CAGCGGGAGG TCGGTACCGG TTACTTCGAC
GAGGTAGCGC AGGTGATCGC CGGTGGTGAG ATCAGTACCA CGGCACTGAC CGGAAGCACC
GAGGAAGAGC AGTTCCATTA G
 
Protein sequence
MHMDRAAQIK QIADSWNTPR FAGIVRPYTA EDVYRLRGSV QIEYTLARMG AERLWDLLHT 
EPYINALGAL TGNQAMQQVK AGLKAIYLSG WQVAADANLA GQMYPDQSLY PANSGPQLVR
AINNALRRAD QIYHSEGRND IYWFAPIVAD AEAGFGGPLN VFEIMKAYIE AGAAGVHFED
QLASEKKCGH MGGKVLIPTQ AAIRNLVAAR LAADVMGVPT LIIARTDANA ATLLTSDIDE
RDRPFCTGER TSEGFYRVRA GLDQAIARGL AYAPYADMIW CETSEPNLEE ARRFAEAIHA
QFPGKLLAYN CSPSFNWKKK LDDATIAAFQ RELGAMGYKF QFVTLAGFHT LNYSMFDLAR
KYRDHGMAAY SELQQAEFAA EAFGYTATRH QREVGTGYFD EVAQVIAGGE ISTTALTGST
EEEQFH
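
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It builds the codon map from the standard NCBI amino-acid string (table 11 assigns the same amino acids as the standard code and differs only in which start codons are permitted), ignores the spaces used to group bases in blocks of ten, and stops at the first stop codon. The `translate` helper is hypothetical; it does not handle ambiguous bases such as N and does not convert alternative start codons to M, which is not needed here because the gene starts with ATG.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G, with the first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCACATGG ATCGTGCCGC ACAAATCAAA"))  # prints "MHMDRAAQIK"
```

The printed residues match the first block of the protein sequence in this record.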