Gene Cagg_3455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3455 
Symbol 
ID7269680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4208909 
End bp4210642 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content56% 
IMG OID643568265 
Productglyoxylate carboligase 
Protein accessionYP_002464733 
Protein GI219850300 
COG category[R] General function prediction only 
COG ID[COG3960] Glyoxylate carboligase 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type
[TIGR01504] glyoxylate carboligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.152002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGCA TGAACGTGAT GGATGCGGTG ATCAACGTGT TAGAGAGCGA AGGTGTGCGT 
TACATCTTCG GTGTGCCGGG AGCAGCCATC TTACCGTTCT ACGACGCTCT GCGCAAAAGC
AAGCAGATTC GGCACCTCAT CGTGCGCCAC GAAGAGGGTG GCACGCACGC TGCCGACGGC
TATGCACGTG CGTCCGGCGA GGTCGGAATC TGTGTCGGCA CCAGTGGGCC AGCCGGTACC
AATATGATTA CCGGTCTCTA CACCGCAATG GCCGACTCGA TCCCGATCAT CTGCATCACC
GGTCAAGCAC GCACGGATGC TTTGCACAAA GAAGCGTTTC AGGCTGTTGA TATTGTCGAG
ATCGCCAAAC CGGTGACCAA ATGGGCCGTG CAGGTAAAGG AGGCAGCACA GATGCCGTGG
GTCTTTCGAC GCGCCTTTCA GATTGCCCGT GAAGGGCGTC CCGGACCGGT TCTGATCGAC
CTCCCGATCG ATGTACAAAA ACAAGCGATC GATTACGATG CGAGCCTGGA TGCCCCCTTA
CCGATATATC GCCCGGCGCC GTTACAATCA GCCCTGCGGC GGGCGATCGA GCTGCTCCTG
ACCGCTGAGC GACCGCTACT GATGTCCGGC GGCGGGGTGA TCATTGCCAA TGCCGCACCG
GAATTGGTCG CCTTAGCGGA ATATTTGCAA ATCCCTGTTT CACCGACGCT CATGGGTAAA
GGGGCAATTC CCGAAGACCA CCCTCTCTAC GCCGGCATTG TCGGTATTCA GACCCAACAG
CGCTTTGCGA ACGCAATTTT TCTCGAAAGT GATTGTATCC TCGCAATCGG GGCACGTTTC
GCCGACCGCC ATACCGGCCA ACTCGATATC TATCGCGGTA ATCGCACCTT TATCCACATC
GACATTGAGC CAACACAAAT TGGGAAGGTC TTCCATCCCG ATCTCGGTAT TGTTGGCGAT
GCCAAACTAG CCCTCCAAGG GCTATTGGCC GAGGCACACG ACTGCACACC ACCACGTACA
CCCGGTGCAT GGTTCGAGCG GGTGCAGTAT CTAAAACAGA CCCTGACCCG TCGCGACGAT
TTCGACGACG TTCCGATTAA AGCACCGCGA GTGTTCCGCG AGCTGAACGA GTATTTTGAT
CGTGACACCA TCTTCGTCAC TGCAATCGGG CTGTATCAGA TCTGGTCGGG TCAATTCCAA
AAGACCTACA AACCGCGCCA CTATATGGTG TGCGGTCAGG CAGGGCCACT CGGTTGGGAA
GTGAGTGCCT GCACCGGTGT CAAACTGGCC CGGCCCGACC AACAAGTGGT TGGTGTTGTG
GGAGACTACT CTTTCCAATT TTTGATGGAA GAGGTGGCAG TAGCAGTACA ATATCGTATC
CCGTTCGTGT TGGTGATGAT TAACAACACC TACATGGGCC TGATTCGTAT GGCCGAACTG
CCCTACCAAA TGAACTACGA AGTAAAGCTG GGATACGAGA CTGATACCGG TCAAACATTT
GGAATTGATC ACTGCAAAGT GATGGAAGCG ATGGGAGCGC TTGCCGTGCG TGTGACCACA
CCCGATCAGA TTCGACCGGC GCTCGATTGG GCAGTGCGCG AGAGCAACCA TCGCCGGTTA
CCGGTGCTCG TGGAAGTGAT GACGGAAGCC GAAGAAAATG CAGCTATGGG TACGCGCATT
GATCAGATTC GCGAATGGCA CCCACTGCCA GAGATGACGA GCACCACCAT TTAG
 
Protein sequence
MPRMNVMDAV INVLESEGVR YIFGVPGAAI LPFYDALRKS KQIRHLIVRH EEGGTHAADG 
YARASGEVGI CVGTSGPAGT NMITGLYTAM ADSIPIICIT GQARTDALHK EAFQAVDIVE
IAKPVTKWAV QVKEAAQMPW VFRRAFQIAR EGRPGPVLID LPIDVQKQAI DYDASLDAPL
PIYRPAPLQS ALRRAIELLL TAERPLLMSG GGVIIANAAP ELVALAEYLQ IPVSPTLMGK
GAIPEDHPLY AGIVGIQTQQ RFANAIFLES DCILAIGARF ADRHTGQLDI YRGNRTFIHI
DIEPTQIGKV FHPDLGIVGD AKLALQGLLA EAHDCTPPRT PGAWFERVQY LKQTLTRRDD
FDDVPIKAPR VFRELNEYFD RDTIFVTAIG LYQIWSGQFQ KTYKPRHYMV CGQAGPLGWE
VSACTGVKLA RPDQQVVGVV GDYSFQFLME EVAVAVQYRI PFVLVMINNT YMGLIRMAEL
PYQMNYEVKL GYETDTGQTF GIDHCKVMEA MGALAVRVTT PDQIRPALDW AVRESNHRRL
PVLVEVMTEA EENAAMGTRI DQIREWHPLP EMTSTTI