Gene Cagg_3782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3782 
Symbol 
ID7267856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4615108 
End bp4616706 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content56% 
IMG OID643568590 
Productmalate synthase 
Protein accessionYP_002465054 
Protein GI219850621 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0512216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC CCCACTTTCC GGCCGGTGTG ACCATCACAG CGCCCATCAC TCCCGAGTAC 
GCCGAGATCC TCACCCCTGA AGCTTTAGAG TTCCTCGCTA CCCTACATCG TCGCTTTAAT
GCACGTCGGC TCGAATTACT CGCCCGCCGT GCCGAACGGC AACGAGCCAT CGACGCCGGT
GAACGGCCTG ATTTTCTTCC TGAAACGGCC CATATCCGCG AAAGCGATTG GACGATAGCC
CCCTTCCCCC CGCAACTCAA CGACCGCCGC GTTGAAATTA CCGGTCCGGT TGACCGCAAG
ATGATTATCA ATGCGCTCAA CTCAGGGGCG AAGGTCTTTA TGGCCGACTT TGAGGACGCG
AATACGCCAA CATGGCAAAA CCAGATCGAA GGTCAGATCA ATTTGCGTGA TGCTCTTCGC
CGGACGATTA CCTATACCAG TCCTGAAGGC AAGTATTACG CACTCAACCC CAATCCGGCG
ATCTTGTTCG TCCGCCCACG CGGCTGGCAT TTGCCGGAAA AGCATATGCT GGTCGATGGT
GAGCCAATCG CCGGTGCCAT CTTCGATTTT GGCCTCTATT TCTTCCACAA TGCACAAACG
GCTATCGAGG TACAGGGCGG CCCGTATTTC TACCTACCTA AGCTCGAAAG TCATCTTGAA
GCGCGGCTGT GGAACGACAT TTTTGTGCTG GCCCAAGAAT TGCGCGGTAT TCCACGAGGG
ACGATTAAAG CGACCGTGCT GATCGAAACG ATTCTGGCCG CGTTTGAGAT GGACGAGATT
CTCTACGAGC TACGTGAACA CTCAGCCGGC CTCAACTGTG GACGTTGGGA CTATATTTTC
TCGTGCATTA AGAAGTTTCG CAATGACCCC AATTTCTGTC TCGCCGACCG TGTGTTGGTT
ACGATGACAA CGCACTTTAT GCGTTCGTAC TCGCTCCTCG CGATTAAGAC TTGTCATCGG
CGCGGGGCGC ATGCGATGGG GGGAATGGCC GCGCAAATCC CGATTAAGAA CGACCCGGTC
GCAAATGAAG CAGCGTTAGC CAAGGTGCGT GCCGATAAGG AGCGTGAGGC AAACGATGGG
CATGACGGTA CCTGGGTGGC TCACCCCGGC CTCGTCCCTG TCGCGATGGA GGTCTTCGAC
CGCCTGATGC CCACTCCCAA TCAGATTAAC CGCCAACGCG ATGATGTCCA CGTGACGGCC
GCCGATCTGC TGGCCTTTGG TCCGAGTGAA CCGATTACCG AGCAGGGACT ACGGCTCAAC
ATCAATGTCG GGATTCAATA CCTCGGCGCA TGGTTGGCCG GTAATGGGTG TGTGCCGGTC
TTTAACCTGA TGGAAGACGC GGCGACTGCT GAAATATCGC GCGCTCAGAT CTGGCAGTGG
ATCCGCAGCC CGAAGGGAGT GTTGGCGGAT GGTCGTAAAG TGACCGTGGA ACTGTTCCGG
CAAATGCTTC CCGAAGAACT GGCGAAGGTG CGCGAGATTC TCGGCCCGGC CTACGAAGAT
GGTCGCTACG GAGAGGCCGC CGAACTGTTC GATGAAATTA CCACCGACCC AAACTTCGTT
GAGTTTCTGA CGTTGCCGGC ATATGACCGT ATTCCGTAA
 
Protein sequence
MTTPHFPAGV TITAPITPEY AEILTPEALE FLATLHRRFN ARRLELLARR AERQRAIDAG 
ERPDFLPETA HIRESDWTIA PFPPQLNDRR VEITGPVDRK MIINALNSGA KVFMADFEDA
NTPTWQNQIE GQINLRDALR RTITYTSPEG KYYALNPNPA ILFVRPRGWH LPEKHMLVDG
EPIAGAIFDF GLYFFHNAQT AIEVQGGPYF YLPKLESHLE ARLWNDIFVL AQELRGIPRG
TIKATVLIET ILAAFEMDEI LYELREHSAG LNCGRWDYIF SCIKKFRNDP NFCLADRVLV
TMTTHFMRSY SLLAIKTCHR RGAHAMGGMA AQIPIKNDPV ANEAALAKVR ADKEREANDG
HDGTWVAHPG LVPVAMEVFD RLMPTPNQIN RQRDDVHVTA ADLLAFGPSE PITEQGLRLN
INVGIQYLGA WLAGNGCVPV FNLMEDAATA EISRAQIWQW IRSPKGVLAD GRKVTVELFR
QMLPEELAKV REILGPAYED GRYGEAAELF DEITTDPNFV EFLTLPAYDR IP