Gene Cagg_3355 details


Gene Information

Locus tag: Cagg_3355
Symbol:
ID: 7267095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4066524
End bp: 4068002
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 57%
IMG OID: 643568164
Product: Malate dehydrogenase (oxaloacetate-decarboxylating)
Protein accession: YP_002464635
Protein GI: 219850202
COG category: [C] Energy production and conversion
COG ID: [COG0281] Malic enzyme
TIGRFAM ID:
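
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 4066524
END_BP = 4068002


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1479
print(protein_length(length_bp))  # 492
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.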


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000240183
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCGTCG GTGTGACGGG CATTTTTGTG TATCTTGAGC CGGAAAACGA GGTGAATGGT 
ATGTCCGTTG GGGTGGCTTA TACGCTGACC TTGCGCTGCC AAATTGAAAA TCGGCCCGGT
ATGTTGGGCC GGCTCACGAC CCTGATCGGC GAGGTCGGTG GCGACATCGG CGCGATCGAT
ATTGTTCGCG CTGAACGTAA CTTTCTTGTG CGCGATATTA CTGTGCGCGT ACAAGATGAA
CAACACGGTG AGCAACTGGT AGCCGCGATC AACACGCTGC AAAACATTAA AGTGCTTCAG
GTGAGCGACC GTGTCCTCCT CACCCACCTC GGCGGTAAGC TTACTACCCA GAGCCGGGTT
CCGCTAAAAA CCCGTGATGA TCTCTCCCTG GCCTACACGC CCGGCGTGGC TCGTGTCTGC
CGTGCAATCG CCGACGATCC TGAAAAGGTC TATTCACTCA CGTGGAAAGG GAATAGCGTT
GCCGTGGTCA GCGATGGCTC GGCGATCCTC GGTTTGGGCA ATCTTGGCCC CGAAGCGGCG
ATGCCGGTGA TGGAAGGCAA GGCGATTCTG TTTAAAGAGC TGGCCAATAT CGATGCGGTA
CCGATTTGTC TCCGCTCACA AGACCCCGAC ATTATCGTGC AGACGGTTGA ACAGATTGCG
CCAAGCTTCG GCGGCATCAA TCTCGAAGAT ATTGCCGCTC CCAATTGCTT TATCGTCGAG
GGCCGACTGG AAGAGAGCCT TGATCTGCCG GTGATGCATG ACGATCAGCA CGGCACTGCA
GTGGTCGTGC TGGCGGCGTT ACGCAATGCG CTCCGTCTTG TCGGTAAAGC GTTGTCCGAT
GTCCGTGTGG TCATCAACGG TGTCGGTGCT GCCGGCACTG CCATTATTCG CACACTGCTT
GAGGCCGGCG TCGGCGAGAT TACGGCGGTG GATCGGTTCG GGATTTTGGT CGAGGGCGAT
GATGCCCGTC AGACGCCAAT GCAACGTATT ATCGCTTCCC TCACCAACCG TGAGCGTCGG
CGGGGCGATT TGGCCGTCGC TCTCCGCGGC GCCGATGTGT TCATCGGGGT GTCGCGCGGG
AATATTCTCA CCACCGACCA CGTGCGCTCG ATGAGTGCTG ACCCAATAGT GTTTGCGTTG
GCCAATCCTA TTCCTGAGGG CGATCCCGAC ATGCTGCGTC AGTACGCGCG TGTGGTTGCA
ACCGGACGGA GCGATCAGCC AAATCAGATC AATAATGTGC TCAGTTTTCC CGGTATCTTT
CGGGGTGCGC TTGATGTTAA TGCCCGTCGC ATCACCTCGG CGATGCGCCT TGCTGCTGCC
GAAGCGCTGG CAAATGCCGT TCCCATCGAG GAGATGAACG AAGACTATAT CGTGCCGAGT
GTCTTTAATC GTCAGATTGT GCCGGCCATT GCACATGCGG TGGCACAGGC TGCCATCGCC
GATGGAGTGG CGCGTCGTAA TCACTTGCCG GGTGATTAG
 
Protein sequence
MPVGVTGIFV YLEPENEVNG MSVGVAYTLT LRCQIENRPG MLGRLTTLIG EVGGDIGAID 
IVRAERNFLV RDITVRVQDE QHGEQLVAAI NTLQNIKVLQ VSDRVLLTHL GGKLTTQSRV
PLKTRDDLSL AYTPGVARVC RAIADDPEKV YSLTWKGNSV AVVSDGSAIL GLGNLGPEAA
MPVMEGKAIL FKELANIDAV PICLRSQDPD IIVQTVEQIA PSFGGINLED IAAPNCFIVE
GRLEESLDLP VMHDDQHGTA VVVLAALRNA LRLVGKALSD VRVVINGVGA AGTAIIRTLL
EAGVGEITAV DRFGILVEGD DARQTPMQRI IASLTNRERR RGDLAVALRG ADVFIGVSRG
NILTTDHVRS MSADPIVFAL ANPIPEGDPD MLRQYARVVA TGRSDQPNQI NNVLSFPGIF
RGALDVNARR ITSAMRLAAA EALANAVPIE EMNEDYIVPS VFNRQIVPAI AHAVAQAAIA
DGVARRNHLP GD
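
The two sequence listings can be checked against each other by translating the gene sequence. The following is a minimal sketch, not the tool that produced this record. It assumes the sequence is given 5' to 3' on the coding strand and uses the amino acid assignments of translation table 11, which match the standard genetic code (the tables differ in which start codons are permitted). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, keeping '*' for stops."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence listed above.
first = translate("ATGCCCGTCG GTGTGACGGG CATTTTTGTG TATCTTGAGC CGGAAAACGA GGTGAATGGT")
last = translate("GATGGAGTGG CGCGTCGTAA TCACTTGCCG GGTGATTAG")
print(first)  # MPVGVTGIFVYLEPENEVNG
print(last)   # DGVARRNHLPGD*
```

The translated fragments match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAG stop codon that is not counted in the protein length.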