Gene Cagg_3333 details


Gene Information

Locus tag: Cagg_3333
Symbol:
ID: 7267073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4044897
End bp: 4046462
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 57%
IMG OID: 643568145
Product: delta-1-pyrroline-5-carboxylate dehydrogenase
Protein accession: YP_002464616
Protein GI: 219850183
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative
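
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that reading matches the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4044897
END_BP = 4046462
GENE_LENGTH_BP = 1566
PROTEIN_LENGTH_AA = 521


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1566
print(expected_protein_length(GENE_LENGTH_BP))  # 521
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.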


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000027969
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCTGACCC CCTTTCAAAA CGAACCGTTT GGCAATTTTG ACAGCGGCAA CCGTCGTGCG 
GCAATGCAAC GCGCGATCCG GCACGTCGCC GGGCAACAGG GAGGGACCTA CCCGCTGGTG
ATCGGTGGCG AGCACATCAT CACCGAGCGC AAGCTGGCCT CGATCAACCC GGCTGAGCCG
AAGAAGGTCG TTGGTTATGT AAGTAGTGCC TCACAAGAGC ATGCACATCA GGCGTTGTTG
GCGGCTGATG CTGCCTTTCG GACATGGTCG CGCACACCGG TGAGCGCGCG GGCGCAGGTC
TTGCTGCGTG CTGCGGCAAT TATGCGTCGG CGTAAAGAAG AGCTGGCGGC GTGGATGATG
CACGAGGTGA GTAAGAACTG GGTTGAGGCC GATGCTGATG TGGCCGAAGC GATTGACTTT
TGCGAGTGGT ATGCGCGGCA AGCGCTCGCG TTGCAAGGCG AGCGGCAACC GCTCGTGCCC
TATCCCGGCG AATTTAATGA GTTGCGCTAC ATTCCGCTGG GGCCGGGGTT GGCGATTCCG
CCGTGGAATT TTCCGCTTGC TATCACAACA ACGCTGACCG TTGCCCCGAT TGTGGTTGGC
AATACGGTGG TGCTCAAACC ATCACCGCGA GCACCGGTGA TGGCTAATCT GTTGGTACAG
ATTTTGGAAG AGGCCGGCTT GCCGCCGGGA GTAGTCAATC TCGTCACCGG TGAAGATGCG
GTGATCGGTG ACTTTCTGGT TGATCACCCA CTGGTGCGAT TCATCGGCTT CACCGGCTCG
AAGAATGTTG GGTTGCGTAT TCAGCAGCGC GCCGCTGTAC GCCAGCCCGG CCAGAATTGG
CTTAAGCGCG CGATCCTCGA AATGGGTGGT AAGGACGCGA TCATCGTTGA TGAGACGGCT
GATCTCGAAG CAGCGGCGAC CGGAATTGTC GTGAGCGCCT TTGGCTTTCA GGGCCAAAAG
TGCAGCGCCT GCTCGCGGGC AATTGTGGTT GAACAGGTTT ACGATCAGGT TTTGCAGCGG
GTGGTTGAGA AAGCGAAAGC TTTGCGTCTA GGTAATCCGA CAAAACCTGA GACCGATATG
GGTGCAGTGA TCGACCAGCG CGCCTTCGAT TCGATCAGCC AGTATATTGC GATTGGGCAG
GAAGAAGGTC GCTTAGTCTG CGGTGGTGAG GTGATCGATC ACGAGCTGGT CAAGGATGGT
GGTTTCTTCA TTAATCCGAC CATCTTCGCC GATGTTAAGC CGCATGCCCG CATTGCCCAA
GAAGAGATTT TCGGGCCGGT GCTGGCGTTT ATCCGCGCCG CCAACTTCGA CGAAGCCCTA
GCCATCGCCA ACGATACTGA ATATGGCCTG ACCGGCGGTT TGTACAGTCG TAGTCTTGAG
CGGTTGGAGC GTGCGCGTGA GGAGTTTCAC GTCGGTAATC TGTACTTCAA CCGTAAGTGT
ACGGGTGCGT TGGTGGGTGT CCAGCCGTTC GGTGGCTTTA ATATGTCGGG TACCGATAGT
AAGGCCGGTG GGAGTGATTA TTTGCGCTTG TTCACTCAGC CAAAGGTGAT CAGCGAGCGG
TTTTAG
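
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be joined before its length or GC content can be checked against the fields in the record. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first display line is used as sample input. The 57% GC content in the record refers to the whole gene, so a single line need not give the same value.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence above.
first_line = "ATGCTGACCC CCTTTCAAAA CGAACCGTTT GGCAATTTTG ACAGCGGCAA CCGTCGTGCG"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases on one display line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Passing the full sequence text through `clean_sequence` should give a string whose length equals the Gene Length field, which is a quick way to detect a truncated copy.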
 
Protein sequence
MLTPFQNEPF GNFDSGNRRA AMQRAIRHVA GQQGGTYPLV IGGEHIITER KLASINPAEP 
KKVVGYVSSA SQEHAHQALL AADAAFRTWS RTPVSARAQV LLRAAAIMRR RKEELAAWMM
HEVSKNWVEA DADVAEAIDF CEWYARQALA LQGERQPLVP YPGEFNELRY IPLGPGLAIP
PWNFPLAITT TLTVAPIVVG NTVVLKPSPR APVMANLLVQ ILEEAGLPPG VVNLVTGEDA
VIGDFLVDHP LVRFIGFTGS KNVGLRIQQR AAVRQPGQNW LKRAILEMGG KDAIIVDETA
DLEAAATGIV VSAFGFQGQK CSACSRAIVV EQVYDQVLQR VVEKAKALRL GNPTKPETDM
GAVIDQRAFD SISQYIAIGQ EEGRLVCGGE VIDHELVKDG GFFINPTIFA DVKPHARIAQ
EEIFGPVLAF IRAANFDEAL AIANDTEYGL TGGLYSRSLE RLERAREEFH VGNLYFNRKC
TGALVGVQPF GGFNMSGTDS KAGGSDYLRL FTQPKVISER F