Gene Cagg_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1659 
Symbol 
ID7268961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2024485 
End bp2025768 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content59% 
IMG OID643566501 
Productimidazolonepropionase 
Protein accessionYP_002462996 
Protein GI219848563 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCCT GTGACCTGCT GATCCACTCG GCCACACAAC TCGTAACGTG TGCTGGGCCG 
CCCGGTTTGC GTCGTGGCCC GGCGATGCGC GAATTGGGAG TCATCCGCGA CGGAGCAGTC
GCTATTCGTG GATCGACCAT TGTGGCCGTT GGTCCTGGCA CCGATGTCCG CCGTCGCTTC
CGTGCGTCCC ACGAGATTGA TGCCCGCGGA CGGGCCGTGT GTCCCGGTTT GGTCGATTGT
CATACCCATA TCGTGTACGC CGGTGATCGG GTTGAGGAAT TTGAACAGCG CTGTGCCGGC
GCTACGTATC AAGAGATTAT GGCCGCCGGT GGTGGTATTT TACGCACCAT GCGGCTCACC
CGTGCGGCGA CAACTACCGA ACTGGTTCAT GCGGCACTAC CTCGCTTGCG GCAGATGTTG
TCGTTCGGGA CGACTACCGC CGAAGTGAAG ACCGGTTACG GTCTTGAACG CGACGCAGAA
TTACGTCAAT TGGCAGCTAT TGCGCTGCTT GATGCGGCAC AACCGATTGA GCTTGTCCCT
ACCTTTCTCG CAGCGCATGC GGTGCCACCA GAGTTTACCG GTCGAGCCGA TGACTACATT
GATCTGGTAG TCGAGTCGAT GTTGCCGCTC GCTCGCGACT GGTATGCTGT CTCATCATTC
GCTGCGCGCG CGATTCCGCT CTTCGTTGAT GTCTTCTGTG AGCGAGGTGC GTTCGATGTG
GCGCAGAGTC GGCGAGTGTT GGACGCAGCA CGCAGTTTGG GCCTACCGCG CAAAGCCCAC
GTCGATGAGT TTGTCGAGCT GGGTGGGCTG GCAATGGCGC TTGAACTGGG TGCCACGTCA
GTCGATCACC TCGATGTTAC CGGCCCGTCG GCCTTTACAG CACTGGCAGC CAGCTCGACC
GTCGCCGTCT TGTTACCGCT CGTCTCGCTC AATCTCGGTC TGAGCCATTT TGCTGCTGCA
CGGGCAATGA TCGATGCCGG CGTTGCCGTT GCGCTCAGCA CCGATGCCAA CCCCGGTTCG
GCGCCATCGC TGTCATTACC GTTGACAATG GCAATCGCCT GTCGCTACCT GCGCATGCTT
CCTGCCGAGA CATTGATTGC AACGACGGTC AACGCTGCCT ATGCGATCGG TCGCGGTGGG
CATGTTGGAG CATTAATGCC TGGTATGCAG GCCGATCTGC TCATCTTGGC CGCCGATGAT
TATCGCTGGC TGATGTATGA GTTAGGTGGA ATGCCGGTGG CACAGGTGAT CAAACGAGGG
CAGGTCGTAG TTACCAATGA GTAA
 
Protein sequence
MEPCDLLIHS ATQLVTCAGP PGLRRGPAMR ELGVIRDGAV AIRGSTIVAV GPGTDVRRRF 
RASHEIDARG RAVCPGLVDC HTHIVYAGDR VEEFEQRCAG ATYQEIMAAG GGILRTMRLT
RAATTTELVH AALPRLRQML SFGTTTAEVK TGYGLERDAE LRQLAAIALL DAAQPIELVP
TFLAAHAVPP EFTGRADDYI DLVVESMLPL ARDWYAVSSF AARAIPLFVD VFCERGAFDV
AQSRRVLDAA RSLGLPRKAH VDEFVELGGL AMALELGATS VDHLDVTGPS AFTALAASST
VAVLLPLVSL NLGLSHFAAA RAMIDAGVAV ALSTDANPGS APSLSLPLTM AIACRYLRML
PAETLIATTV NAAYAIGRGG HVGALMPGMQ ADLLILAADD YRWLMYELGG MPVAQVIKRG
QVVVTNE