Gene Cagg_1372 details


Gene Information

Locus tag: Cagg_1372
Symbol:
ID: 7268664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1695572
End bp: 1697605
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 57%
IMG OID: 643566215
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_002462715
Protein GI: 219848282
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
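
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1695572, 1697605)
print(length, "bp")                      # 2034 bp
print(protein_length_aa(length), "aa")   # 677 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.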


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.330488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.102589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAATC CAACCCCTAG TCGCTTCACG ATTGATGATC TCTACGAGCT GGGCTGGCTG 
GAAGATCCGC GTCTTAGCCC CGATGGACAG ACTGCAGCGG TTGTGTGGGT AACGGTTGAC
CGGGTCAACA ATGGGTATCG GCGGCAGATT GTGTTGGTAC CGACGAATGG CGGTTCACTG
CGACGCTTTA CACGCGGGAA ACACGATCGT CAGCCGCGCT GGAGCCCAGA TGGAAGATGG
TTGGCGTTTG TGTCACATCG CGACGATGAA CGCGGCCAAA TCTATCTGAT CCCGGTTGAT
GGTGGTGAAG CGCGGCAATT AACGGCAATG CCCAATGGCG CCAGCGATCC GGCTTGGAGT
CCTGATGGTC GGTGGATCGC GTTTTTATCA CCGGTGAGTG TTGACGAACA GGCGCGTGAG
GATGCCGGTG AGATGCCGTC GCCGCCGGCG GATGCGTGGG AAGCCCGTCG TGCTCGTGAG
CAGCGTCAGC ACGATGAGGA GCTTCGGATT GATCCACGGG TGGTGACAAA ACTACCGTAT
CGGAGTGGGA CCAGCTATTT CGATGATCGG TGGCGGCAGG TGTATGTTGT AGAGGTGGAC
GATGAAGATC GCACCGCCAC ACCTCGCCGG CTTACTTCGG GTGAAATTCA CTACAGTACA
CCGGTTTGGC TGCCGAACGG TGAAGCACTC CTCAGTACGG CGACGCGCGA TCCGGAAGCC
GATTCGCTGT TCGCTTATTA CGATGTTGTG CGTATTCCGC TTGATGGATT GCCCCATGCA
TTGACGAGTC CGGGCTTCTC GTACTTCGAT CCGCAGCCTT CGCCTGATGG CAGCCAGATT
GCGTTCTTAC GCCTCAATGA AGAGCGATTG CTCGGTGAAG GTCGGCGAGT CGCGATCATT
CCGGCGGAAG GTGGTGAACC GCACGACCTC ACGGCCCATA CCGATCTGAA CGTTGAACAA
TTCCGCTGGC AGCCCGACGG TCAGGGGATA CTGTTTAGTG CCGGATGGCG CGGCGATGCT
CATGTCTATC AGATCGGTCT TCCAGGCACA CCGACCTATC GTAATGGATT GACGTTGGTC
GGTGGGGCGC GGTTGGTCAG CGAGTTTGAT GTAGGGCGTG ATGGGAGTAT CGTCTTTATT
GCCGGGAGTG CTGATAATCC GTGCGATCTC TTCTTCCGTA GCGCTGATGG TCACGAGCGA
CGATTGACAG CGATCAATGA TCGGTTGCTT CAGCAACGGA TTATTGTGCC GATGGAAGAG
ATGACGTATC TTTCCCCTGA TGGTAGTGAG GTGCAGGGAT GGACGCTGCA TCCACCGGAT
TTCAATCCGA TGCAGCGTTA TCCGCTTGCG GTGTACATCC ATGGCGGGCC GCATGTGATG
TGGGGGCCTG GTTTTCGCTC GATGTGGCAT GAATGGCAAG TTGCAGCAGC GCGCGGATAT
GTGGTCTTCT TCTGTAATCC GCGGGGTAGT GAGGGGTATG GTGAGCTGTG GCGCGATGCA
ATTCGGCGTA ATTGGGGCGA GGCGGATGCA CCCGATATTC TGGCCGGAAT CGATGCGCTG
GTGGCACGTG GGTATATCGA TCCCAACCGG ATTGCCGTGA CCGGTGGTTC GTATGGTGGG
TATATGACGG CCTGGCTGAT CGGGCACGAT GACCGGTTTG CCTGTGCGGT TGCTGCTCGT
GGCGTATATA ATCTGCTGAC GTTACATGGT ACGAGTGACG CTCACGAGTT GATCGAAATC
GAGTTTGGTG GGTATCCGTG GGAGTTGTAC GAAGAGTTGT GGGATCATTC ACCATTAGCG
CACGCACACA AGATCAAAAC GCCGTTGCTG CTCTTGCATA GCGAGCTTGA TTACCGAGTG
CCGATTAGTG AAGCGGAGCA GCTCTTTGCC ATCCTCCGTC GTCAAAAGAA GGTCGTGGAG
TTGGTACGGT ATCCGCGCGA AGGTCATGAG CTGACGCGCA GCGGTGAACC ACGTCACCGT
GCCGATCATA TGCGACGGAC GCTTGAGTGG TTTGATCGGT ATTGTCAGGT GTAG
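
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC fraction like the one reported in the GC content field. The helper names are hypothetical. The example input is only the first two blocks of the sequence, so its result is not the gene-wide value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGACAAATC CAACCCCTAG ")
print(fragment)               # ATGACAAATCCAACCCCTAG
print(gc_fraction(fragment))  # GC fraction of this fragment only
```

Passing the full gene sequence through the same two functions would give the gene-wide GC fraction to compare with the reported percentage.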
 
Protein sequence
MTNPTPSRFT IDDLYELGWL EDPRLSPDGQ TAAVVWVTVD RVNNGYRRQI VLVPTNGGSL 
RRFTRGKHDR QPRWSPDGRW LAFVSHRDDE RGQIYLIPVD GGEARQLTAM PNGASDPAWS
PDGRWIAFLS PVSVDEQARE DAGEMPSPPA DAWEARRARE QRQHDEELRI DPRVVTKLPY
RSGTSYFDDR WRQVYVVEVD DEDRTATPRR LTSGEIHYST PVWLPNGEAL LSTATRDPEA
DSLFAYYDVV RIPLDGLPHA LTSPGFSYFD PQPSPDGSQI AFLRLNEERL LGEGRRVAII
PAEGGEPHDL TAHTDLNVEQ FRWQPDGQGI LFSAGWRGDA HVYQIGLPGT PTYRNGLTLV
GGARLVSEFD VGRDGSIVFI AGSADNPCDL FFRSADGHER RLTAINDRLL QQRIIVPMEE
MTYLSPDGSE VQGWTLHPPD FNPMQRYPLA VYIHGGPHVM WGPGFRSMWH EWQVAAARGY
VVFFCNPRGS EGYGELWRDA IRRNWGEADA PDILAGIDAL VARGYIDPNR IAVTGGSYGG
YMTAWLIGHD DRFACAVAAR GVYNLLTLHG TSDAHELIEI EFGGYPWELY EELWDHSPLA
HAHKIKTPLL LLHSELDYRV PISEAEQLFA ILRRQKKVVE LVRYPREGHE LTRSGEPRHR
ADHMRRTLEW FDRYCQV
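
The protein sequence can be related to the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method. It builds a codon table from the standard genetic code, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in which codons may act as start codons, and that is not handled here). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display whitespace
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final 15 bases of the gene sequence above.
print(translate("ATGACAAATC CAACCCCTAG TCGCTTCACG"))  # MTNPTPSRFT
print(translate("TATTGTCAGG TGTAG"))                  # YCQV
```

The two outputs match the first ten and the last four residues of the protein sequence in the record.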