Gene Cagg_0026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0026 
Symbol 
ID7269023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp41728 
End bp42792 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content59% 
IMG OID643564899 
ProductMembrane dipeptidase 
Protein accessionYP_002461415 
Protein GI219846982 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACCG CTGCTTGGCC CCTCATCTTT GATGGTCACA ATGACGTCAT TCTCGAACTC 
TATCGGCCAC AACCAGGTAA AGAACGCGAT TTCTTTCATC CTAGTCCGCA TGGGCATCTC
GATCTCCCCC GCGCCCGGGT CGGCGGTTTC GGCGGTGGCT TCTTTGCGGT CTACGTCCCA
CCACCTACTT CTTCGCCGAC GGTCGATCGC CTCCCCGAAC CACCGTATCA TTTGCCCCTT
CCTCCTGCCC TCGATCCGAC CTACGCCCTG CGCACGACCC TCGCAATGGC AGCTCGCCTG
TTTCGGATTG AAGCCGAATC CCAAGGGGCG CTGCGCGTCT GTCGGACTGC CGGCGACATT
GCCGATTGCC TCGCCAACAA CATCATTGCT GCAATCTTCC ACATCGAAGG CGCGGAAGCC
ATCGGCCCCG ACCTCGACGA GCTTGAAGTG CTCTATCAGG CCGGCCTGCG TTCGCTCGGC
CCGGTCTGGA GTCGTCCCAA TATCTTTGGG TATGGCGTAC CGTTTGCCTT CCCTGCCTCA
CCTGACATCG GCCCTGGCCT CACCGAGGCC GGCAAGGCAT TGGTGAAGGC GTGCAATCGG
CTGCGTATCA TGATCGACCT CTCGCACCTT AACGAGGCCG GTTTTTGGGA TGTGGCCCGT
TTGAGCGACG CACCGTTGGT AGCCACCCAT TCCAATGCGC ACGCCATCTG CCCGAGCAGC
CGTAATCTGA CCGACCGACA GCTTGACGCG ATCCGCGATT CTGGTGGGAT GGTCGGGTTG
AATTTCGGCG TCACCTTTCT GCGCCCCGAT GGTAAGCGTG ATGCAACGAT GCCACTCAGC
GTGATGGTGC AGCAGATCAG CTATCTTGTC GAGCGGTTGG GGATCGATCA CGTTGGTTTT
GGCTCCGATT TCGATGGCGC ACTGATACCG AAGGAGATCG GTGATGTACG TGGCTTGCCT
CGTCTGTTGC AGGCCCTGAC CGACGCCGGC TTTACCTCTG CCGAGATACG AAAGCTAGCT
TATGAAAACT GGTTGCGTGT TTTGCGATTA ACGTGGGGAG AATAG
 
Protein sequence
MTTAAWPLIF DGHNDVILEL YRPQPGKERD FFHPSPHGHL DLPRARVGGF GGGFFAVYVP 
PPTSSPTVDR LPEPPYHLPL PPALDPTYAL RTTLAMAARL FRIEAESQGA LRVCRTAGDI
ADCLANNIIA AIFHIEGAEA IGPDLDELEV LYQAGLRSLG PVWSRPNIFG YGVPFAFPAS
PDIGPGLTEA GKALVKACNR LRIMIDLSHL NEAGFWDVAR LSDAPLVATH SNAHAICPSS
RNLTDRQLDA IRDSGGMVGL NFGVTFLRPD GKRDATMPLS VMVQQISYLV ERLGIDHVGF
GSDFDGALIP KEIGDVRGLP RLLQALTDAG FTSAEIRKLA YENWLRVLRL TWGE