Gene Cagg_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2381 
Symbol 
ID7268733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2895237 
End bp2896979 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content57% 
IMG OID643567208 
ProductX-Pro dipeptidyl-peptidase domain protein 
Protein accessionYP_002463691 
Protein GI219849258 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.241656 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTGGA TCGGATGGTC TTTGGCAGGG GCGAGTGTAG CAGCGGCATC TGTCTTTACT 
GCTCGTCGTG CATTATTGGC GCGCTTGCTC GGTTTGCGCC CTGCCGAACA TCGTGTGACC
GTGACCCGCG ATGTGCCTAT TCGTATGCCT GACGGGGTAG TGCTCTACGC CGATCACTAT
GCACCGCAAA CCGGTGGCCC ATATCCGACC ATCCTCATCC GCACACCGTA TGGTCGCCCT
GGTGAGCTGG GGCCATTAGG GGTTTTTGAG CATACCGGCT GTCTCCTTTT TGCCGAACGT
GGTTACAACG TTATCGTGCA GAGTGTGCGC GGGCGTTATC GTTCGGGAGG TGTCTTTGAG
CCGTTTGTTA ACGAAGCTGC TGATGGGCGA GCTACTGTGG CTTGGATAGC CGAACAGCCG
TGGTTTGAAG GAAACCTTGG CTTGTGGGGA CCAAGTTACG TGGGGTATGT TCAGTGGGGG
CCGGCAATCG ATGGCCCTCC CTATCTTAAA GCGATTGTGC CGGTGGTGAC AAGCGCACGC
TTTTCTTCCC TCTTTTACCC CGGCGGTGCT TTCGCGTTCG AGTCAACGCT GCGTTGGGTC
TTTTTGATCG ATGCGACTAA TCGTCACCGT CAGTCCTTGA ATCCGGCAGC GTTATGGCGG
TTAATGGTGT TGCGCGAACG GATTTTGGCT CGTGCATTAC ACCATCAACC TTACGCCGAT
GCCGATCGGA TCGCTACCGG TGCGTCCGTT CCCTTTTTTC AAAGTTGGCT GAACGAGACC
GATCCACACG GTTCGTATTG GTCACAGGTT GATCAACATC GTGCTTTGCA CCGGATCAAT
GCTGCAGTAC ATTTGGTGGC CGGTTGGTAC GACATCTTCT TGCCCGGTCA GTTGGCCGAT
TACACGGCGC TGGTTGCTGC CGGTAAGCGA CCGTACCTTA CCGTGTTGCC TCGGGCGCAC
ACCGATCTGG CGCTCGTCTT TGAAGGGATG CGCGAGGGGT TGTGGTGGTT TGATGCCCAT
CTGAAAGGGC GCCGTGAGCT GCTCGCTCGT CGTCCGGTGC GAATTGCGTT GATGGGGAGC
AATGAGTGGC ACGAGATGGA TTTTTGGCCA CCACCGGCAG TAATGACGCG CTATTTTTTG
CAGCCCGCAG GTCGATTGGC TCGTGAGAAG CCACCTGCCG ATGGTGTCCC CAGCGTGTTT
CGGTATCATC CTGCCGATCC TACGCCTGCC ATCGGGGGGG CAGTGTTGAG TGCAAAGGCC
GGCCCACGCG ATCAGCGTCC GCTTGAATCT CGTCCGGATG TCTTAACCTT TACCAGTTCT
CCATTGGCGA ACGATCTTGA TGTGATCGGG CCGATTCGTC TCTGTCTCTA CGTATGTAGT
GAACATGAGC ATTTTGATCT CGTCGGTCGC CTCTGTGATG TCTATCCCGA TGGGCGGAGT
ATCAATATTT GTGACGGTAT TGTGCGGGTA CGGCCCGGTG TAGGTGAAGT GCAGCCCGAT
GGTTCACGTC GGATCGAGAT AGATCTGACG GCGACGGCTC AGCGCTTTCG AGTCGGTCAT
CGTCTGCGTG TGCAAGTCGC CGGTGGTGGT AGTCCACGGT GGGGGCCGCA CCCCGGAGAT
GATCGTCCGT ATGGGCAGGG ATGCGGGGGG CCGGTTTTGC TCCATCACAT TTGTCACGAT
GCCAACCACC CGTCGGCAAT CATTTTGCCG GTGGTTGATG CCGCTGTGCG TTGGGCTGCA
TAG
 
Protein sequence
MRWIGWSLAG ASVAAASVFT ARRALLARLL GLRPAEHRVT VTRDVPIRMP DGVVLYADHY 
APQTGGPYPT ILIRTPYGRP GELGPLGVFE HTGCLLFAER GYNVIVQSVR GRYRSGGVFE
PFVNEAADGR ATVAWIAEQP WFEGNLGLWG PSYVGYVQWG PAIDGPPYLK AIVPVVTSAR
FSSLFYPGGA FAFESTLRWV FLIDATNRHR QSLNPAALWR LMVLRERILA RALHHQPYAD
ADRIATGASV PFFQSWLNET DPHGSYWSQV DQHRALHRIN AAVHLVAGWY DIFLPGQLAD
YTALVAAGKR PYLTVLPRAH TDLALVFEGM REGLWWFDAH LKGRRELLAR RPVRIALMGS
NEWHEMDFWP PPAVMTRYFL QPAGRLAREK PPADGVPSVF RYHPADPTPA IGGAVLSAKA
GPRDQRPLES RPDVLTFTSS PLANDLDVIG PIRLCLYVCS EHEHFDLVGR LCDVYPDGRS
INICDGIVRV RPGVGEVQPD GSRRIEIDLT ATAQRFRVGH RLRVQVAGGG SPRWGPHPGD
DRPYGQGCGG PVLLHHICHD ANHPSAIILP VVDAAVRWAA