Gene Cagg_1310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1310 
Symbol 
ID7268601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1610346 
End bp1612154 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content58% 
IMG OID643566153 
Productoligoendopeptidase F 
Protein accessionYP_002462654 
Protein GI219848221 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR00181] oligoendopeptidase F 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0118183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000404795 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCTTAT CTGCGAACAG TATTCCCACA CGGGCCGAAA TACCGGTCGA ATATACGTGG 
GATCTATCTC AAATCTTTGC CGATGTCCCC GCGTGGGAAC AAGAACGTAG CGCGGTTGAA
GCGCGTGCGC AAGAGTTAGC CGCCTTGCAA GGGACGCTGG CGCAGGGGCC GGCGCAGTTG
CTGGCAGCAT TGACGTTACG TGATGAGGTG GCGCAGCGAC TGCATGCGTT GTATGTCTAT
GCTCTCCATC GCAAGGATAG CGATGGTACC GATCCGGTAG GGCAGGGTTT GGCCGAGCGG
GCCGGTAGTT TTGCGGCTCG GATACAAGCG GTGCTGGCGT TTATTGAGCC AGAGATTTTG
ACGATCCCGG CGGAAACGCT GGACGAGTGG TTGGCGGCTA CCCCCGGCTT GCAAGTGTAT
CGCTATGCCT TAGAGAAGCT GAACCGCCAG CGGGCGCATA TCCGCTCGGC TGAGGTTGAG
CAGGTGATGG CAGCGTTGAG CGATATTGTC CGTGCGCCAT ACGCTACGTT CTCGATGTTG
ACCGACGCGG ATTTGCAATT TCCGACAATT GAGGATGAGC AGGGTCAGCC GGTGAAGTTG
TCACATGCGC GCTATGGTCG TTTGCTCGAA AGCCATGACC GGCGGGTGCG GCGTGATGCG
TTCAAGGGGT ACTACAGCGC GTTTTTGCCC TTCCGTAACA CGCTTGCCAC CACTCTCGGC
GCGGCGATCC GCTCGCACGT GATCGAGGCC CGGTTGCGCA ATTACGGATC GGCGCTAGAG
GCGGCGCTTG CTCCGAATGA AATTCCTGTC GAGGTGTACC ATAACCTGAT CGCGACCGTT
GAGGCTAATT TGCCGCGGTT TCATCGCTAT TTGACCGTGC GGCGACGCCT CATGGGTTTA
GATGACTTGC ATTTCTACGA TCTCTATGTG CAGCCAGTGC CCGATGTGGA AATGACCATT
CCCTACCGTG AGGCGTGTGA TCTGATGCGT GAGGCGTTCC GTCCGCTCGG CCCTGAGTAT
GGTGCGGCGC TCGATCAGAT GTTTACGCGG CGTTGGATCG ATGTGTATGA GAATGTGGGG
AAGCGGAGTG GTGCCTATAG CGGCGGTTCG TATGGGACGC CGCCCTACAT CTTGCTCAAC
TACCAAGACC GGCTGCGTGA TGTCTTTACC CTCGCCCACG AATTGGGCCA CTCGCTTCAT
TCGTACTTCA CCCGCGCCAC TCAGCCGTTC GTCTATGGCG AGTACACCAT CTTCGTCGCC
GAAGTGGCTT CGACGCTCAA CGAGGCGCTG CTGACCCACT ACATGTTGCA AAGCGGTGCT
GATGAGGCGT TGCGGCGGCG GTTGCTGGCC CAGCAGATCG AAGAGATTCG CGGTACTATC
TTCCGCCAGA CGATGTTTGC CGCCTTCGAG CTGTGGATGC ATGAGCAAGC CGAGCGTGGT
CAACCTCTCA CGGCTGATGC GCTGAGCCAG CATTACCGTG AGTTGGTTGT GCGGTATCAC
GGACCTGAGT TGGTGATCGA TGATGAGCTG GCGTATGAGT GGCTGCGCAT TCCGCACTTC
TACTATCAGT TCTACGTGTA TCAGTATGCG ACCGGCTTGT CGGCAGCCCT GGCGCTGAGC
CGCCAGATTA TCAACGAGGG CCAGCCGGCG GTTGAACGGT ATCTGCGGTT CTTGCGCAGC
GGTTCGTCGC GGTCGTCAAT CGATCTGCTG CGCGACGCCG GTGTTGATAT GACCTCGCCG
GCGCCGATTC AGGCCGCGAT GGATACGTTT GCTGAATTGG TCAGCCAATT GGAACAGTTG
GCACCGTAA
 
Protein sequence
MTLSANSIPT RAEIPVEYTW DLSQIFADVP AWEQERSAVE ARAQELAALQ GTLAQGPAQL 
LAALTLRDEV AQRLHALYVY ALHRKDSDGT DPVGQGLAER AGSFAARIQA VLAFIEPEIL
TIPAETLDEW LAATPGLQVY RYALEKLNRQ RAHIRSAEVE QVMAALSDIV RAPYATFSML
TDADLQFPTI EDEQGQPVKL SHARYGRLLE SHDRRVRRDA FKGYYSAFLP FRNTLATTLG
AAIRSHVIEA RLRNYGSALE AALAPNEIPV EVYHNLIATV EANLPRFHRY LTVRRRLMGL
DDLHFYDLYV QPVPDVEMTI PYREACDLMR EAFRPLGPEY GAALDQMFTR RWIDVYENVG
KRSGAYSGGS YGTPPYILLN YQDRLRDVFT LAHELGHSLH SYFTRATQPF VYGEYTIFVA
EVASTLNEAL LTHYMLQSGA DEALRRRLLA QQIEEIRGTI FRQTMFAAFE LWMHEQAERG
QPLTADALSQ HYRELVVRYH GPELVIDDEL AYEWLRIPHF YYQFYVYQYA TGLSAALALS
RQIINEGQPA VERYLRFLRS GSSRSSIDLL RDAGVDMTSP APIQAAMDTF AELVSQLEQL
AP