Gene Cagg_0130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0130 
Symbol 
ID7266869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp178210 
End bp180024 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content55% 
IMG OID643565003 
Productoligoendopeptidase F 
Protein accessionYP_002461518 
Protein GI219847085 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR00181] oligoendopeptidase F 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000121565 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000025846 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGACTG TTCGCGAACG CAGTGAGATT CCAGAGCAGT ACAAGTGGGA TCCGTTTAGT 
ATCTTTCCTT CACAGGCTGC ATGGGAAGCG GCCATCGACG AGGTTAATAC GCTCATTGCA
CGCGCTGCTC AGTTTCGTGG TCGGTTACAC GAAGGCCCAC CCGTTATCGC CGACTTTCTT
GGCCTGAGCG AGACGATTAT GCGCAACGTC GGCCAAATTA CGGTCTTCGC CACGATGTTC
TATACCGTCG ATACCAATGA CCGTGAAGCG AGTGCAATGC GTGATCGGGC AATTGGGCTG
CAAGCCCGAG CAAGTGCAGC ATTGGCGTTT GGTGAGCCTG AGTTGTTGGC CATCGGCGCC
GATCAGTTGT TGACCTGGGC GGATCAAGAT GAGTATCTGG CAACCTACCG CCACTATTTT
GAACGCCTGA TCGCTCGTGC TCCTCATGTG CGTTCTGCCG AAGTGGAAGA GTTGTTGGGG
CAGGTACGCG ATCCGTTCGC TGCGGCAAGT GCAGTCCACG GTGTATTGGC CAATGCTGAA
TTACGCTTTC CCCTCGCCTA CGACAGCAAT GGTGAGGCGT ATGAGATTAC GCAAGGCACG
ATTAATGCAC TCATTACCCA TCCTGATCGC ACCTTGCGTA AGCAGGCGTG GGAAGGGTAT
GCCGACGCGC ACATTGCCGT CGAAAATACG ATGGCCCAGT GCTTAGCCAC CGGCGTCAAA
CAGAATGTCT TTCTTGCCCG TGCGCGTCGG TATGCCTCGG CTCTTGAAGC CGCACTGAAG
CCGAATTTTA TTCCACTTGA GGTCTTTCAC AACCTGATCG CTACGTTCGA GCGCCATTTA
CCGATCTGGC ACCGGTATTG GCGGGTACGT CGTGCGGCCC TCGGTGTTGA TGAATTGCAT
GTTTACGATA CCAAAGCACC GTTAGCGACC CCGCTTATCG TGCCTTATGA GCGAGCTGTC
GATTGGATCT GCGCCGGTAT GGCTCCGCTG GGCAATGAAT ATGTCCAGAT TATGCGGCGT
GGGTTGCGCG AACAGCGTTG GGTTGATGTC TATCCCAATC GGGGTAAGCG GGCCGGTGCG
TTCTCAACCG GTGCACCGGG CACCCACCCG TTTATTATGA TGTCGTACAA CGATGACATC
TTCAGCCTTA GTACCCTTGC CCACGAGTTG GGTCACTCGA TGCATTCGTA CTATACGCGG
CGTACCCAAC CGGTGATCTA TACCAACTAT GGTCTGTTCC TGGCGGAAGT AGCCTCGAAT
TTCAATCAGG CGTTGGTGCG CGCGTATCTG TTCCAAACGT TAACCGACCG CAATGCCCAG
ATCGGCTTAA TCGAAGAGGC GATGGCGAAC TTCCATCGCT ATTTCTTCAT TATGCCGACG
CTGGCTCGCT TTGAGTTGGC TATCCATCAG CGCGCTGAAC GCGGTCAACC GTTAACCGCG
ACCATCTTTA ACGAGTTGAT GGCCGATCTC TTTGCCGAGG GGTATGGTAG CGAGGTCGTC
GTTGATCGGG CGCGCGTCGG TAATACGTGG GCGCAGTTTT CTACCCATCT GTACGCCAAT
TTCTATGTCT ATCAGTATGC AACCGGTATT GCCGGTGCCC ACGCGCTGGC CGCACCTATC
CTCGCCGGTA ATGCCGATGC CGCCGATCGC TATATCAATG AGTTTCTCAA GGCCGGTGGT
TCACGCTTTC CACTTGATAC GTTGCGACAG GCCGGGGTTG ATCTAACTTC ACCCGAACCG
GTTGAGCAGA CCTTTGCCGT GATGGCATCT TACATTGATC GGCTTGAGCA GTTGGTCGGT
GGATCAGGTT CATAA
 
Protein sequence
MQTVRERSEI PEQYKWDPFS IFPSQAAWEA AIDEVNTLIA RAAQFRGRLH EGPPVIADFL 
GLSETIMRNV GQITVFATMF YTVDTNDREA SAMRDRAIGL QARASAALAF GEPELLAIGA
DQLLTWADQD EYLATYRHYF ERLIARAPHV RSAEVEELLG QVRDPFAAAS AVHGVLANAE
LRFPLAYDSN GEAYEITQGT INALITHPDR TLRKQAWEGY ADAHIAVENT MAQCLATGVK
QNVFLARARR YASALEAALK PNFIPLEVFH NLIATFERHL PIWHRYWRVR RAALGVDELH
VYDTKAPLAT PLIVPYERAV DWICAGMAPL GNEYVQIMRR GLREQRWVDV YPNRGKRAGA
FSTGAPGTHP FIMMSYNDDI FSLSTLAHEL GHSMHSYYTR RTQPVIYTNY GLFLAEVASN
FNQALVRAYL FQTLTDRNAQ IGLIEEAMAN FHRYFFIMPT LARFELAIHQ RAERGQPLTA
TIFNELMADL FAEGYGSEVV VDRARVGNTW AQFSTHLYAN FYVYQYATGI AGAHALAAPI
LAGNADAADR YINEFLKAGG SRFPLDTLRQ AGVDLTSPEP VEQTFAVMAS YIDRLEQLVG
GSGS