Gene Cagg_1934 details


Gene Information

Locus tag: Cagg_1934
Symbol:
ID: 7268850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2367146
End bp: 2368837
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 57%
IMG OID: 643566772
Product: peptidase M14 carboxypeptidase A
Protein accession: YP_002463265
Protein GI: 219848832
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2866] Predicted carboxypeptidase
TIGRFAM ID:
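
The start, end, gene length and protein length fields above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the gene length counts a single 3-bp stop codon that is not translated; the record states neither convention. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2367146
end_bp = 2368837
gene_length_bp = 1692
protein_length_aa = 563

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one untranslated 3-bp stop codon.
codons_translated = (gene_length_bp - 3) // 3

print(span_bp == gene_length_bp)               # coordinate span matches gene length
print(codons_translated == protein_length_aa)  # coding codons match protein length
```

Under those two assumptions both comparisons hold for the values listed in this record.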


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0111552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.96465e-06
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGACCA TCGACTTTAC CCGCTATTAT CGTCCACACG AAGTCGAAGC TGCACTCCAA 
GCGTGGGCTA CCGAGTACCC CAACCTCTGT GCTTTGCGTA GCATCGGCAC GAGTTACGAA
GGGCGTCCGA TCTGGCTCAT GACCCTCACC AATCAGGCAA CCGGTCCTGA CGACGAGAAA
CCGGCATTTT GGCTCGATGC CAACATTCAC GCCACCGAAG TCACCGGTTG TATGGGCGCA
CTCCACGTGA TCCAGACCGT GCTAGAGCAG TATGGGCGTG ATCCCAACAT TACCGCTTTA
CTCGACGAGC GGGCGCTTTA CATCGTTCCA TGTGTCAACC CTGACGGGAT GGAGCAAGCG
CTCACTTCAC CGGTCTACGT GCGTTCGGGT ACACGTCGCT ACCCCTTCAC CGATGATCGC
GATGGACTCT ACCCCGCCGA TGTCGATGGC GATGGTCTCA TACTGCAAAT GCGGGTGGTT
GATCCCGATG GTGGCTGGAA GGTGAGCGAA CGTGATCCGC GCTTGATGCG TCCGCGATTA
CCCGATGAGC GCGGCGGCAC CTACTATCGA GTGTATACCG AGGGCTACAT CCGCAACTTC
GACGGCTACG AGGTCAAGAT TGCCCCACCC ACTCAAGGAC TAGATTTCAA TCGCAACTTT
CCCTACATTT GGGCCCCTGA GGGTGTCCAG CGCGGCGCCG GGCCATATCC GACATCTGAA
CCGGAGATTC GCGCGATTGT GGAGTTCCTC ACATCTCACC TTAATGTGAG CAGTGCCATC
TCGTACCACA CGTATTCGGG TGCGATCTTG CGCCCGTATT CCGATAAACC TGACGATCAG
ATGCCGATCA ACGATCTGTG GACGTACAAA GAGCTGGGCC GGCGCGGCGA GGAGATCACC
GGCTACAAGC ATGTCTCGGT CTATCATGGC TTTCGGTATC ACCCACGCGA CGTGATGCGG
GGTGCGTTCG ACGACTGGGC TTACGAGCAA CTCGGCATCT ACGCTTTTAC CGTTGAGCTG
TGGGATATGA TCGGCGAAGC CGGTATCAAA GAACGCGATT TTATCGAGTG GTTCCGCGAT
CATCCCGAAG AAGACGATCT AAAGCTGCTG CGTTGGAACG ACGAACAACT CGACGGCGCA
GGATTCGTCA ATTGGCGCCC CTTTGATCAT CCGCAACTCG GCCGGGTTGA GATTGGTGGC
TGGATTGAAC GACGCACGTT CGGGAACCCA CCGGAGAAAT TCTTACTGCG TACCCTTGAA
CCGAACACCC GTTTCGTACT GGCCCTTGCC CAAACGGGGC CACGACTTGA GCTGCGGCAT
GTGCAGGCCG AACCGCTTGG TAACGAACTC TACCGACTGC AAGCGGTGGT GGTTAATAGC
GGCTACTTGC CAACCTACGG TAGCAGCAAG GCACAAGAGG TAAAAGCAGT ACAACCGATT
GCCGTCGAGG TAACCCTGCC GACGGACGGT GCGCTGGTGA GTGGTGAACA GCGGACAGAG
ATCGGTCAAC TGGAGGGGCG GGCAAACAAA CGGGCATTGT GGGGTGGTAG TTTTCCCACC
GACCACCTGC GCCGTCTGAC ATGGACAATC CGGGCCCCGG CGGGAAGTAG TGTTACCATT
ACGGCTCGCT CACAACGCGC CGGTGTAGCC CGTGCAACTG TGTCGTTGGG CGAAAGCATT
CATCAATCGT AA
 
Protein sequence
MPTIDFTRYY RPHEVEAALQ AWATEYPNLC ALRSIGTSYE GRPIWLMTLT NQATGPDDEK 
PAFWLDANIH ATEVTGCMGA LHVIQTVLEQ YGRDPNITAL LDERALYIVP CVNPDGMEQA
LTSPVYVRSG TRRYPFTDDR DGLYPADVDG DGLILQMRVV DPDGGWKVSE RDPRLMRPRL
PDERGGTYYR VYTEGYIRNF DGYEVKIAPP TQGLDFNRNF PYIWAPEGVQ RGAGPYPTSE
PEIRAIVEFL TSHLNVSSAI SYHTYSGAIL RPYSDKPDDQ MPINDLWTYK ELGRRGEEIT
GYKHVSVYHG FRYHPRDVMR GAFDDWAYEQ LGIYAFTVEL WDMIGEAGIK ERDFIEWFRD
HPEEDDLKLL RWNDEQLDGA GFVNWRPFDH PQLGRVEIGG WIERRTFGNP PEKFLLRTLE
PNTRFVLALA QTGPRLELRH VQAEPLGNEL YRLQAVVVNS GYLPTYGSSK AQEVKAVQPI
AVEVTLPTDG ALVSGEQRTE IGQLEGRANK RALWGGSFPT DHLRRLTWTI RAPAGSSVTI
TARSQRAGVA RATVSLGESI HQS
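
The sequences above are laid out in space-separated blocks of ten characters, so they have to be joined before any analysis. A minimal sketch of that step, together with a GC-fraction calculation, is shown below. The function names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its result is not expected to equal the 57% reported for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the ten-character blocks form one sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, blocks included.
first_line = "ATGCCGACCA TCGACTTTAC CCGCTATTAT CGTCCACACG AAGTCGAAGC TGCACTCCAA"

seq = join_blocks(first_line)
print(len(seq))                    # number of bases after joining the blocks
print(f"{gc_fraction(seq):.2%}")   # GC fraction of this first line only
```

To check the reported GC content, pass the full gene sequence (all lines under "Gene sequence") to `join_blocks` instead of the single line used here.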