Gene Cagg_1780 details


Gene Information

Locus tag: Cagg_1780
Symbol:
ID: 7267692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2184543
End bp: 2187452
Gene Length: 2910 bp
Protein Length: 969 aa
Translation table: 11
GC content: 57%
IMG OID: 643566621
Product: Peptidase M16C associated domain protein
Protein accession: YP_002463116
Protein GI: 219848683
COG category: [R] General function prediction only
COG ID: [COG1026] Predicted Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
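
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed present).
    return gene_len_bp // 3 - 1


length = gene_length_bp(2184543, 2187452)
print(length)                     # 2910
print(protein_length_aa(length))  # 969
```

Both results match the Gene Length and Protein Length fields listed in the record.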


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.126781
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.432438
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAATC TGTACGGATT TGAGCTACTG CGCGATGAGT TCATCCCCGA ACTCAATACA 
CGCGCTCGCT TGTATCGTCA TATCAAGACC GGCGCTGAAC TGCTCTCGCT GGAAAATGAC
GACGAGAATA AATGTTTCGG CATTACCTTC CGCACCCCGC CGCGCGACTC GACCGGCATT
GCCCACATTC TCGAACACTC GGTGCTCTGT GGCTCGCGTA AGTATCCGGT CAAAGACCCC
TTTTTCACGT TGGTGAAGGG ATCGGTACAT ACCTTCCTTA ATGCCATGAC CTACCCGGAT
AAAACAACTT ATCCGGTTGC GAGCACCAAT TTGAAAGACT TCTACAATCT GATCGATGTC
TATCTCGATG CAGTCTTCTT CCCGCGTATT ACGCCCGAAG TACTCAAGCA AGAGGGTTGG
CACTTCGAGC TACCGGCGCC CGATGCACCG CTCACCATCA AGGGCGTGGT CTACAACGAG
ATGAAGGGGG CCTATTCGTC GCCCGATGGG ATGCTCTATC GCTACTCGCA GCAGTCGCTC
TTCCCCGATA CAACGTATGG TCATTCATCG GGCGGCGATC CTCTGTCGAT CCCTGATCTA
ACGTATGAAG CCTTTAAGCG TTTCCACGAG ACGCTCTATC ATCCCTCTAA CGCGCGCATC
TTCTTCTACG GCGATGATCC GCCGGAAGAG CGGCTGCGCA AGCTCGACGA GTACTTGAGC
CAGTTTGAGC GGATCGATCC GCCTTCTCAG ATCGAAAAGC AGCCCCGTTT CAGCGAGCCA
CGTGTCTTGG AGTACACCTT TAGTGCTGCC GATGAGTCCC AGCAAAAGGG CATGGTGATG
CTCAATTGGC TCCTCGATGA TAATCGCGAT CCGACCGAAC TGATGGCACG CGAGTTGTTG
GGCTACATTT TGCTGGGGAA TGCAGCAGCG CCGTTGCGCA AGGCGCTGAT CGATTCCGGG
TTAGGCGAAG AGGTCATTGG CGGGTATGAA AGTGATTTGC TGCAACAGAC CTTTTCGGTT
GGAATGAAAG GGATCGATCC GGCAAATGCA GGGCAGGTTG AAGAGCTGAT CCTGCGCACG
CTGGCCGAAC TTGCCGAGCA AGGGATCGAT ACCGAGACGA TCGCAGCAGC CTTTAATACC
TTTGAGTTTA GCCTGCGCGA AAACAACACC GGTAGCTTTC CGCGTGGTTT GGTGTTGATG
CTGCGCGCGC TGAGTACGTG GCTCTATGAC GATGACCCGA TTGCACCGTT GCGCTTCGAG
GCGCCACTGG CGGCAGTGCA GACTGCCGTA AAGAACGGTG ACCGTCTCTT TGAACGTATG
ATCGGCGAGC TGCTGATCAA CAATCCACAC CGGACGCGCG TGACCTTACG CCCTGATCCC
GAACACGCTG CCCGTTTGGC CGCCGCTGAG CAAGCTCGGA TCGATGCCTT CGCCACGACC
CTCGATGAGG CCAAGCGGGC GGCGCTGGTC GCCGAAACGC AGGCGCTCGT CGAATGGCAG
CAGACGCCCG ATCCGCCCGA GGCGTTGGCG ACGATCCCGA CCCTGCGTTT GTCCGATCTC
GACCGGACGA TCAAGCGGAT TCCGACCGAT ATTGATGAGC GAGGTGGCGT AACGCTTCTG
CGTCACAATC TATTTACCAA CGGCATCGTT TATCTCGATC TGGCTTTTGA TCTGCGTGCC
GTACCACCGC ACTTACTGCC CTACGTGCCA CTCTTTGCCC GTGCTCTCAC CGAAATGGGT
ACGGCGACCT CCGATTTTGT CCGTTTACTC CAGCGCATCG GGCGTGAGAC CGGTGGGATC
GGCGCTGCGC CGATGACGGC AACCGACCTT GTTTCAGGGC AGTCGGTTGG TCGGTTGATG
GTCCGTGGCA AAAGTACGCT CGGCCAAGCC GGCGAGTTAT TCCGCTTGCT CGGCGAGATT
CTGCTAACGG TGAATCTCGA TAACTGTGAG CGCTTCAAGC AGATCGTTCT GCGCTCACGG
GCCAACCGTG AGTCGTCACT TATACCGTCA GGCAATGCGT ATGCTCGCCA ACGTCTTGCT
GCACGGTTTG CTCCGGCGGA ATGGGCCGAG GAACAAATGA GTGGGGTTTC CGCCATTTTC
TTCTTGCGCG AGCTTGAGCA GCGTGTGCAG CACGATTGGC CGAGTGTGTT GGCCGATCTT
GAAGCGGTGC GGACGGCGCT GATCAATCGG CACGGGTTGG TGGCGAACCT GACCCTTGAT
GCGAGTGGGC AAGAGACGAT CATGCCGATG CTGATGGCAT TTCTCGCCGA ACTGCCTGAT
GTGCCCTATA CGCCGGTGCA GTGGTCGGTA AGTAGTGTTG ACGGTGGTGA GGGGTTGATC
ATTCCGGCAC AGGTGAATTA CGTTGCCAAA GGGGTAAATC TCCATGCCTA CGGCATTCGG
CCTAGCGGCG CGGCAATGGT GGTGTTGCGC CACCTGCGCA TCGACTATCT GCTCGACCGC
ATCCGTATTC AGGGTGGGGC GTATGGCGCC GGCGGTAGTT ACGACCGCAG CACCGGCCTG
TTCATTACTA CCTCGTACCG TGATCCCAAC CTCTTGCGCA CGCTCGATGT GTACGACGAG
ATGGCAACCT TCTTGCGCGA GACGGCGCTC GATCCGGCGA CGGTTGAACG GGCGATTATC
GGTACAATCG GCGATATGGA TGCTTACCAG TTGCCGGATG CCAAGGGATA CACCGCCCTG
GTGCGCTACC TGACCAGCGT AAGCGACGAG TATCGCCAGC AGATTCGTGA TGAAGTGTTA
GCGACTACCC CGGCCGATTT TGTTGCCTTC GCCGAAGCTG CGGCAGCGTT GCGTGATCAC
GGTCACGTTG CGGTGTTGGG TTCCGCCGAG GCGATCGAGG CGGCCAACCG TGAACGACCC
GGTCTGTTGA ACCCGGTCAA GGTCTTGTGA
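
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the blocks might be joined and a GC percentage computed, for comparison with the GC content field in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGACGAATC TGTACGGATT TGAGCTACTG CGCGATGAGT TCATCCCCGA ACTCAATACA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text instead of `first_line` gives the whole-gene figure; how the record rounds its reported GC content is not stated.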
 
Protein sequence
MTNLYGFELL RDEFIPELNT RARLYRHIKT GAELLSLEND DENKCFGITF RTPPRDSTGI 
AHILEHSVLC GSRKYPVKDP FFTLVKGSVH TFLNAMTYPD KTTYPVASTN LKDFYNLIDV
YLDAVFFPRI TPEVLKQEGW HFELPAPDAP LTIKGVVYNE MKGAYSSPDG MLYRYSQQSL
FPDTTYGHSS GGDPLSIPDL TYEAFKRFHE TLYHPSNARI FFYGDDPPEE RLRKLDEYLS
QFERIDPPSQ IEKQPRFSEP RVLEYTFSAA DESQQKGMVM LNWLLDDNRD PTELMARELL
GYILLGNAAA PLRKALIDSG LGEEVIGGYE SDLLQQTFSV GMKGIDPANA GQVEELILRT
LAELAEQGID TETIAAAFNT FEFSLRENNT GSFPRGLVLM LRALSTWLYD DDPIAPLRFE
APLAAVQTAV KNGDRLFERM IGELLINNPH RTRVTLRPDP EHAARLAAAE QARIDAFATT
LDEAKRAALV AETQALVEWQ QTPDPPEALA TIPTLRLSDL DRTIKRIPTD IDERGGVTLL
RHNLFTNGIV YLDLAFDLRA VPPHLLPYVP LFARALTEMG TATSDFVRLL QRIGRETGGI
GAAPMTATDL VSGQSVGRLM VRGKSTLGQA GELFRLLGEI LLTVNLDNCE RFKQIVLRSR
ANRESSLIPS GNAYARQRLA ARFAPAEWAE EQMSGVSAIF FLRELEQRVQ HDWPSVLADL
EAVRTALINR HGLVANLTLD ASGQETIMPM LMAFLAELPD VPYTPVQWSV SSVDGGEGLI
IPAQVNYVAK GVNLHAYGIR PSGAAMVVLR HLRIDYLLDR IRIQGGAYGA GGSYDRSTGL
FITTSYRDPN LLRTLDVYDE MATFLRETAL DPATVERAII GTIGDMDAYQ LPDAKGYTAL
VRYLTSVSDE YRQQIRDEVL ATTPADFVAF AEAAAALRDH GHVAVLGSAE AIEAANRERP
GLLNPVKVL