Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1780 |
Symbol | |
ID | 7267692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2184543 |
End bp | 2187452 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643566621 |
Product | Peptidase M16C associated domain protein |
Protein accession | YP_002463116 |
Protein GI | 219848683 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.126781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.432438 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATC TGTACGGATT TGAGCTACTG CGCGATGAGT TCATCCCCGA ACTCAATACA CGCGCTCGCT TGTATCGTCA TATCAAGACC GGCGCTGAAC TGCTCTCGCT GGAAAATGAC GACGAGAATA AATGTTTCGG CATTACCTTC CGCACCCCGC CGCGCGACTC GACCGGCATT GCCCACATTC TCGAACACTC GGTGCTCTGT GGCTCGCGTA AGTATCCGGT CAAAGACCCC TTTTTCACGT TGGTGAAGGG ATCGGTACAT ACCTTCCTTA ATGCCATGAC CTACCCGGAT AAAACAACTT ATCCGGTTGC GAGCACCAAT TTGAAAGACT TCTACAATCT GATCGATGTC TATCTCGATG CAGTCTTCTT CCCGCGTATT ACGCCCGAAG TACTCAAGCA AGAGGGTTGG CACTTCGAGC TACCGGCGCC CGATGCACCG CTCACCATCA AGGGCGTGGT CTACAACGAG ATGAAGGGGG CCTATTCGTC GCCCGATGGG ATGCTCTATC GCTACTCGCA GCAGTCGCTC TTCCCCGATA CAACGTATGG TCATTCATCG GGCGGCGATC CTCTGTCGAT CCCTGATCTA ACGTATGAAG CCTTTAAGCG TTTCCACGAG ACGCTCTATC ATCCCTCTAA CGCGCGCATC TTCTTCTACG GCGATGATCC GCCGGAAGAG CGGCTGCGCA AGCTCGACGA GTACTTGAGC CAGTTTGAGC GGATCGATCC GCCTTCTCAG ATCGAAAAGC AGCCCCGTTT CAGCGAGCCA CGTGTCTTGG AGTACACCTT TAGTGCTGCC GATGAGTCCC AGCAAAAGGG CATGGTGATG CTCAATTGGC TCCTCGATGA TAATCGCGAT CCGACCGAAC TGATGGCACG CGAGTTGTTG GGCTACATTT TGCTGGGGAA TGCAGCAGCG CCGTTGCGCA AGGCGCTGAT CGATTCCGGG TTAGGCGAAG AGGTCATTGG CGGGTATGAA AGTGATTTGC TGCAACAGAC CTTTTCGGTT GGAATGAAAG GGATCGATCC GGCAAATGCA GGGCAGGTTG AAGAGCTGAT CCTGCGCACG CTGGCCGAAC TTGCCGAGCA AGGGATCGAT ACCGAGACGA TCGCAGCAGC CTTTAATACC TTTGAGTTTA GCCTGCGCGA AAACAACACC GGTAGCTTTC CGCGTGGTTT GGTGTTGATG CTGCGCGCGC TGAGTACGTG GCTCTATGAC GATGACCCGA TTGCACCGTT GCGCTTCGAG GCGCCACTGG CGGCAGTGCA GACTGCCGTA AAGAACGGTG ACCGTCTCTT TGAACGTATG ATCGGCGAGC TGCTGATCAA CAATCCACAC CGGACGCGCG TGACCTTACG CCCTGATCCC GAACACGCTG CCCGTTTGGC CGCCGCTGAG CAAGCTCGGA TCGATGCCTT CGCCACGACC CTCGATGAGG CCAAGCGGGC GGCGCTGGTC GCCGAAACGC AGGCGCTCGT CGAATGGCAG CAGACGCCCG ATCCGCCCGA GGCGTTGGCG ACGATCCCGA CCCTGCGTTT GTCCGATCTC GACCGGACGA TCAAGCGGAT TCCGACCGAT ATTGATGAGC GAGGTGGCGT AACGCTTCTG CGTCACAATC TATTTACCAA CGGCATCGTT TATCTCGATC TGGCTTTTGA TCTGCGTGCC GTACCACCGC ACTTACTGCC CTACGTGCCA CTCTTTGCCC GTGCTCTCAC CGAAATGGGT ACGGCGACCT CCGATTTTGT CCGTTTACTC CAGCGCATCG GGCGTGAGAC CGGTGGGATC GGCGCTGCGC CGATGACGGC AACCGACCTT GTTTCAGGGC AGTCGGTTGG TCGGTTGATG GTCCGTGGCA AAAGTACGCT CGGCCAAGCC GGCGAGTTAT TCCGCTTGCT CGGCGAGATT CTGCTAACGG TGAATCTCGA TAACTGTGAG CGCTTCAAGC AGATCGTTCT GCGCTCACGG GCCAACCGTG AGTCGTCACT TATACCGTCA GGCAATGCGT ATGCTCGCCA ACGTCTTGCT GCACGGTTTG CTCCGGCGGA ATGGGCCGAG GAACAAATGA GTGGGGTTTC CGCCATTTTC TTCTTGCGCG AGCTTGAGCA GCGTGTGCAG CACGATTGGC CGAGTGTGTT GGCCGATCTT GAAGCGGTGC GGACGGCGCT GATCAATCGG CACGGGTTGG TGGCGAACCT GACCCTTGAT GCGAGTGGGC AAGAGACGAT CATGCCGATG CTGATGGCAT TTCTCGCCGA ACTGCCTGAT GTGCCCTATA CGCCGGTGCA GTGGTCGGTA AGTAGTGTTG ACGGTGGTGA GGGGTTGATC ATTCCGGCAC AGGTGAATTA CGTTGCCAAA GGGGTAAATC TCCATGCCTA CGGCATTCGG CCTAGCGGCG CGGCAATGGT GGTGTTGCGC CACCTGCGCA TCGACTATCT GCTCGACCGC ATCCGTATTC AGGGTGGGGC GTATGGCGCC GGCGGTAGTT ACGACCGCAG CACCGGCCTG TTCATTACTA CCTCGTACCG TGATCCCAAC CTCTTGCGCA CGCTCGATGT GTACGACGAG ATGGCAACCT TCTTGCGCGA GACGGCGCTC GATCCGGCGA CGGTTGAACG GGCGATTATC GGTACAATCG GCGATATGGA TGCTTACCAG TTGCCGGATG CCAAGGGATA CACCGCCCTG GTGCGCTACC TGACCAGCGT AAGCGACGAG TATCGCCAGC AGATTCGTGA TGAAGTGTTA GCGACTACCC CGGCCGATTT TGTTGCCTTC GCCGAAGCTG CGGCAGCGTT GCGTGATCAC GGTCACGTTG CGGTGTTGGG TTCCGCCGAG GCGATCGAGG CGGCCAACCG TGAACGACCC GGTCTGTTGA ACCCGGTCAA GGTCTTGTGA
|
Protein sequence | MTNLYGFELL RDEFIPELNT RARLYRHIKT GAELLSLEND DENKCFGITF RTPPRDSTGI AHILEHSVLC GSRKYPVKDP FFTLVKGSVH TFLNAMTYPD KTTYPVASTN LKDFYNLIDV YLDAVFFPRI TPEVLKQEGW HFELPAPDAP LTIKGVVYNE MKGAYSSPDG MLYRYSQQSL FPDTTYGHSS GGDPLSIPDL TYEAFKRFHE TLYHPSNARI FFYGDDPPEE RLRKLDEYLS QFERIDPPSQ IEKQPRFSEP RVLEYTFSAA DESQQKGMVM LNWLLDDNRD PTELMARELL GYILLGNAAA PLRKALIDSG LGEEVIGGYE SDLLQQTFSV GMKGIDPANA GQVEELILRT LAELAEQGID TETIAAAFNT FEFSLRENNT GSFPRGLVLM LRALSTWLYD DDPIAPLRFE APLAAVQTAV KNGDRLFERM IGELLINNPH RTRVTLRPDP EHAARLAAAE QARIDAFATT LDEAKRAALV AETQALVEWQ QTPDPPEALA TIPTLRLSDL DRTIKRIPTD IDERGGVTLL RHNLFTNGIV YLDLAFDLRA VPPHLLPYVP LFARALTEMG TATSDFVRLL QRIGRETGGI GAAPMTATDL VSGQSVGRLM VRGKSTLGQA GELFRLLGEI LLTVNLDNCE RFKQIVLRSR ANRESSLIPS GNAYARQRLA ARFAPAEWAE EQMSGVSAIF FLRELEQRVQ HDWPSVLADL EAVRTALINR HGLVANLTLD ASGQETIMPM LMAFLAELPD VPYTPVQWSV SSVDGGEGLI IPAQVNYVAK GVNLHAYGIR PSGAAMVVLR HLRIDYLLDR IRIQGGAYGA GGSYDRSTGL FITTSYRDPN LLRTLDVYDE MATFLRETAL DPATVERAII GTIGDMDAYQ LPDAKGYTAL VRYLTSVSDE YRQQIRDEVL ATTPADFVAF AEAAAALRDH GHVAVLGSAE AIEAANRERP GLLNPVKVL
|
| |