Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1655 |
Symbol | |
ID | 7268957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2018392 |
End bp | 2020893 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643566497 |
Product | peptidase U32 |
Protein accession | YP_002462992 |
Protein GI | 219848559 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.296071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAAAC CCGAAATTAT GAGTCCGGCC GGCTACTGGC CGCAACTCCA TGCCGCGATT GAGGCCGGCG CCGATGCCGT CTATTTTGGT TTGACGCATT TCACGGCACG CGCGAAGGTC GGTTTTACGC TCGATGAGTT GCCGGAGGTG ATGAGGACAC TCCATCGCCG CGGCGTTAAA GGGTACGTTA CGTTCAATAC CTTAGTGTTC GACCACGAGC TACGCACGGC GGCGAGGACA CTTGCAGCGA TCGCCGCAGC CGGCGCCGAT GCGATCATCG TGCAAGATCT CGGTATTGCA GCACTGGCCC ATCAGATTGT GCCCGATCTC CCTATTCACG GGAGCACGCA GATGAGCATC ACCAGTGCGG AAGGAGTAGC TTTTGCCGCC CGCTACGGCG TCTCGCGGGT GGTCTTGGCC CGCGAGTTGT CGCTGGCCGA GGTTGCCGCC ATTCGTTCAC GTAGCCCTAT CGAACTCGAA ATCTTTGTCC ATGGCGCCCT CTGCGTCTCA TATTCGGGAC AGTGTTTTTC GTCAGAGGCA TGGGGCGGAC GCAGTGCGAA CCGTGGTCAA TGCGCTCAAG CATGTCGCTT ACCCTACGAG TTGATCGTCG ACGGAAAACC ACGACCCCTC GGCGCGGCGC GCTATTTGCT CTCACCCGGC GATCTGGCCG CTATTGATGA TATGACAACT ATCGCTCGCC TGGGCGTGAG TGCGCTCAAG ATCGAAGGGC GGTATAAAGA TGCTGAATAC GTTGCGATCA CAACCAACGC ATATCGCCGC GCACTCGATG CGGTATGGGC CGGGCTTCCT TCCGATTTGA CCGTTGCCGA CCGGCTGTAC CTCGAACAAG TGTACTCTCG TGGGCTGGGA CCGCACTTTT TGCGCGGCAC GAATCATCAG GCTGTCGTCG AAGGGCGCGC CCCTCGCCAC CGCGGTTTGT TGATGGGACG GGTGGTGCGG GTGCGTGCAG ACGCGATCAT TATCATACCT GAACGGGGCC GCGAGGCGGC ACCGCTCAAG CCTGGCGATG GTGTTGTCTT TGATGCCGCC GATTGGCGCA GCCCGGAAGA GCCAGAAGAG GGTGGTCGCA TCTTTACGGT TGAACCGGTC GGTGATGGAT TGCTGGCAAT TCGTTTCGCA AAGGGGGCAA TTAATCCACG CCGTATTCGC GCCGGAGATC TGCTGTGGCG CACAAGTGAT CCGCAAACCG AGCGAATAGC CCGCCCCTTC GTGCAAGCGG CTGCACCGGC ACGCCGCCAA CCGGTGCGCG TCACAGCGCT GGTACGCGCC GGTGCGCCGC TTGAACTACA CTGGTCGCTT ATTGCCCAAC CGAGCTTGAC CGTAACGGTA CAGAGTCCGA CACCTCTGAC CACAGCCCAG AATCGGCCAC TCGATGAAAC GACGCTGCGT GAGCAATTGG GGCGGTTAGG CGATACTCCT TACCAACTGA CCGAACTCAA CGCGGTCATT GAAGGCAACC CGTTTGTGCC GGTATCGCTG CTCAACCAAT TGCGCCGGCA AGCCACCGCT GCGCTCGCCG AGTTGCAAGG ACGGCCACCG GCAATGAACA TCCTCGATCC GGAGCAGGTG TTAGATACCA TGCTCGCTGC CGTTGCCCCA TCCGCACCAC CTGACACTGC TCAGATTCAC CTCCTGATCC GTTCACCCGA TCAACTCGAA GCGGCGATTG CGCTACAGCC GGCCAGTATT ACGCTCGATT ATCTCGACCT TGAAGGATTG AAGCCGGCCG TGACCCGCGT GCGCGCTGCC GGGATCGCAG TCCGTGTTGC CGCACCGCGG GTGCTCAAAC CGGAAGATGA GCGAGTAGCC CGCTTTTTAC GTAAACTCAA TGTACCGCTG CTCGTGCGCT CGACCGGCTT GCTCGACAGG CTGCGCGACG ATCCGACGGT TGAGTTGACC GGTGATTTTA GCCTCAACGC AGCCAATATC CTCACCGCCG ATCTGCTGCT GCGGTCGGGA TTACAACGGC TCACGCTGAC CCACGATCTC AACGCCGAGC AGATCGCGCA CTTAGCCGAA CGGATCGGCG GTAGTCGGCT AGAAGCCATC GTCTATCACC ATCTCCCTGT CTTTCATACC GAGCACTGTG TCTTTTGTCG TTTCTTATCA ACTGGAACCA GTTACAAAGA CTGCGGTCGC CCGTGTGAAC GTCACCACGT TGCGCTGCGC GACACTCACG GTCGGGCACA CCCGGTCATT GCTGATGTTG GTTGCCGCAA CACGGTCTTT GGGGCTGAGG CGCAAGAAGC GAGCAAATAT CTCGACCGCT GGCGCGCTGC CGGGATTGCT CACTACCGGC TTGAATTTGT CCATGAAACA GCAGCGCAGA TTACTGCGGT GACAGAAGCC TTCCGTGCGT ATTTGCAGGA AGAGATTGAT GCTGCTGAGC TAGGGCGACG TTGGCGACAA AGCGCACCAC AAGGCGTCAC CGAAGGTAGT TTCTTCGTAC CGGCAAATTA TCAGTACATT CCACTGATGT GA
|
Protein sequence | MHKPEIMSPA GYWPQLHAAI EAGADAVYFG LTHFTARAKV GFTLDELPEV MRTLHRRGVK GYVTFNTLVF DHELRTAART LAAIAAAGAD AIIVQDLGIA ALAHQIVPDL PIHGSTQMSI TSAEGVAFAA RYGVSRVVLA RELSLAEVAA IRSRSPIELE IFVHGALCVS YSGQCFSSEA WGGRSANRGQ CAQACRLPYE LIVDGKPRPL GAARYLLSPG DLAAIDDMTT IARLGVSALK IEGRYKDAEY VAITTNAYRR ALDAVWAGLP SDLTVADRLY LEQVYSRGLG PHFLRGTNHQ AVVEGRAPRH RGLLMGRVVR VRADAIIIIP ERGREAAPLK PGDGVVFDAA DWRSPEEPEE GGRIFTVEPV GDGLLAIRFA KGAINPRRIR AGDLLWRTSD PQTERIARPF VQAAAPARRQ PVRVTALVRA GAPLELHWSL IAQPSLTVTV QSPTPLTTAQ NRPLDETTLR EQLGRLGDTP YQLTELNAVI EGNPFVPVSL LNQLRRQATA ALAELQGRPP AMNILDPEQV LDTMLAAVAP SAPPDTAQIH LLIRSPDQLE AAIALQPASI TLDYLDLEGL KPAVTRVRAA GIAVRVAAPR VLKPEDERVA RFLRKLNVPL LVRSTGLLDR LRDDPTVELT GDFSLNAANI LTADLLLRSG LQRLTLTHDL NAEQIAHLAE RIGGSRLEAI VYHHLPVFHT EHCVFCRFLS TGTSYKDCGR PCERHHVALR DTHGRAHPVI ADVGCRNTVF GAEAQEASKY LDRWRAAGIA HYRLEFVHET AAQITAVTEA FRAYLQEEID AAELGRRWRQ SAPQGVTEGS FFVPANYQYI PLM
|
| |