Gene Information |
Locus tag | Cag_0547 |
Symbol | |
ID | 3747051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 651268 |
End bp | 652536 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637773081 |
Product | collagenase |
Protein accession | YP_378863 |
Protein GI | 78188525 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
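
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 651268
END_BP = 652536
GENE_LENGTH_BP = 1269
PROTEIN_LENGTH_AA = 422


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the reported gene length (1269 bp) and protein length (422 aa).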
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACCCT TGAAAAGTCA ACCGAACCAT TCAATTCCAC AAGCTGAACT TATTGCCCCC GCAGGCGATA TGACGGCGCT TGTAACTGCT TTACAGGCAG GTGCCGATGC CGTCTATTTT GGTGCCGAAG GCTATAACAT GCGAGCAGGC AGCAACAACT TTACAAATAG CGACTTTGCT ACGGTGCGTG CCCTCTGTTC CAAGCACAAC GCAAAAGCCT ATCTTGCACT TAACACCATT ATTTACGATA GCGAGCTCAA GCAGATGCGC CAAAGCGTTG AAAGCGCAAA AGCCGCTGGT ATTGACGCTA TTATTTGCTC CGATATGGCA GTTATTGAAG CGTGTCGCCA AGCCGAAATG CCCATCCACC TCTCCACCCA AGCCTCTGTA AGCAACTACA ACACACTCCG CTTTTTTGCA GAGCAGGGTG CCGCCATGGT TGTGCTTGCT CGCGAGCTTA CCATCGAGCA AGTGCGCCAC ATTACACGCA ACATTCAGCA CGACAGCTTA CCTGTTCGCA TTGAGTGCTT TGTGCACGGA GCCATGTGCG TTGCGGTGTC GGGGCGTTGC TTTCTTTCAC AGGAGTTATT TGGGCGATCA GCAAATCGAG GACAATGTGT TCAGCCATGT CGCCGTAGCT ACATTATTAC CGATCCCGAA GAGAACGAAG AGCTTGAGCT TGGCGCTGAT TACGTGATGA GCCCAAAAGA TTTATGCGCC ATTGAATTTC TTGATGTATT GCTTGATGCT GGCATTAGCG CCTTTAAAAT TGAAGGAAGG AGCCGTAGCC CCGAATACGT TCACACTACT ACTACGGCAT ATCGCCAAGC ACTCAATATG TGCATGCAGC AGCGCCACCA AGCCGATTTT AGAAACCGTT ACAGCGCCTT AACAGCTTCG TTAAAGCACG ATTTAGCAAC AGTTTACAAT CGAGGATTTT CTAACGGCTT TTATTTCGGC AAGCCTATGG AGGCATGGGC ACAAACGTAT GGATCACAAG CAACGGAGAA AAAAACCTAT ATAGGCGACA TCAATAAATA CTTTCCAAAA GCAGGAATTG CTGAATTACA CATTCGAGCA CGAGGTTTAA AGCAAGGCGA TAAACTTTCT ATTCTTGGTG TAAAAAGTGG GATGGTAACG GTTATAGCTG ATTCATTTCT TACCAACGAT CAACCAAATA CGGAAGCAAT AAAAGGGGAT AGCGTTACCT TTAAATGCCC TCCCGTTCGC AAAAATGATA AAGTATATGT TTTAGAGGAG AGAAAGTAA
|
Protein sequence | MTPLKSQPNH SIPQAELIAP AGDMTALVTA LQAGADAVYF GAEGYNMRAG SNNFTNSDFA TVRALCSKHN AKAYLALNTI IYDSELKQMR QSVESAKAAG IDAIICSDMA VIEACRQAEM PIHLSTQASV SNYNTLRFFA EQGAAMVVLA RELTIEQVRH ITRNIQHDSL PVRIECFVHG AMCVAVSGRC FLSQELFGRS ANRGQCVQPC RRSYIITDPE ENEELELGAD YVMSPKDLCA IEFLDVLLDA GISAFKIEGR SRSPEYVHTT TTAYRQALNM CMQQRHQADF RNRYSALTAS LKHDLATVYN RGFSNGFYFG KPMEAWAQTY GSQATEKKTY IGDINKYFPK AGIAELHIRA RGLKQGDKLS ILGVKSGMVT VIADSFLTND QPNTEAIKGD SVTFKCPPVR KNDKVYVLEE RK