Gene Information |
Locus tag | CPS_4338 |
Symbol | gcp |
ID | 3519156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 4572004 |
End bp | 4573053 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637286779 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_270987 |
Protein GI | 71278237 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
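The coordinate and length fields above are related by simple arithmetic. The sketch below checks them against each other. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the gene is an untranslated stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values copied from the record for CPS_4338.
length = gene_length_bp(4572004, 4573053)
print(length)                     # 1050
print(protein_length_aa(length))  # 349
```

Both results agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields.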
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATTT TGGGTATTGA AACTTCTTGT GATGAAACAG GTATCGCCAT TTATGATGAC GGTTTAGGTG ATAGTCCTGA AGGTATACTC GCCCATCGTT TGTATAGTCA AATAGCCGTT CATGCTGATT ACGGTGGCGT AGTACCAGAA CTCGCATCAA GAGACCATGT ACGTAAGACT ATTCCTCTGA TTAAGGAAGT ATTGGCAGAT GCCAACCTTA CCCCCAAAGA TCTAGACGGT GTTGCGTACA CTGCTGGTCC TGGTTTAGTT GGCGCACTCT TGGTTGGTTG TTCCATTGGT CGAAGTTTGG CTTATGGTTG GGAGCTACCC GCAGTTCCTG TACATCATAT GGAAGGTCAT CTATTAGCGC CTATGCTTGA AGATGATGTA CCTGAGTTTC CTTTTGTCGC TTTATTAGTT TCTGGCGGAC ATACCATGCT GGTACGTGTC GATGCTATTG GTGAATACAA ACTACTTGGT GAATCAGTTG ATGATGCTGC CGGTGAAGCG TTTGATAAAA CAGCTAAATT GCTAGGTTTA GACTACCCTG GTGGACCGGC TTTATCTAAG ATGGCTGAAA GCGGTGAAGC AGGACGCTTT AAATTACCCC GCCCTATGAC TGATAGACCT GGTTTAGATT TTAGCTTTAG TGGTTTAAAA ACAGCGGCAG GAACCTTAGT TCGCAAAGAA TGTTTGAATT TGTCAGATAG CGATCTTAAG CAAACCCATG CTGATATTGC CAATGCGTTT CAACAAGCAG TGGTAGACAC CTTAGCGATA AAATGTAAAC GCGCCTTGCA ACAAGAAAAA TTAAGCAGAT TAGTAATTGC TGGTGGTGTA AGCGCCAATA CAGCTTTACG AGAGCAACTG GCGATAACCA CAAAGAAACT TGGTGGCAGT GTATTTTATC CACGTCCTGA GTTTTGCACT GATAACGGCG CTATGATTGC TTATGCTGGT TTACAGCGTT TAAAAGCAGG AACAGATGCC GACTTAACTT TTAAAGCTAA CCCTCGATGG GCGCTCGATT CATTACCTCC AGTAAAATAG
|
Protein sequence | MRILGIETSC DETGIAIYDD GLGDSPEGIL AHRLYSQIAV HADYGGVVPE LASRDHVRKT IPLIKEVLAD ANLTPKDLDG VAYTAGPGLV GALLVGCSIG RSLAYGWELP AVPVHHMEGH LLAPMLEDDV PEFPFVALLV SGGHTMLVRV DAIGEYKLLG ESVDDAAGEA FDKTAKLLGL DYPGGPALSK MAESGEAGRF KLPRPMTDRP GLDFSFSGLK TAAGTLVRKE CLNLSDSDLK QTHADIANAF QQAVVDTLAI KCKRALQQEK LSRLVIAGGV SANTALREQL AITTKKLGGS VFYPRPEFCT DNGAMIAYAG LQRLKAGTDA DLTFKANPRW ALDSLPPVK
|
| |
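The gene and protein sequences above can be cross-checked with a small standard-library sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which are not modeled here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
head = clean("ATGCGTATTT TGGGTATTGA AACTTCTTGT")
print(translate(head))  # MRILGIETSC
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence in the record, and `gc_percent` on the cleaned sequence can be compared with the GC content field.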