Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2074 |
Symbol | |
ID | 3518372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | - |
Start bp | 2150151 |
End bp | 2151452 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637284533 |
Product | Xaa-Pro dipeptidase |
Protein accession | YP_268801 |
Protein GI | 71278779 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.63699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGTGA TAACGCCCAT TTCTGCAAGT GAAAAGCAAT CTACTATATT ACTCAAAAAC GTTCATGTCT TTGATGGTAT GAATGAAAAG CGCATGATGA ATGCCAATGT ACTAATCGAA AATAATATTA TCAAGGAAAT CACCCAGAAA AATATTAATG CTCCCGAGGC AACTCAAATT GATGGTAAAG GCCGAACGTT AATACCTGGA TTGATTGATA TGCATTGGCA TTCCGCCTAC TCCAGTATTC CTATGCAAAA AGGCCTTACC CTTGACCACG CATATCATTT GCTGATTGGT GCAAAAGCAA ATGAAAAAGC ACTTTTACGT GGCTTTACTA CGGTGAGAGA TGTTGGCGGT AATGTTTTTT CTTTAGCTAA ACTGACTGAT GAAGGGGTCT ATAACGGCCC TCGTATCTTT CCGTCTGGGC CGGCAATAAG TCAAACTTCT GGGCATACAG ACTTTAGACC AGGTACGGCA GTACCTGCTG AAACTAATGC CCCGCTGGTG TATATGGAGT CAATCGGCCA TGTAATGGTG GCAGATGGCG TTCCTGAAGT ATTAAAACGT ACTCGAGAGG CACTTCGAAT GGGTGCGACA CAAATCAAAA TTAATTCAGG TGGTGGAGTC TCATCTTCGT TCGATCCCCT TGATGTTACT CAATTTACGT TAGAGGAGAC TAAAGCCGCA GTAGCTGCCG CTTCTGATTG GAATACGTAT GTGGCCACAC ACACATTTAC CGATGCTGCT ACTCAACGAG CTTTAGAGGC TGGTGTAATA AGTATAGAGC ACGGTCACTT ATTGAGTGAA AAAACGCTAC GTTTGATGAA GAAAAAAGGG GCTTACCTTA GCATTCAACC TATATTGGAT GACGAAGATG CCATCGCATT TCCAGAAGGA TCATTTAGTC GTCAGAAATA TGTCGAAGTC ACTAAAGGTA CTGATCGTGT TTATCGTTTG GCTAAGAAAG TGGGCGTTAA AACAGTGTTC GGCACAGATA CACTATTTGA CCCTTTACTT GCAGAGAAGC AAGGAAAACA GTTAGCCAAA TTATCAAAAT GGCATTCACC TGTTGAAGCA TTACGTCAGG CTACGTCAAC GGCTGGGGAA TTATTAGCTT TATCAGGCCA ACGCAGCCCT TATCCTCAAG GTGCATTGGG AGTCATTAAA GTAGGTGCTT ATGCAGACTT AATTTTAGTT GATGGTAATC CGTTGAAAAA CCTTAATTTA GTTTCGAATC CAAAGGATAA TTTTGATTTA ATTATGAAAG ATGGAAAAAT ATATAAGAAT ACTCTTAATT GA
|
Protein sequence | MGVITPISAS EKQSTILLKN VHVFDGMNEK RMMNANVLIE NNIIKEITQK NINAPEATQI DGKGRTLIPG LIDMHWHSAY SSIPMQKGLT LDHAYHLLIG AKANEKALLR GFTTVRDVGG NVFSLAKLTD EGVYNGPRIF PSGPAISQTS GHTDFRPGTA VPAETNAPLV YMESIGHVMV ADGVPEVLKR TREALRMGAT QIKINSGGGV SSSFDPLDVT QFTLEETKAA VAAASDWNTY VATHTFTDAA TQRALEAGVI SIEHGHLLSE KTLRLMKKKG AYLSIQPILD DEDAIAFPEG SFSRQKYVEV TKGTDRVYRL AKKVGVKTVF GTDTLFDPLL AEKQGKQLAK LSKWHSPVEA LRQATSTAGE LLALSGQRSP YPQGALGVIK VGAYADLILV DGNPLKNLNL VSNPKDNFDL IMKDGKIYKN TLN
|
| |