Gene Information |
Locus tag | Cpha266_2165 |
Symbol | |
ID | 4570766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2506762 |
End bp | 2508024 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766740 |
Product | peptidase U32 |
Protein accession | YP_912594 |
Protein GI | 119357950 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
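The coordinate, gene length, and protein length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2506762, 2508024)
print(length)                     # 1263
print(protein_length_aa(length))  # 420
```

Both results agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields of this record.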
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTTT CCGAGTCTGT GACAAGCGAA AAAAAAATTG AACTTATAGC GCCTGCCGGC GACTGGACCT CACTCCGCAC CGCACTGCAA GCAGGAGCCG ATGCCGTCTA TTTCGGAGCT GAAGGCTATA ACATGAGGGC CGGAAGCAAT AACTTCACTC CGGCTGATTT TCCCGCCATC ATGACGCTCT GCAGTGAGTT CAACGCCAAA GCGTATCTGG CGCTGAACAC GATCGTCTAT GACGGCGAAC TGAAAAAGAT GGTTCAAACC GTCTCCGCTG CCAAAACGAC AGGCTTCGAT GCCGTTATCT GCTCGGACAT GGCTGTCGTC GATGCATGCC GAAAAGCAGC AATGCCCTTT CATATGTCAA CACAGGCTTC GATCAGCAAC TACAGCGCAG TAAAATTCTA TGCCGACCTT GGCGCAAAAA TGATCGTGCT GGCCCGCGAG CTTACCATTG ACCAGGTACG CCATATTACC TCGAAATTAA AGGCCGACCG TCTCGATGTA CAGATCGAGT GCTTTGTTCA CGGAGCGATG TGCGTCGCTG TTTCCGGGCG CTGCTTCATG TCACAGGAAC TTTTCGGACG CTCCGCCAAC CGGGGACAGT GCGTTCAGCC CTGCCGAAGG CAATATATCA TCACCGATCC TGAAGAGAAC CAGGAGCTTG AGCTTGGTAC CGATTATGTT ATGAGTCCGA AAGACATGTG CGCAGTGGAA TTTCTTGACG TTCTCATGGA TGCGGGAATC AGCGCATTCA AAATCGAAGG ACGAAGCCGC AGTCCGGAAT ATGTTCATAC TGCGACAACA GCTTACCGAC GGGCGATCGA CTTCTGCACG AGCCACCGCA ACAGTCCGGA ATTCAGAACA GAGTACAACT CCTTATCGAA ACAGCTTAAA GAGGAACTCG CACGGGTATA TAACCGGGGA TTTTCGGAAG GATTTTATTT TGGAAAACCC TTCGATGCCT GGACCAGAGA GTACGGCTCA ATGGCCTCCG AAAAAAAAAT CTATATCGGA GAGGTTAAAA AATATTATCC AAAAGCGGAG GTGGCTGAAA TCCTCATCTT TGCCCGAGGC CTCAAACAAG GCGATAAGCT CTCTGTTCTC GGCCCGAAGA CAGGAGTTAC AACCCTTTTT GCCGAAAGCT TTTATACCAA CGATCTTCCT GCAAAAACGG CTGTCAGGGG CGACAGCGTC ACCATCAAAT GTGCAAAAGT GAGAAAGAAC GACAAGGTAT ATGTGCTTGA AAAAAGAAGC TGA
|
Protein sequence | MNLSESVTSE KKIELIAPAG DWTSLRTALQ AGADAVYFGA EGYNMRAGSN NFTPADFPAI MTLCSEFNAK AYLALNTIVY DGELKKMVQT VSAAKTTGFD AVICSDMAVV DACRKAAMPF HMSTQASISN YSAVKFYADL GAKMIVLARE LTIDQVRHIT SKLKADRLDV QIECFVHGAM CVAVSGRCFM SQELFGRSAN RGQCVQPCRR QYIITDPEEN QELELGTDYV MSPKDMCAVE FLDVLMDAGI SAFKIEGRSR SPEYVHTATT AYRRAIDFCT SHRNSPEFRT EYNSLSKQLK EELARVYNRG FSEGFYFGKP FDAWTREYGS MASEKKIYIG EVKKYYPKAE VAEILIFARG LKQGDKLSVL GPKTGVTTLF AESFYTNDLP AKTAVRGDSV TIKCAKVRKN DKVYVLEKRS
|
| |
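The sequence fields above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be cleaned up, checked for GC content, and translated. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and all function names are hypothetical. Only the first 30 bases of the gene are used here to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; assumed to
# match the amino acid assignments of translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(field: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase the result."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean_sequence("ATGAACCTTT CCGAGTCTGT GACAAGCGAA")
print(translate(prefix))  # MNLSESVTSE
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the protein sequence field (MNLSESVTSE). Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content field.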