Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2127 |
Symbol | |
ID | 6375821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2304158 |
End bp | 2305405 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642684617 |
Product | peptidase U32 |
Protein accession | YP_001960516 |
Protein GI | 189501046 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00706152 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACAGAC ACCGTAAAAT CGAGCTCATT TCACCTGCCG GAGACCACAC ATCGCTTCTT GCCGCCCTTC AGGCGGGAGC AGATGCCGTC TATTTCGGTG CGGAGGGATA TAATATGCGG GCGGCCAGCA GAAGTTTCAC GCCCGATGAT TTTCCGACTG TCTCCGGCCT TTGCGCGACC TATGGCGCAA AAGCCTACCT GGCACTCAAT ACCGTGATAT ACGATGAAGA ACTGCCGGAT GTTCAAAAGA CGGTCCGGGC AGCAAAAGCT GGTGGTCTCG ACGCGATCAT CTGCTGGGAC CAGTCGGTTA TAGAAGCGTG TCGGGAAGCC GGGATGCCCT TTCATCTCTC AACGCAGGCA TCAGTCAGCA ATTACCGCGC GGTACGCTAC TATGCCTCGC TTGGCGCGGG AATGATCGTA CCCGCCCGTG AACTGACCCT TGAACAGATC ATAAAGATCA CCGAAAGAAT CCGCCTGGAA AAACTGGACG TAGCCATCGA ATGCTTTGTT CATGGCGCCA TGTGTATGGC CGTGTCGGGA AGATGCTTTC TCTCGCAGGA CATCTTCGGG CGTTCAGCCA ACCGCGGCGC ATGCATGCAG CCCTGCAGAC GCCGTTACAG GATCATCGAT AGTGATGACG GTCATGAACT GGATCTCGGG ACAGATACCG TGATGAGCCC TGAAGACCTT TGCACCATTT CGTTCATTGA CAAACTCATC GATGCAGGCA TAACCGGCTT CAAGATAGAA GGCCGGAACC GAAGTCCTGA ATATGTCCAT ACTACAACGA AATGCTACCG CAAGGCCATC GACTACACTC TCGAACATGG ACACGAAAAA CAGTTCAGAC GCCATTTTGA AGCTCTGGCG AAAGAACTGG CCACGGAACT TCACAAGGTC TACAACCGCG GATTTTCACA TGGATTTTAC CTTGGCGTTC CTGTTGATTC ATGGACACAG CAGTACGGGT CTCTTGCCAC GGAAAAAAAA GTGTATGCAG GTACTGTGCA GAAATACTAC CCTAAAGCAA AGGTGGCGGA AATCCTGATA CACACCAGAG GAATACACTC GGAAGAAAAA CTCTCGATAC AGGGAACAAC AACCGGACTG GTTGTTCTCA ACGTCCAGTC GATGCGGGTT AACGATCAGC CTGCTCTCTC GGCATCAAAA GGAGATATTG CGACAATCCC CTGCGATAAA AAAGTCAGGA AAAACGACAA GGTGTATGTG CTGGAAGCTG CGGAATAA
|
Protein sequence | MDRHRKIELI SPAGDHTSLL AALQAGADAV YFGAEGYNMR AASRSFTPDD FPTVSGLCAT YGAKAYLALN TVIYDEELPD VQKTVRAAKA GGLDAIICWD QSVIEACREA GMPFHLSTQA SVSNYRAVRY YASLGAGMIV PARELTLEQI IKITERIRLE KLDVAIECFV HGAMCMAVSG RCFLSQDIFG RSANRGACMQ PCRRRYRIID SDDGHELDLG TDTVMSPEDL CTISFIDKLI DAGITGFKIE GRNRSPEYVH TTTKCYRKAI DYTLEHGHEK QFRRHFEALA KELATELHKV YNRGFSHGFY LGVPVDSWTQ QYGSLATEKK VYAGTVQKYY PKAKVAEILI HTRGIHSEEK LSIQGTTTGL VVLNVQSMRV NDQPALSASK GDIATIPCDK KVRKNDKVYV LEAAE
|
| |