Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1991 |
Symbol | |
ID | 3747370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2527749 |
End bp | 2529497 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637774528 |
Product | peptidase S41A, C-terminal protease |
Protein accession | YP_380282 |
Protein GI | 78189944 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCCCGC AAAGAGAGAG CAAGCCGCGC CATAAGCAAA GCCAACGCAA CGGCTGGCGA ATTATTCAAC GCATGGCAAC GGCACTGCTT GCACTCTCTC TGCCTACCAC CACGCTTGCT TACCCTCAAG CGGAAAGCCA AAGCTTTGCA GTTGTTTCAA GCATTGAGCT TCTCTCCGAA GTATATCGCG AATTAGCAGC AGGCTACGTT GAGCCGCTTG ATACCGCCCT CTTAATGAAA ACGGGCATTC GAGGCATGTT GCGTAGCCTT GATCCCTACA CCACCTTACT TGAGCGCGAT GATGCCGATG AATTAGCCGA TATTACTCGT GGACGCTATG TGGGCATTGG CATATCGCTC GCTACGCTCG AAAAAAAGCT CTACGTTACC GCTGTTAATG AGGAGAGCCC AGCCGCAGCG GCAGGCATTC GCACAGGCGA TGCCATTCTT GCCATTAACG AGGCGAAAGT TGCCAATATA GCGGTTGATA GCTTAAGAAC TCTTTTGCAC GGCACCAATG GTTCACCCAT CACCTTTCAA TTAGAGCGAC GAGGCAGCGC CCCACGAACA ACCACCGTAC AACGCCAATC GGTGCCGCTA AAAAGCGTAC CATATTACGA ATTACACAAC AACATTGGCT ACATAGCGCT TGATGGCTTT ACCACCCGCT CACCCCATGA AGTACGTAGC GCATGGCAGT CATTGCAACA GCAAGCAACC GCTAACAAGC AACCCTTACG TGGCTTAATA GTTGATTTAC GCGACAACTC AGGTGGCTTG CTTGATGCCG CATTGGAAAT TACCTCGCTT TTTGTGCCCA ACGGCAGCGA AGTGGTTTCC ATTAAAGGGC GCTCTACCCA TAGCCATAGC ACTCTTAAAA CCACCACCGA GCCGTTAGAT GCAACACTCC CCGTTGCGCT GCTGATTAAT GGCGATACCG CTTCGGCGGC TGAAATTGTA GCTGGTGCTC TGCAAGATGT TGATCGAGCC ATTATTCTTG GCGAACGCTC TTACGGCAAA GGCTTAGTAC AATCGGTAAA AAAACTCTCT TATGGCAACA CACTGAAATT TACCACAGCA AAATATTACA CCCCTTCAGG GCGCCTCATT CAAAAAGAGC TGAAAAAAGA GAGCTCACCA CACTCAACCA ACGCTGATAG CAAACAAGCT CTTGCCTCCG CAGTACCCGA TACAACACAA CGCTTTTACA CCCGCAATCA CCGTATTGTG TATGGGGGAG GTGGAATTAT GCCCGATGTG GAGATAAAGG AACCAGCCTC GCCCTACGTA ACCGCATTGC GCAAACGAGG GATGATTTTT CTTTTTGCTA ATGAATGGTA CGCCACCCAT TCTGATGATG CTCCAGCCTC ATCCGCTTTG CTACCAAGCC AAACGGAGCT GTTAGCGCAC TTTGAAAAAT TCCTTCAGCA AAAAGAGTTT CGCTACACCA GCAATGCCGC AAAACGTTTA GAGGAATTAA AAAGCGCCAT GAAAGAGTCA GGCAGAGAGA ATCCTGAAGC CTTACGCACT ATGGAGCGCG AAGTTGAACT TGCAGATACA GAGGAGCGCA ATCGTGAAGC CAAGCAAGTA GCCGTAGCGC TTGAGTCAGC AATTTTGCGC CATGCCAGCG AACACTTAGC ACGCCAAGCC GAACTTCGCC ACGATGCGCT TGTGTTGCAA GCTGAAGAGC TGCTTATCTA TCCCGCCCGT TATCGTGCTA TGCTGAAAGC TTCAAGCACA AGAAAATAG
|
Protein sequence | MFPQRESKPR HKQSQRNGWR IIQRMATALL ALSLPTTTLA YPQAESQSFA VVSSIELLSE VYRELAAGYV EPLDTALLMK TGIRGMLRSL DPYTTLLERD DADELADITR GRYVGIGISL ATLEKKLYVT AVNEESPAAA AGIRTGDAIL AINEAKVANI AVDSLRTLLH GTNGSPITFQ LERRGSAPRT TTVQRQSVPL KSVPYYELHN NIGYIALDGF TTRSPHEVRS AWQSLQQQAT ANKQPLRGLI VDLRDNSGGL LDAALEITSL FVPNGSEVVS IKGRSTHSHS TLKTTTEPLD ATLPVALLIN GDTASAAEIV AGALQDVDRA IILGERSYGK GLVQSVKKLS YGNTLKFTTA KYYTPSGRLI QKELKKESSP HSTNADSKQA LASAVPDTTQ RFYTRNHRIV YGGGGIMPDV EIKEPASPYV TALRKRGMIF LFANEWYATH SDDAPASSAL LPSQTELLAH FEKFLQQKEF RYTSNAAKRL EELKSAMKES GRENPEALRT MEREVELADT EERNREAKQV AVALESAILR HASEHLARQA ELRHDALVLQ AEELLIYPAR YRAMLKASST RK
|
| |