Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0494 |
Symbol | |
ID | 3746363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 577652 |
End bp | 579172 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637773028 |
Product | peptidase S1C, Do |
Protein accession | YP_378810 |
Protein GI | 78188472 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA GTGAGAAAAT ATCTTCACGC ATAAAAAAAG TGCTGTTGGT GTTAAGCGGC GTTGCGGTTG GTGCGCTTGT TTTTTCCAAC ATGGAGTACT CAGTTTCTTT TAACGGTACA ACCTTTTCTA ACACTCCCTC TTTTGCCACA GCAACCAGCA ATATTGCTGA TGCTCCCATT AGTTCACTAC GGAACTTTAA TGAGGCGTTT GTGCAAATTG CCGAATCGGC AACGCCTTCG GTAGTAACTA TTTTTACCGA GAAAACGGTC AATCAGCGGG TTGTTTCGCC CTTTAACTTT TTTGGAAGCC CTTTTGATGA CTTTTTTGGT CGTCCTGATG GGAATAGTGC CGAGCGTAAG AATGTGCGGC GTGGCATTGG TTCAGGCGTT ATTGTAACGG CTGACGGCTA CATTCTTACC AACAACCATG TGATTGATGG TGCCGATGTG GTTTATGTGC GCACGGCTGA TAAGCGCCGC CTTGATGCTA AGGTGATTGG TACTGATCCC AAAACCGATA TTGCCGTTAT TAAGGTAAAT CAGCAAGGGT TAAAGCCTAT TGTAATTGGC GATAGCGATA AGTTGCGAGT AGGGGAGTGG GTAATTGCTA TTGGCAGTCC ACTTGGCGAA AATCTTGCAC GCACCGTAAC GCAAGGTATT GTAAGCGCGA AAGGGCGTGC CAATGTAGGG TTAGCCGATT ATGAAGATTT TATTCAAACC GATGCCGCCA TTAATCCGGG CAATTCAGGT GGTGCGCTGG TTAATATCAA TGGGGAATTA GTTGGCATTA ACACGGCAAT TGCCAGCCGC ACGGGTGGCT TTGAGGGGAT TGGTTTTGCG GTGCCATCCA ACATGGCAAA AAGCGTTTTA ACGGCGCTTA TTACCACAGG AAAAGTAACG CGCTCCTACC TTGGCGTAAG CATTCAAGAT ATTGATGATA ACATTGCAAA AGCAATGAAT GTAAAGGCGG GCGAAGGTGC TTTAGTGGGC ACGGTTATGG AGAATAGCCC TGCCGCACGA GCTGGTATGC AAACAGGTGA TGTTATTTTG GAATTTAATG GCGCAAAAGT AACCAGCAGC GCCGCCTTGC GTAATGCCAT TGCTACGCAA ACGCCCGGCA GCATGGTCTA TATTAGAGTG TTACGCGATG GAGCGCTGAA GTCGTTTGCG GCACGCCTTG AAGAGCAAAC CCCAAAAACC GCAAGTAGCA CAACTCCCGC TAAAAAAGCC GACATTAATA GTGCGCTTGG CTTTCGTGCC GAAGAGCTGA CACCCGAATT GGCGCAGCGC TTAAAGCTGA AAGGGAGCAG CGGCAAAGTG GTGATTACCG CAATTCAGCA ACAATCAACC GCCTATCGTG CAGGCTTGCG TCCGGGCGAT GTGATTCTTT CGGTTAACAA GCAAGCGGTA AGTTCGGTAG CAAGCTATAA CGCATTGGTT AAAAATCTTG CAAAAGGCGA ATTGCTGTTG CTCTTGATTG AGCGCGGGGG GAATAAGAGC TACATTGCCT TTACGCTGTA A
|
Protein sequence | MKKSEKISSR IKKVLLVLSG VAVGALVFSN MEYSVSFNGT TFSNTPSFAT ATSNIADAPI SSLRNFNEAF VQIAESATPS VVTIFTEKTV NQRVVSPFNF FGSPFDDFFG RPDGNSAERK NVRRGIGSGV IVTADGYILT NNHVIDGADV VYVRTADKRR LDAKVIGTDP KTDIAVIKVN QQGLKPIVIG DSDKLRVGEW VIAIGSPLGE NLARTVTQGI VSAKGRANVG LADYEDFIQT DAAINPGNSG GALVNINGEL VGINTAIASR TGGFEGIGFA VPSNMAKSVL TALITTGKVT RSYLGVSIQD IDDNIAKAMN VKAGEGALVG TVMENSPAAR AGMQTGDVIL EFNGAKVTSS AALRNAIATQ TPGSMVYIRV LRDGALKSFA ARLEEQTPKT ASSTTPAKKA DINSALGFRA EELTPELAQR LKLKGSSGKV VITAIQQQST AYRAGLRPGD VILSVNKQAV SSVASYNALV KNLAKGELLL LLIERGGNKS YIAFTL
|
| |