Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1673 |
Symbol | |
ID | 6353980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 1838877 |
End bp | 1840394 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642669278 |
Product | protease Do |
Protein accession | YP_001943694 |
Protein GI | 189347165 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0684799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA GAAAAAAAAA ATTGAAATAC CTGCTGCTGC TCCTTGTGGG AATAGCTGTG GGCAGCGTTG TTTTTGCCAA TGTCGAGTTT AGTTTTCCCG GTCGGAATGC GGGTTTGAAT CTCAGTAACA AACCAAATTA CGCCAATGCG AAAAATAACT TTGAGAATTA CCCGATACAG ACACTTCGAG ATTTCAACGA AGCGTTCGTA AAAATTGCAG AATCGGCAAC ACCTTCGGTG GTGACCATTT TTACCGAAAA GACCGTGAAT AGAAGGGTTA TCAGTCCGTT CGAGTTTTTC GGACGCAGCC CTTTTGATGA TTTTTTTGGT TCGCCGAAGG GGGGGAGCCA GAATGGACGC AAGGAGGTGC AGCGCGGTCT CGGTTCGGGT GTGATTGTGA CGTCGGACGG ATATATCCTT ACCAACAATC ATGTCGTAGA TAAGGCAGAT GCCATCTATA TCCGCACATC CGACAACAGA AAAATAGAGG CTAAAATCAT CGGTACGGAT CCTAAAACCG ATCTTGCCGT GATCAAGGTG AACGTAAGAG GGCTTAAACC CATCATGATA GGGGACAGCG ACCGGCTTCG TGTTGGCGAG TGGGTTATCG CTATCGGCAG TCCTCTCGGA GAGAGCCTCG CCAGAACCGT TACCCAGGGC ATTGTCAGCG CTATCGGACG TTCCAATGTC GGCCTTGCCG ATTACGAGGA TTTTATCCAG ACCGATGCAG CGATCAATCC GGGCAACTCG GGAGGCCCGC TGGTCAATAT CAACGGTGAA TTGGTAGGCA TCAATACCGC AATTGCGAGC CGTACCGGGG GGTTTGAAGG TATTGGTTTT GCGGTTCCTT CAAATATGGC ACAGAAAGTA CTGAATTCAC TCATTACGAC GGGAAAGGTT ACCCGGGGCT ATCTTGGTGT GAGTATTCAG GACATTGACG AGAATATTGC AAAAGGTCTC CAGCTTCAGA GCGCATCCGG GGTGCTTGTC GGTACCGTTG TGGCCGGCAG TCCAGCTGCA AAATCAGGAA TCAGAACAGG TGATGTCATT CTTGATTTCA ACGACAGGAA ATCTACATCG AGTGTTGATC TTCGCAATGA GATTGCCGTG ATGTCTCCGG GGTCCGTGGT TAAAATAAGG ATTCTCAGAG ACGGAGAGGT AAAAGTTTTC AGTGTCCGTC TTGAAGAACA GCCGGATCAG GCTTCTGTTT CTGCTGCACC TGCACAGCAG AACCCCGAAG TGCTTGGTTT CAGGTCACAG GAGCTGACGC CTCAGCTCGC TCAGCAATTT CAGTTGCAGC AGGGTGCCGG AAAAGTGCTT GTCACCGATG TCGACCAGTC AAGCAATGCA TTTCGTGCCG GTCTTCGTGC CGGTGACGTT ATTGTAGCGG TCAACAAGCA GGCGATAAGT TCATATGCAC AGTACGCAGC GCTTCTGCAA AAAGTGAAAG GCGGCGATCT TGTGTTTCTG CTTATCGACC GGCGAGGCAG TAAGGTCTAT TTTGCCTTTA ATGTATAA
|
Protein sequence | MNKRKKKLKY LLLLLVGIAV GSVVFANVEF SFPGRNAGLN LSNKPNYANA KNNFENYPIQ TLRDFNEAFV KIAESATPSV VTIFTEKTVN RRVISPFEFF GRSPFDDFFG SPKGGSQNGR KEVQRGLGSG VIVTSDGYIL TNNHVVDKAD AIYIRTSDNR KIEAKIIGTD PKTDLAVIKV NVRGLKPIMI GDSDRLRVGE WVIAIGSPLG ESLARTVTQG IVSAIGRSNV GLADYEDFIQ TDAAINPGNS GGPLVNINGE LVGINTAIAS RTGGFEGIGF AVPSNMAQKV LNSLITTGKV TRGYLGVSIQ DIDENIAKGL QLQSASGVLV GTVVAGSPAA KSGIRTGDVI LDFNDRKSTS SVDLRNEIAV MSPGSVVKIR ILRDGEVKVF SVRLEEQPDQ ASVSAAPAQQ NPEVLGFRSQ ELTPQLAQQF QLQQGAGKVL VTDVDQSSNA FRAGLRAGDV IVAVNKQAIS SYAQYAALLQ KVKGGDLVFL LIDRRGSKVY FAFNV
|
| |