Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_2136 |
Symbol | |
ID | 6355930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 2354324 |
End bp | 2355562 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642669727 |
Product | peptidase M16 domain protein |
Protein accession | YP_001944139 |
Protein GI | 189347610 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGTA TTCAAAGCAG CACCCTGAAA AACGGCCTCC GGGTGATAAC CGATCATGTC CCCTGGGTAC AGAGCGTCAC CCTCGGCATT CATATCGACG TCGGCTCCCG GGACGATCCC GATAAAAAAA GCGGCCTTGC GCATTTTCTT GAACACGCGG TCTTCAAGGG AACAAAAAAA CGGGACTATA TCGAGATCGC CTGCGGAATC GAACGAAACG GCGGTTATCT CGATGCCTAT ACGACAAAGG AACAGACCTG TATCTACCTG CGGTGCCTCG ACCGGTTTAC CGAACCCGCT CTCGATCTTC TGGCCGACCT GGTCTGCAAC CCGGTATTTC CCGAAGAGGA AATCGAAAAA GAAAAAGAGG TGGTTCTCGA AGAGATCAGC AGCATCAACG ACACGCCGGA AGAGGTCGTT TTCGAGGATT TTGACCGCTA CCTTTTCCGC CGTCACCCGC TCGGAACCCC TATTCTTGGA ACGGACAAAA GCGTCAGCAA CTTTGAAAGC AGCGATCTCA CCGCTTTCAT GGCCAACTTT TACCGGCCTG AAAACATGTT TTTAACGGCG ACCGGCAATA TCCGTCACGC GGAACTCGCA AAACTGGCGG AACGCTGTTT TTCAACGTTA TCCCAAAACC TTACCCCTGC CCCTGAAAGA AAACCATTTC TTCCCGGACA ATACAAAGCG TTCTCCCGGA CAGTCAAAAA AAGAGCCCAT CAGGCCCAGA TTGTGCTTGG CTCAGCAGTT GCCCGTCACG ACAGATCTTT TTACAGCCTG ATGGTACTGA ACACCCTCCT TGGCAGCGGA ATGAGCTCGA TACTGAACCT TGAACTGCGG GAAAAGCTTG CCCTTGTTTA CAGTACCTAT TCATCAATTG CATTCTATGA TGATCTGACG GTCATGAACA TCTATGCGGG AACCGACAGC AACAAAATCA CACAGACACT CGATGTCCTC GCATCTGTCA TGAAAAGCCC TGAACTTATA GCTCCTGCCA AAGAAGAGCT GCGGTCGGCA AAATCAAAAC TCCTCGGTTC GTTCCTGATG GGCACGGAAA AAATGACACG CCGCATGTCG CACCTGGCCA CCGATCTCTC GTATTTCGGA AAATATATCC CGCTCGAAGA GAAAACCGCC GCCATTGAGA ACGTTACGGT AACGGATATA ACAACAGCTG CCCGAATGCT GCTTGAAGAA GTTCCGCTCT CGACTCTCGT GTTCAAACCC ATGCGATAA
|
Protein sequence | MEGIQSSTLK NGLRVITDHV PWVQSVTLGI HIDVGSRDDP DKKSGLAHFL EHAVFKGTKK RDYIEIACGI ERNGGYLDAY TTKEQTCIYL RCLDRFTEPA LDLLADLVCN PVFPEEEIEK EKEVVLEEIS SINDTPEEVV FEDFDRYLFR RHPLGTPILG TDKSVSNFES SDLTAFMANF YRPENMFLTA TGNIRHAELA KLAERCFSTL SQNLTPAPER KPFLPGQYKA FSRTVKKRAH QAQIVLGSAV ARHDRSFYSL MVLNTLLGSG MSSILNLELR EKLALVYSTY SSIAFYDDLT VMNIYAGTDS NKITQTLDVL ASVMKSPELI APAKEELRSA KSKLLGSFLM GTEKMTRRMS HLATDLSYFG KYIPLEEKTA AIENVTVTDI TTAARMLLEE VPLSTLVFKP MR
|
| |