Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_0602 |
Symbol | |
ID | 6159880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | - |
Start bp | 651711 |
End bp | 653261 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641663352 |
Product | protease Do |
Protein accession | YP_001789642 |
Protein GI | 171057293 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00000000166487 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTGAAC ATTCGTCGCG CACGAGCGTG CACAAGCGAT CCGATTCCAC CCCTCGCAGC GTTGCCGCCC TGCGCTGGCT CGCCCCCCTG CTGACGGCGG CGTGCCTGCT GCCGACACCG GCCCTGGCCC AGTCGGGCGG CGCGCAGGCT GCAGCTGCCC AGGCCGGCGC GGCGGCACCG CTGGTGCGCG GACTGCCCGA CTTCACCGAA CTGGTCGAGC AGGTCGGCCC GGCGGTGGTC AACATCCGCA CCACCGCCAA GGCCCGCACG GCGCGCTCGG ACAACCCGGC CGACGAGGAG ATGCAGGAGT TCTTCCGCCG CTTCTTCGGC GTGCCGATCC CGCGCCAGGG CCCACGTCAG GGGCCGCCCG GCCAGGGCCA GAGCGAAGAA GAAGCGGTGC CGCGCGGCGT CGGCTCGGGC TTCATCGTCA GCAGCGACGG CTTCGTGATG ACCAACGCGC ATGTGGTCGA GGGCGCCGAC GAAGTCACGG TGCGCCTGAC CGACAAGCGC GAGTTCAAGG CCCGCGTGGT GGGCGCCGAC AAGCGCACCG ACATCGCGGT GCTCAAGCTC GACGCCACCG GCCTGCCGGC GGTGCGCCTG GGCGACGTCA GCCGTCTCAA GGTCGGCGAA TGGGTGATCG CGATCGGCTC GCCCTTCGAT CTCGACAACA CGGTGACGGC CGGCATCGTC AGCGCCAAGG CGCGTGACAC CGGCGACCTG GTGCCGTTCA TCCAGACCGA CGTGGCGATC AACCCCGGCA ACTCCGGCGG GCCGCTGATC AACCTGCGCG GCGAGGTGGT GGGCGTGAAC TCGCAGATCT ACAGCCGCTC GGGCGGCTAC ATGGGCATCT CGTTCGCGAT CCCGATCGAC GAGGCCAGCC GCGTGGCCGA CCAGCTGCGC ACCAGCGGCC GGGTGGTGCG CGGGCGCATC GGCGTGCAGA TCGGCGAGGT CACCAAGGAC GTGGCCGAGT CGCTCGGCCT GGGCAAGGCG GCCGGCGCGC TGGTGCGCTC GGTCGAGGAC GGCAGCCCGG CCGGCAAGGC GGGCCTGGAA GCCGGTGACA TCGTGACGCG CTTCGACGGC AAGCCGGTCG AGAAGTGGAA CGACCTGCCG CGCCTGGTCG GCAAGACCGC ACCGGGCACC AAGACCACGA TCCAGGTGTT CCGCCGCGGC AGCATGCGCG ATCTCAGCGT CACCGTCGCC GAGCTCGAAG CCGAAGCCGC CGCCAGGCCG GCCAGCACCG AGCCCGCACC GGCCAAGCCG GCCGCACCCG CCACGGTCAG CCTGCTGGGC CTGACGGTGA GCGACCTGAG TGCCAAGCAG CGCGAGGAGC TCAAGGTCAA GGGCGGCGTG CGGGTCGACG CGGTGGACGG CGCGGGCGGG CGGGCCGGCC TGCGCGAGGG CGACATCATC CTGGCGGTGG CCAACACCGA GATCACCAAC CTGCGCCAGT TCGAGGCGGT GGTCGGCAAG CTCGACAAGA GCAAGCCGGT CAACCTGCTG TTCCGCCGCG GCGAGTGGGC CCAGTACGCG GTGATCCGGC CCGGCAAATG A
|
Protein sequence | MSEHSSRTSV HKRSDSTPRS VAALRWLAPL LTAACLLPTP ALAQSGGAQA AAAQAGAAAP LVRGLPDFTE LVEQVGPAVV NIRTTAKART ARSDNPADEE MQEFFRRFFG VPIPRQGPRQ GPPGQGQSEE EAVPRGVGSG FIVSSDGFVM TNAHVVEGAD EVTVRLTDKR EFKARVVGAD KRTDIAVLKL DATGLPAVRL GDVSRLKVGE WVIAIGSPFD LDNTVTAGIV SAKARDTGDL VPFIQTDVAI NPGNSGGPLI NLRGEVVGVN SQIYSRSGGY MGISFAIPID EASRVADQLR TSGRVVRGRI GVQIGEVTKD VAESLGLGKA AGALVRSVED GSPAGKAGLE AGDIVTRFDG KPVEKWNDLP RLVGKTAPGT KTTIQVFRRG SMRDLSVTVA ELEAEAAARP ASTEPAPAKP AAPATVSLLG LTVSDLSAKQ REELKVKGGV RVDAVDGAGG RAGLREGDII LAVANTEITN LRQFEAVVGK LDKSKPVNLL FRRGEWAQYA VIRPGK
|
| |