Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET1037 |
Symbol | |
ID | 3229670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | + |
Start bp | 940964 |
End bp | 942085 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637120601 |
Product | serine protease |
Protein accession | YP_181753 |
Protein GI | 57234203 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0643787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AACAAAAATT ATTAAGTCTG TTTCTGGGCA TAGTACTTGT TGTTAGTGTT CTCAGCGGAG GCTGTGATTA CTTATCCCAG CCTATAAACT CTGACAATAC CGACAGCACT TCTACCCCTA TTGATGCAGA CTGGACATTC CCCACTCCCC AGCAGAATCT GCCGGAACTG GCCAACTATG CCATGGTGGT TGCCATGGTA AAACCAGCCG TGGTAGCGGT AGATGTGGAA TACATAACCC AGGATATATT CGGCCGCCAA ACGGTTGCCG TAGCCTCAGG TTCGGGTTTC ATAATAGACC CCAGCGGCTA TATTATTACC AACAACCACG TAGTTGAAGG CGGAAGCACT GTCACCGTCA CCCTTTCAGA CGGCCGTACC TTTACCGCCA GCCAGGTGGT AACAGATTCA CGCACAGACC TGGCGGTAAT CAAGGTGGAT ACACTGGGTG AAGACCTGCC GTTTGTATAT ATAGGTGATT CGTCAGCTTT GGAAGTAGGC GAACCGGTGG CGGCTATCGG CAATGCATTG GGGCTGGGGA TAACCATGAA AGGCGGCTGG ATAAGCCGTC TGGATGCCCA GATAACCGTT GACCAGAGTG TAACCCTGTA CGGTTTGATA GGTACAGATG TAGCCATAAA CGAAGGCAAT TCCGGCGGCC CGCTGGTAAA TATGGCCGGT GAGGTTATCG GCATTACCTC TGCCAAAATA GCGGAAGTGG GGGTGGAAGG GGTAGGCTAC GCTATAAATA TAAACTCCGC CCGCACCTTC ATTGAAGAGC TGGTCAAAAA AGGCTATATT ACCCGGCCTT TTATGGGAGT GGCCGGCATA CTGACCGTAG ACAGTTCAAT CCAGTCATAC TTCAGGCTGG GCATAGACAG AGGGGTGCTT ATCCGGGGCG TGTCTGAAGG CGGACCCGCC GAAAAAGCAG GTCTAATGGC AAATGATGTT ATTCTGGCCA TAAACGGCCA GCCAGTGCTG ACTGATGAAG AACTGATACT AGCTATCCAC GGCAAAAAGA TAGGCGATAA AATAGAGGTC AGCTATTTCC GGGACGGAGT AACCGCTACT GTCACTCTGA CACTGGCAGA GACCCCGCCG CCGGAAAGCT AG
|
Protein sequence | MKKKQKLLSL FLGIVLVVSV LSGGCDYLSQ PINSDNTDST STPIDADWTF PTPQQNLPEL ANYAMVVAMV KPAVVAVDVE YITQDIFGRQ TVAVASGSGF IIDPSGYIIT NNHVVEGGST VTVTLSDGRT FTASQVVTDS RTDLAVIKVD TLGEDLPFVY IGDSSALEVG EPVAAIGNAL GLGITMKGGW ISRLDAQITV DQSVTLYGLI GTDVAINEGN SGGPLVNMAG EVIGITSAKI AEVGVEGVGY AININSARTF IEELVKKGYI TRPFMGVAGI LTVDSSIQSY FRLGIDRGVL IRGVSEGGPA EKAGLMANDV ILAINGQPVL TDEELILAIH GKKIGDKIEV SYFRDGVTAT VTLTLAETPP PES
|
| |