Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_1880 |
Symbol | |
ID | 6027150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | - |
Start bp | 1979881 |
End bp | 1981041 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641594698 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001718005 |
Protein GI | 169832023 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0947444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTGGA GACGGAAGTT AGTGTTCACC ACCCTTTTGG CTCTGGTAGC CGGGATAATG TTCGCCGCCG GCAATATGGC GACCAGGGAT TTTTGGCCAC AGGAGCCCCT CGTCAACACC AACAACCGCA CGGCGGTGGC CGCTCCCGAA TTGCCCGGCG TAAACCCGGG TACGATCGTC GAAATCGTTG GCCGGGCCGG GCCGGCGGTA GTGAAAATCG ACACAGTGGT ACCAACCTCC GGCAGGGAAT GGAGCCCGTT CTTTGACGAT CCCTTCTTCC GTGACTTTTT CGGCCTTCCC GACATTTCAC CCCCGCAGTC GAGCCCCAGC CGCCGGGGTA TGGGGTCCGG GTTTCTGTTT TCCGAGGACG GTTACATCCT GACTAATGAA CACGTAATCC GCGGTGCCGA GGAGATCTGG GTGACCCTGA CCGGGTTCGA AACACCGCTG GCCGCAAAGG TGGTGGGTTC GGACTACGAC CTCGACCTGG CGGTGCTCCG GGTGAACGCC CCGCGCAAGC TGCCCCACCT GAAACTTGGT GATTCGGACA ATGTCCGGGT TGGTGAGTGG GTGATCGCTA TCGGCAACCC CTACGGCCTG GACCATACGG TCACCGTCGG GGTGATCAGC GCCAAGGGCC GCCCGGTGAC CATCGAGGAC CGGTACTACG ACAATCTCCT GCAGACTGAC GCCTCCATCA ACCCAGGGAA CAGCGGCGGC CCGCTGTTGA ATCTCCGGGG CGAGGTCGTG GCGATCAACA CGGCTGTCAA CGCCCAGGCC CAGGGGATCG GCTTCGCCAT CCCGACCAGC ACCATCCGTC CCGTCCTGGA TGAGTTGATC AGAACCGGCG GCATCAGTCA CGCTTGGCTG GGTGTGCAGT TGGACACTGT TTCTCCGGAA CTGGCCCGGT ACCTGAAACT TCGGGGCACT ACCGGGGCGC TGGTAATCGG GATTGTGGCC GACAGCCCGG CGGCCAGAGC CGGTTTCCGG CCGGGTGACG TGATCCTGGA GTTAAACGGG GCTCCGGTGA ACAACCCGGA AAAAGTGATC CGGGCGATCA GGACCCATAA GGCGGGCGAA ACCCTAAAGG TGAAGATATT CCGGGACGGC AGCGTCCGCG AGCTGGAAGT GAAACTGGGC GAGAAACCGG TCCGCCGTTA A
|
Protein sequence | MSWRRKLVFT TLLALVAGIM FAAGNMATRD FWPQEPLVNT NNRTAVAAPE LPGVNPGTIV EIVGRAGPAV VKIDTVVPTS GREWSPFFDD PFFRDFFGLP DISPPQSSPS RRGMGSGFLF SEDGYILTNE HVIRGAEEIW VTLTGFETPL AAKVVGSDYD LDLAVLRVNA PRKLPHLKLG DSDNVRVGEW VIAIGNPYGL DHTVTVGVIS AKGRPVTIED RYYDNLLQTD ASINPGNSGG PLLNLRGEVV AINTAVNAQA QGIGFAIPTS TIRPVLDELI RTGGISHAWL GVQLDTVSPE LARYLKLRGT TGALVIGIVA DSPAARAGFR PGDVILELNG APVNNPEKVI RAIRTHKAGE TLKVKIFRDG SVRELEVKLG EKPVRR
|
| |