Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_2012 |
Symbol | |
ID | 6026620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | - |
Start bp | 2117565 |
End bp | 2118581 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641594834 |
Product | metalloendopeptidase glycoprotease family |
Protein accession | YP_001718135 |
Protein GI | 169832153 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000649848 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTGA TCTTGCTGGG CGTGGAAACC TCCTGCGACG AGACTGCGGC GGCGGTGGTG GAGAACGGTC ATCTGGTCCG TTCCAACACC ATCGCTTCGC AGTTTGACCT GCACGGCAAG TTCGGCGGTG TCGTCCCCGA GGTGGCCTCC CGCCGCCACC TGGAGAGCAT AAACCCCGTC ATCCGGCAGG CGCTGGAGGA AGCCGATGTC TCTTTCCGGG ATCTCGACGG GGTCGCGGTC ACCTACGGCC CGGGTCTGGC CGGGTCGCTT TTGGTGGGGT TGATGGCGGC CAAGACCATC GCCTATGCAC TGGACATCCC CCTGTTCGGC ATCAACCACC TGGAGGCCCA CATTTACGCG AACTTTCTGG TAGCACCCGA CCTGCCCTTT CCCCTGCTCT GCCTGATCGT ATCCGGCGGG CACACCGATC TTGTGCTCAT CACGCGGCTC GGCGAGTACC GTCTTCTGGG TCGTACCCGT GATGACGCCG CGGGCGAGGC CTTCGACAAG GTGGCCCGGG TCCTGGAACT GGGTTATCCG GGCGGCCCGC TCATCGAAAA GCTGGCCCGG GAGGGGGACC CGGAGGCGGT GCCTTTTCCC CGCGCCTACC TGGAGGAGGG CACCCTGGAC TTCAGCTTCA GCGGCCTGAA AACGGCGGTG ATCAACTACC TGGACCGGGC GCGGCGCGAG GGCCGGAAAG TGGCCGAGGC CGACGTGGCG GCCGGTTTTC AGGAGGCGGT GGTGGGCGTG CTGGTGGACA AGGTACTGGC CGCGGCGCGA GTCCACCGCC CGGCCCGCAT CCTGCTGGCC GGCGGGGTGG CCGCCAACCG CGTGCTGGCC CGGGAGTTGG AGCGGCGGGC GGCGGCAGAG GGTTTCGGCG TCACCGTGCC GCCGCCCGTA TTCTGCACCG ACAACGCGGC CATGGTCGCC TGCGCCGGTT ACTACCGCTA CCTGCACGGG GATTCGTCCC CCCTGACTTT AAACGCCCTG GCCGGGCTTG GTCTGGGCTG TGAATGA
|
Protein sequence | MSVILLGVET SCDETAAAVV ENGHLVRSNT IASQFDLHGK FGGVVPEVAS RRHLESINPV IRQALEEADV SFRDLDGVAV TYGPGLAGSL LVGLMAAKTI AYALDIPLFG INHLEAHIYA NFLVAPDLPF PLLCLIVSGG HTDLVLITRL GEYRLLGRTR DDAAGEAFDK VARVLELGYP GGPLIEKLAR EGDPEAVPFP RAYLEEGTLD FSFSGLKTAV INYLDRARRE GRKVAEADVA AGFQEAVVGV LVDKVLAAAR VHRPARILLA GGVAANRVLA RELERRAAAE GFGVTVPPPV FCTDNAAMVA CAGYYRYLHG DSSPLTLNAL AGLGLGCE
|
| |