Gene Information |
Locus tag | Avin_13730 |
Symbol | mucD |
ID | 7760311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1334230 |
End bp | 1335651 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643804268 |
Product | HtrA serine protease |
Protein accession | YP_002798567 |
Protein GI | 226943494 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
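The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1334230, 1335651)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Both results agree with the Gene Length (1422 bp) and Protein Length (473 aa) fields in the record.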
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAGG TTGGTTTTAA GTCCTTCGCC TCCGTTTTGG CAGGTGCCCT GCTGCTCGGG CAGTCGCTCT TCGTGCAGGC GCAATTACCG GAGTTCACCT CGCTGGTGGA GGAAGCCTCG CCGGCGGTAG TCAATATCAG TACCCGGCAA AAGCTTCCCG ATCGCTCCAC GGTTCAGGGA TTGCCTGATC TCGAGGGGCT TCCGCCGCTT TTCAGGGAGT TCCTGGAGCG CAGCATTCCG CAACTTCCGC GTACTCCGGA TAACGGGCGG CAGCGTGAGG CGCACTCCCT GGGCTCGGGT TTCATCATTT CTCCAGATGG CTATGTTCTA ACCAACAACC ATGTGGTGGC CGATGCCGAT GAAATCATCG TGCGTTTGTC CGATCGCAGT GAGCTCGAGG CCGAGCTGGT CGGGGCCGAT CCTCTTACCG ATGTAGCTTT GTTGAAGGTC AAGGGTTCGA ATCTCCCCAC AGTCAAACTG GGACGTACCG ACCAATTGAG AGTCGGGGAA TGGGTTCTGG CCATCGGTTC CCCTTTCGGT TTCGATCATT CCGTGACTGC GGGCATCATC AGTGCCACGG GGCGAAGCCT GCCGAACGAG AGTTACGTTC CTTTCATCCA GACCGATGTG GCCATCAATC CCGGTAACTC CGGCGGGCCG CTCTTCGATC TGGATGGACG GGTCATAGGC ATCAACTCCC AGATATTCAC CCGGTCGGGG GGCTTCATGG GCTTGTCTTT CGCGATTCCC ATCGAGGTTG CCATGGGCGT GGCCGATCAG TTGAAGGCCA CTGGCAAGGT TGCTCGCGGT TGGTTGGGAG TAATCATTCA GGAAGTCAAC AAGGATCTGG CTGAGTCCTT TGGTCTGGAT CGGCCGGCCG GAGCGTTGGT CGCCCAGGTC TTGGAGGATG GACCGGCGGA CAAGGGCGGT TTGCAGGTCG GCGATGTCAT TCTCAGCCTA GATGGTCATC CTATTGTGAT GTCGGCCGAT CTGCCGCATC TGGTTGGGGG GCTCAAACCC GGGGCTGCAG CCAATCTCGA GGTGGTGCGT GACGGCAAGC GGAGGAACAT CGCTATCACT GTCGGGGCCT TGCCGGAGGA GGGGAATGGG GTTCAGCCGA GTATCGCGGG CACGGAACAG AGCAGCAATC GCCTTGGAGT GACGGTCACC GAACTGACGG CTGAGCAGAA GAAATCCCTT GATCTCAAGG GTGGTGTGGT CATTCGCGAA GTGCTGAACG GTCCGGCCGC ATTGATCGGA CTGCGGCCTG GCGATGTAGT TACCCATTTG AACAATCAGC CGATCGACTC GGCGAAGACC TTTGCCGAAG TGGCCGGTGC GTTGCCAAAA GGCCGGTCGG TATCCATGAG GGTGCTGCGC CAGGGGCGTG CCAGTTTCAT TACTTTCAAA CTGGCCGAGT GA
|
Protein sequence | MSKVGFKSFA SVLAGALLLG QSLFVQAQLP EFTSLVEEAS PAVVNISTRQ KLPDRSTVQG LPDLEGLPPL FREFLERSIP QLPRTPDNGR QREAHSLGSG FIISPDGYVL TNNHVVADAD EIIVRLSDRS ELEAELVGAD PLTDVALLKV KGSNLPTVKL GRTDQLRVGE WVLAIGSPFG FDHSVTAGII SATGRSLPNE SYVPFIQTDV AINPGNSGGP LFDLDGRVIG INSQIFTRSG GFMGLSFAIP IEVAMGVADQ LKATGKVARG WLGVIIQEVN KDLAESFGLD RPAGALVAQV LEDGPADKGG LQVGDVILSL DGHPIVMSAD LPHLVGGLKP GAAANLEVVR DGKRRNIAIT VGALPEEGNG VQPSIAGTEQ SSNRLGVTVT ELTAEQKKSL DLKGGVVIRE VLNGPAALIG LRPGDVVTHL NNQPIDSAKT FAEVAGALPK GRSVSMRVLR QGRASFITFK LAE
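As a rough check that the protein sequence follows from the gene sequence, the sketch below translates DNA codon by codon and also computes GC content. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in codon meanings), and it ignores the spaces that split the sequences into blocks of ten. The helper names are hypothetical, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in tens) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the tail of the gene sequence from this record.
print(translate("ATGTCAAAGG TTGGTTTTAA GTCCTTCGCC"))  # MSKVGFKSFA
print(translate("CTGGCCGAGT GA"))                     # LAE
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.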