Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_1661 |
Symbol | |
ID | 5424439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 1880330 |
End bp | 1881826 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640880907 |
Product | protease Do |
Protein accession | YP_001416563 |
Protein GI | 154245605 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.179012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCACC CGCTTCGTTC CTCAGCTCCT GCCGCCCGCC TGCCCGCCTT CGTCCTGGCA GCCGCCCTCG CACTCCTCAC CGCGACCTTT TCCACCCTCG CCCCGGCTCC GCTGCTGGCG GCGCCCGGCA AGGGGCCGGA CACGGTGGCG GACGTGGCCG AGCGGGTCAT GGACGCGGTG GTGAACATCT CCACCTCGCA GAACGTGGCT GCCTCGCGCT CGGTGCCCCA GCCGCAGCTG CCGCCGGGCT CGCCGTTCGA GGACTTCTTT GATGAATTCT TCAAGAAGAG GCAGGATGAC GGCGGCGAGC AGCGCTCGCG CCGCGTGTCC TCCCTCGGCT CGGGCTTCGT CATCGACCCG TCCGGCCTCA TCGTGACCAA CAATCACGTG ATCGCCGACG CCGACGAGAT CTTCGCCAAT TTCAACGACG GCTCGAAGCT GAAGGCCGAG CTGATCGGCC GCGACACCAA GACCGACCTC GCCCTGCTGC GTGTGAAGCC CGACAAGCCG CTGGTGGCGG TGAAGTTCGG CGACAGCGAG AAGCTGCGGG TGGGCGACTG GGTGATGGCC ATCGGCAACC CGTTCGGCCT CGGCGGCACG CTGACGGTTG GCGTGGTCTC CGCCCGCAAC CGCGACATCA ATTCCGGCCC CTACGACAAT TTCATCCAGA CCGACGCCGC CATCAATCGC GGCAATTCCG GCGGCCCGCT GTTCAACATG GACGGCGAGG TGATCGGCAT CAACACCGCC ATCATCTCGC CCTCGGGCGG CTCCATCGGC ATCGGCTTCG CGGTGCCGTC CGCCACCGCC AAGCCGGTGA TCGCCCAGCT CGGCGAGTTC AAGGAAGTGC GGCGCGGCTG GATCGGGGTG CGCATCCAGC CGGTCACCGA CGACATCGCC GAGAGCCTCG GCCTCGGCAA GCCGCGCGGC GCGCTCATCG GCGTGGTCTC GGAGAACAGC CCGGCGGCGC GGGGCGGCAT CAAGGCCGGC GACGTGATCG TGAAGTTCAA TGGCCGCGAC ATCAAGGAGG TGCGCGACCT CACCCGCACC GTGGCCGACG CCATGGCGGA CACCGAGGTG GAGGTGGTGG TGGTCCGCAA GGGCAAGGAA GAGACCCTGC ACCTCAAGGT CGCCCGCATG CCCGAGGACG ATAAGCCGGA GACCCCCAAG CCCGCCGCCG AGTCGCAGAA AGTAGCGCCC AAAAAAGCCC TCGGCATCGA GCTCTCCGCC ATGAGCGACG AGCTGCGCAA GCGCTATTCC ATCAAGAGCG AGATCAACGG CGTGGTCATC ACCGGCGTCG ATGCCTCCAC CCCTGCCTCG GACAAGGGGC TGAAGGCCGG GCAGGTCATC GTCCAGGTGG GGCAGGAAGC GGTGGCGAGC CCCTCCGACG TGGAGAAGCA GGTGGATGCC CTGCGCAAGC AGGGCAAGCG CTCCGCCCTG CTCCTCGTCT CCGATGGCGA GGGCAAGCAG GAGTTCGTGA CGATCCCGCT GAACTGA
|
Protein sequence | MPHPLRSSAP AARLPAFVLA AALALLTATF STLAPAPLLA APGKGPDTVA DVAERVMDAV VNISTSQNVA ASRSVPQPQL PPGSPFEDFF DEFFKKRQDD GGEQRSRRVS SLGSGFVIDP SGLIVTNNHV IADADEIFAN FNDGSKLKAE LIGRDTKTDL ALLRVKPDKP LVAVKFGDSE KLRVGDWVMA IGNPFGLGGT LTVGVVSARN RDINSGPYDN FIQTDAAINR GNSGGPLFNM DGEVIGINTA IISPSGGSIG IGFAVPSATA KPVIAQLGEF KEVRRGWIGV RIQPVTDDIA ESLGLGKPRG ALIGVVSENS PAARGGIKAG DVIVKFNGRD IKEVRDLTRT VADAMADTEV EVVVVRKGKE ETLHLKVARM PEDDKPETPK PAAESQKVAP KKALGIELSA MSDELRKRYS IKSEINGVVI TGVDASTPAS DKGLKAGQVI VQVGQEAVAS PSDVEKQVDA LRKQGKRSAL LLVSDGEGKQ EFVTIPLN
|
| |