Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_4771 |
Symbol | |
ID | 5421544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 5296501 |
End bp | 5297991 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640884034 |
Product | protease Do |
Protein accession | YP_001419646 |
Protein GI | 154248688 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.411686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000180606 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACGAC AGTTCGCATT CACGCTCCTT CTGCTGTGCT CCGCCGCCGT GGGGGCGGCT GCCATGGCGG CCGGCGCGGG GCCGGGTGAA GCGCCCGCGC GCGCCGCCAG CGTCCCGCTC CGGGTCGAGG CGCAGGCGCA GATCCAACCG CAGGTGCCGG CCTCCGCCGC CGAGGTGAAG CTCTCGTTTG CGCCCATCGT GGCGCGCACC GCGCCGGCGG TGGTCAATGT CTATGCCCAG AAGGCCGCCC AGCAGCGGGC CAATCCCATT TTCGACGACC CGTTCTTCCG CCGGTTCTTC GGCGGCCAGG GTGGCCCCGG CCTGCGCGCG CCGGAGCGGG TGCAGCGCTC CCTGGGGTCG GGGGTGGTGG TGGACCCCTC GGGCATCGTG GTCACCAATT TCCATGTCAT CGAGGGGGCG GACGAGATCC GCATCGCCCT CAACGACCGG CGCGAATACG AGGCAGAAGT GCTGCTGCGC GACCAGCGCA CTGACCTTGC CGTGCTCCGC ATCAAGACCG AGGCCAAGGA GACCTTCGCG TCCATCGAGC TCGGCAATTC GGACGAACTG GCCGTGGGCG ACCTGGCGCT GGCCATCGGC GATCCCTTCG GCGTCGGCCA GACGGTGACG CAGGGCATCA TCTCCGCGCT GGCGCGGACC CAGGTGGGGA TCTCCGACTA CCAGTTCTTC ATCCAGACCG ACGCGGCGAT CAATCCCGGC AATTCCGGCG GTGCGCTGGT GGACATGGCG GGCCGGCTGG TGGGCATCAA CAGCGCCATC TATTCGCGCT CGGGCGGCTC CATCGGCATC GGCTTCGCCA TTCCGGTGAA CATGGTGCGG GTGGTGGTGG AGCAGGCCAA GGGCGGCTCC AAGTCGGTGC GCCGGCCCTG GCTCGGGGCA AAGCTCCAGC GGGTCACGCC CGACATCGCC GAAAGCCTCG GCCTGCCGCG TCCCACCGGT GTGCTGGTGC AGGACGTGGT GGCCCACAGC CCGGCCGCCA AGGCGGGCCT CAGGCTGGGG GACCTCATCG TCGCGGTGGA GGGGCAGCCC ATCGACGACC CCGAGTCCCT CAACTACCGC TTCGCCACCC GGCCCATCGG CGGCAAGGCG GCGCTGGCGG TCAACCGCGG CGGCAAGGAG GTGAGCCTCG TGGTGGTGCT GGAGGGCGCG CCGGAGACCG TACCGCGCGA CGAACTGGTG GTGCGTGCCC GCTCGCCCTT CACCGGCGCC ACGGTGGTGA ACCTGTCACC GGCGGTGGCC GAGGAACTGA AGGTGGATGC CAACGCCACC GGCGTGGTGA TCTACGACGT GGACGACAAT TCCCCCGCCG CCGCCGCCGG CTTCAAGCCC GGCGACGTGC TGGTCGAGGT GAACGGCGAG AAGGTCGCCC GGACCCGCGA CCTGGAGCAG ATGGTTCGCA CGCCGCTCCG GGCCTGGCGG GTGACGGTGG GCCGCGCCGG GCGCACCATC ACCGCTCTGC TGCCGGGCTG A
|
Protein sequence | MKRQFAFTLL LLCSAAVGAA AMAAGAGPGE APARAASVPL RVEAQAQIQP QVPASAAEVK LSFAPIVART APAVVNVYAQ KAAQQRANPI FDDPFFRRFF GGQGGPGLRA PERVQRSLGS GVVVDPSGIV VTNFHVIEGA DEIRIALNDR REYEAEVLLR DQRTDLAVLR IKTEAKETFA SIELGNSDEL AVGDLALAIG DPFGVGQTVT QGIISALART QVGISDYQFF IQTDAAINPG NSGGALVDMA GRLVGINSAI YSRSGGSIGI GFAIPVNMVR VVVEQAKGGS KSVRRPWLGA KLQRVTPDIA ESLGLPRPTG VLVQDVVAHS PAAKAGLRLG DLIVAVEGQP IDDPESLNYR FATRPIGGKA ALAVNRGGKE VSLVVVLEGA PETVPRDELV VRARSPFTGA TVVNLSPAVA EELKVDANAT GVVIYDVDDN SPAAAAGFKP GDVLVEVNGE KVARTRDLEQ MVRTPLRAWR VTVGRAGRTI TALLPG
|
| |