Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | XfasM23_0220 |
Symbol | |
ID | 6202104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xylella fastidiosa M23 |
Kingdom | Bacteria |
Replicon accession | NC_010577 |
Strand | - |
Start bp | 293772 |
End bp | 295217 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641701758 |
Product | protease Do |
Protein accession | YP_001828951 |
Protein GI | 182680791 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.850727 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACCGC TACCTACTTT ACTGACGCTA TCTATCGCTG CCGCATTCGG CGGTTTTGCA GCCACTGGGA TGAATGCTTG GCTTGATAAC CGCGCCGAAG CGGCATCCAA CACCAATGCC ATCTCACCAA TATCATCACT GCCAACGGGC ACGGTGCCTC AAACCACGGC TACCAACCAG CCGCTACCAT CGTTAGCACC CATGCTGCAA CAAGTGATGC CAGCAGTGGT CAGCATCAAC AGTAAACAAG TGGTACGGGT GCGCAATCCA TTTTTCGACG ACCCGATCCT ACGTCGGCTA TTCCCAGAGA TCCCCCAAGA ACGGATTAAT GAGTCGCTCG GATCTGGGGT GATCATCGAC GCGCGCAATG GCTACGTACT TACCAATCAT CACGTGATCG AAAATGCCGA CGCCGTGCAG GTGACATTAG CAGATGGGCG CAGCTTCAAG GCCGAGTTCC TCGGTTCTGA CGCAGACACC GACATCGCCT TGATCCGGAT CAAAGCAAAT AAACTGACCG AAATCAAACT CGCAGACAGT AACAAATTAC GCGTGGGCGA CTTCGTCGTA GCCATTGGTA ACCCGTTCGG CTTTACCCAA ACGGTGACCT CAGGCATCGT CTCGGCGGTA GGTCGCAGTG GCATCCTCGG CCTGGGTTAC CAAAACTTCA TCCAAACCGA CGCATCGATC AACCCGGGTA ACTCAGGCGG CGCACTGGTG AATCTTCATG GCCAGTTGGT TGGCATCAAC ACGGCCAGCT TCAACCCACA GGGCAGCATG GCTGGCAACA TCGGCTTAGG CCTGGCAATT CCTTCGAATC TAGCGCGCAA CGTCGTCGAG CAATTGGTCA CGAAAGGCGT TGTGGTACGC GGGACAATCG GCGTACAAAC ACAGAATATT GATGCACGAA TGGCACAAAG CTTAGGCCTG AGTAATCCAC ACGGCGCATT AGTGACTCGC GTATTACCCA ATTCCGCTGG TGCCGCAGCC GGACTGCAAC CAGGTGATGT GATCCTGGCA GCCAATGACC AAAGGGTGGA CAACGCGGAA ACATTGCACA ACTACGAAGG ACTACAGCCC GTCGGTAGCT CAGTAACACT GGAAGTACAC CGTGGCGGCA AGCCACTCAA AATACGCCTC ACACTCAAAG AATTGCCACG CGCAATCGCC GGAGAAACGC TGGATTCACG ACTGTCGGGC GCCATCTTCG TTGACCTGCC AGAGTCCCTC CGTCAATCAG GAATCGGTGG AGTCATGGTC AACAAAATCA AACACGGCAG CCGCGCTGCG GCCAATGGGT TGGTAGCCGG AGATGTCATC ATTGCCGCAT CCATCGGTGA ATTCTCTGAT CTGGCGAGCT GGCGGGCAAG CTTTTCCCAC CCACCACAAC GGCTGATACT GCGTGTGCTG CGCGGTAACG CACAGTATGA TGCGCTGATG CGCTGA
|
Protein sequence | MRPLPTLLTL SIAAAFGGFA ATGMNAWLDN RAEAASNTNA ISPISSLPTG TVPQTTATNQ PLPSLAPMLQ QVMPAVVSIN SKQVVRVRNP FFDDPILRRL FPEIPQERIN ESLGSGVIID ARNGYVLTNH HVIENADAVQ VTLADGRSFK AEFLGSDADT DIALIRIKAN KLTEIKLADS NKLRVGDFVV AIGNPFGFTQ TVTSGIVSAV GRSGILGLGY QNFIQTDASI NPGNSGGALV NLHGQLVGIN TASFNPQGSM AGNIGLGLAI PSNLARNVVE QLVTKGVVVR GTIGVQTQNI DARMAQSLGL SNPHGALVTR VLPNSAGAAA GLQPGDVILA ANDQRVDNAE TLHNYEGLQP VGSSVTLEVH RGGKPLKIRL TLKELPRAIA GETLDSRLSG AIFVDLPESL RQSGIGGVMV NKIKHGSRAA ANGLVAGDVI IAASIGEFSD LASWRASFSH PPQRLILRVL RGNAQYDALM R
|
| |