Gene XfasM23_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXfasM23_0220 
Symbol 
ID6202104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXylella fastidiosa M23 
KingdomBacteria 
Replicon accessionNC_010577 
Strand
Start bp293772 
End bp295217 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content55% 
IMG OID641701758 
Productprotease Do 
Protein accessionYP_001828951 
Protein GI182680791 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.850727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACCGC TACCTACTTT ACTGACGCTA TCTATCGCTG CCGCATTCGG CGGTTTTGCA 
GCCACTGGGA TGAATGCTTG GCTTGATAAC CGCGCCGAAG CGGCATCCAA CACCAATGCC
ATCTCACCAA TATCATCACT GCCAACGGGC ACGGTGCCTC AAACCACGGC TACCAACCAG
CCGCTACCAT CGTTAGCACC CATGCTGCAA CAAGTGATGC CAGCAGTGGT CAGCATCAAC
AGTAAACAAG TGGTACGGGT GCGCAATCCA TTTTTCGACG ACCCGATCCT ACGTCGGCTA
TTCCCAGAGA TCCCCCAAGA ACGGATTAAT GAGTCGCTCG GATCTGGGGT GATCATCGAC
GCGCGCAATG GCTACGTACT TACCAATCAT CACGTGATCG AAAATGCCGA CGCCGTGCAG
GTGACATTAG CAGATGGGCG CAGCTTCAAG GCCGAGTTCC TCGGTTCTGA CGCAGACACC
GACATCGCCT TGATCCGGAT CAAAGCAAAT AAACTGACCG AAATCAAACT CGCAGACAGT
AACAAATTAC GCGTGGGCGA CTTCGTCGTA GCCATTGGTA ACCCGTTCGG CTTTACCCAA
ACGGTGACCT CAGGCATCGT CTCGGCGGTA GGTCGCAGTG GCATCCTCGG CCTGGGTTAC
CAAAACTTCA TCCAAACCGA CGCATCGATC AACCCGGGTA ACTCAGGCGG CGCACTGGTG
AATCTTCATG GCCAGTTGGT TGGCATCAAC ACGGCCAGCT TCAACCCACA GGGCAGCATG
GCTGGCAACA TCGGCTTAGG CCTGGCAATT CCTTCGAATC TAGCGCGCAA CGTCGTCGAG
CAATTGGTCA CGAAAGGCGT TGTGGTACGC GGGACAATCG GCGTACAAAC ACAGAATATT
GATGCACGAA TGGCACAAAG CTTAGGCCTG AGTAATCCAC ACGGCGCATT AGTGACTCGC
GTATTACCCA ATTCCGCTGG TGCCGCAGCC GGACTGCAAC CAGGTGATGT GATCCTGGCA
GCCAATGACC AAAGGGTGGA CAACGCGGAA ACATTGCACA ACTACGAAGG ACTACAGCCC
GTCGGTAGCT CAGTAACACT GGAAGTACAC CGTGGCGGCA AGCCACTCAA AATACGCCTC
ACACTCAAAG AATTGCCACG CGCAATCGCC GGAGAAACGC TGGATTCACG ACTGTCGGGC
GCCATCTTCG TTGACCTGCC AGAGTCCCTC CGTCAATCAG GAATCGGTGG AGTCATGGTC
AACAAAATCA AACACGGCAG CCGCGCTGCG GCCAATGGGT TGGTAGCCGG AGATGTCATC
ATTGCCGCAT CCATCGGTGA ATTCTCTGAT CTGGCGAGCT GGCGGGCAAG CTTTTCCCAC
CCACCACAAC GGCTGATACT GCGTGTGCTG CGCGGTAACG CACAGTATGA TGCGCTGATG
CGCTGA
 
Protein sequence
MRPLPTLLTL SIAAAFGGFA ATGMNAWLDN RAEAASNTNA ISPISSLPTG TVPQTTATNQ 
PLPSLAPMLQ QVMPAVVSIN SKQVVRVRNP FFDDPILRRL FPEIPQERIN ESLGSGVIID
ARNGYVLTNH HVIENADAVQ VTLADGRSFK AEFLGSDADT DIALIRIKAN KLTEIKLADS
NKLRVGDFVV AIGNPFGFTQ TVTSGIVSAV GRSGILGLGY QNFIQTDASI NPGNSGGALV
NLHGQLVGIN TASFNPQGSM AGNIGLGLAI PSNLARNVVE QLVTKGVVVR GTIGVQTQNI
DARMAQSLGL SNPHGALVTR VLPNSAGAAA GLQPGDVILA ANDQRVDNAE TLHNYEGLQP
VGSSVTLEVH RGGKPLKIRL TLKELPRAIA GETLDSRLSG AIFVDLPESL RQSGIGGVMV
NKIKHGSRAA ANGLVAGDVI IAASIGEFSD LASWRASFSH PPQRLILRVL RGNAQYDALM
R