Gene Xfasm12_2164 details


Gene Information

Locus tag: Xfasm12_2164
Symbol:
ID: 6120135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xylella fastidiosa M12
Kingdom: Bacteria
Replicon accession: NC_010513
Strand:
Start bp: 2272158
End bp: 2273864
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 54%
IMG OID: 641650102
Product: protease
Protein accession: YP_001776647
Protein GI: 170731214
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
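
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2272158
END_BP = 2273864
GENE_LENGTH_BP = 1707
PROTEIN_LENGTH_AA = 568


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span is 1707 bp, which is 569 codons, or 568 residues plus the stop codon.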


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCGCA TGTCGAAGGT GGCGGTTGCA ACATTTAAAC TGGTGACCTC GCCAGAATTT 
CGCGACAGGA GCTTCACTTT GCGCCGACTA TCACTTGCTA TTGCCACTGC TCTCGTATTG
CCTCTGGGTG GTATTGGTCT CAATGCTCAG GAAAGCCGCT TACCTGACAT TGGTTCCTCG
GCGGGGCAAT TACTGACTCC GGCACGTCAG GCCGAATATG GCAAGTTGAT GATGGCTGAG
TTACGTAACT ACGGCTATGT GTTGGAAGAC CCGTTACTAC AGGGGTGGTT GCAGAGCATA
GGAGAACATT TGGCGGCCAA CAGTGATCAA CCGCACCAAC TATTTACCTT TGTATTGTTA
AAAGAACGTC AAATTAATGC GTTTGCCACA TTGGGCGGTT ATGTAGCGGT GAATTCCGGC
CTGTTGCTGA CCGCTGAGCG TGAGGATGAG GTTGCAGCGG TGCTCTCCCA CGAGATCATG
CACATTAATC AGAAGCATGT GCTGCGTAGC GTTGAGCGTG CACAGCGTGA TCAGATTCCG
ATTCTGCTAG GGATGCTGGC GGCTGTGATC GCCGCGCAGC ACGTTGGCGG TAATTCAAGT
GGTGATGCGA CGATGGCTGG TATTACCAGT GCCATGGGAC TCATGCAGCA GCGCCAGATC
AATTACACCC GTTCCAACGA AGCTGAAGCT GATCGTCTTG GAATTCACAC CTTGGCGCGT
AGTGGGTATG ACTTAGAGGC GATGGCTGGC TTTTTTGAGA GAATGTCGCT GGTGACACGC
GGTAATTCTG GTGGTGACCA GGCACCGGAC TATTTACAGA CCCATCCTGT GACCGTGACC
CGCATCAGCG AGGCGAAGGC ACGTGCCGAG CAACTCAAGA AACAAAAGGG ACTCACGGGG
GGCGAGGTTG ACCTAATGTC CCGGGAGCGT TTGAATTTGC ATCATGTTTT TCCTGGTGTT
CCTGCTAATC CGTTGTTGCC AAAAGTTCTA CAGCCAGCGT ATAGCGAGTT ATCTCGTGGG
CCTAGTGGTC AGTTCGGTTG GGCCAAAGAG CGCTTACGCG TACTGAGTGC GAAATCCCCA
GAGAGTGCGC TGCGTGAGTA TGAAGCCCTG CGTCGCAACA CAAAGGGGGG ATTGAATGAT
TTTCAGCGCT ATGGGATGGC GTTGGCCAAG TTCCGTAATG GCAGCAATCT GGACGAGGTC
ATGCATGAAT TCCGTACATT GCTTGAGGTA CACCCTAACA ATGTCTGGCT TGCTTCGGCA
TTGGCGCAGA CCCAAGCGCG GGCTGGACAG CGTCACGAGG CTGGTGAACG TTTCGATGCC
TTGTTGCGGC GCTTGTCTGG GCAGCGTGCA GTGGTGCTCA TGTATGCGGA GATGCTTAAC
GAAGGCGGTA ACCTTCAGGA TGGTAAGCGT GCTCAGGCGT TGTTATTGCC GTTACTGCGT
CAGTTGTCTG AGGATGCGTT ATTTCAACAA ACCTTCGCGC GCTCCTGCGA GTTGGCCGGT
GCCACTGCAC GCGCCAGTGA GGCCTATGCG GAAGCCGCGT TTCTGAACGG ACGTCCGGAG
CAGGCATTGA TTCAGTTGCA GGCGTTGAAG AAGGAGAATC TGGATTATGT GACACGTGCG
CGAGTGGATG CGCGTATTGC GGCGATTACT CCAGCGGTGC TGGAAATGCG TCGTCAGGGT
ATCCGCGATC CGGATTTGGA TCGATGA
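
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before its length or GC content can be compared with the fields in the record. A minimal sketch follows; the helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as sample input only.
first_line = "ATGCCCCGCA TGTCGAAGGT GGCGGTTGCA ACATTTAAAC TGGTGACCTC GCCAGAATTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` in the same way gives a string whose length and GC percentage can be checked against the Gene Length and GC content fields.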
 
Protein sequence
MPRMSKVAVA TFKLVTSPEF RDRSFTLRRL SLAIATALVL PLGGIGLNAQ ESRLPDIGSS 
AGQLLTPARQ AEYGKLMMAE LRNYGYVLED PLLQGWLQSI GEHLAANSDQ PHQLFTFVLL
KERQINAFAT LGGYVAVNSG LLLTAEREDE VAAVLSHEIM HINQKHVLRS VERAQRDQIP
ILLGMLAAVI AAQHVGGNSS GDATMAGITS AMGLMQQRQI NYTRSNEAEA DRLGIHTLAR
SGYDLEAMAG FFERMSLVTR GNSGGDQAPD YLQTHPVTVT RISEAKARAE QLKKQKGLTG
GEVDLMSRER LNLHHVFPGV PANPLLPKVL QPAYSELSRG PSGQFGWAKE RLRVLSAKSP
ESALREYEAL RRNTKGGLND FQRYGMALAK FRNGSNLDEV MHEFRTLLEV HPNNVWLASA
LAQTQARAGQ RHEAGERFDA LLRRLSGQRA VVLMYAEMLN EGGNLQDGKR AQALLLPLLR
QLSEDALFQQ TFARSCELAG ATARASEAYA EAAFLNGRPE QALIQLQALK KENLDYVTRA
RVDARIAAIT PAVLEMRRQG IRDPDLDR
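
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below assumes the standard NCBI layout of the codon table (bases in T, C, A, G order); table 11 assigns the same amino acids as the standard code and differs only in which codons may start translation. The function name `translate` is illustrative, and the sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCCCCGCA TGTCGAAGGT GGCGGTTGCA"))
```

The first 30 bases translate to MPRMSKVAVA, the first block of the protein sequence, and the final line of the gene sequence translates to IRDPDLDR followed by the TGA stop codon.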