Gene Xfasm12_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXfasm12_2003 
Symbol 
ID6121087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXylella fastidiosa M12 
KingdomBacteria 
Replicon accessionNC_010513 
Strand
Start bp2086827 
End bp2089265 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content60% 
IMG OID641649955 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001776503 
Protein GI170731070 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00352357 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCCA CCCCAGCCCG CCCAACGCAG GAACCCCCAT TACTGCGGGC ATTGCTCACC 
CTGTGCATTG CCGCACTGTC AACAACGAGC TTCGACACCA CCGCATCACA TCATAAAAAA
TCACAACACC CATTACCCCC GAGAACTGCT TCCGGCCCAC CACTACCACC ACTGCCACTG
ATCCCTGCAC CCGTTCAAAT ACAACGCGGT CACGGCCAAA TCCACATTGG CCCCCACACC
CTGATTTCCA TCCCCCCCAA CGATACCGAC GCACAACACA GCGCCACCTA CCTAGCCACA
CTGCTACAGC ACACCCGCAA CCTGACATTA CACATCCACA CCGAAACCAC CCCCACCCCA
GACAGCATCC GCCTACAACG CGACCCACAA TCACCCGTCA CTCAGACAGA AGGCTACACC
CTGCAAGCCC TCCCCAACCA AGGCATGCAT ATCACGGCAC GAGACGGAGC AGGACTGTTC
TACGGCGCGA TCACTGCATG GCAACTACTG ACTGCCGACA GCAACCAAGG CCCAACCGAA
ATCCCTACCG TCACCATTCA CGACTGGCCA CGCTTCAAAT GGCGCGGCCA ACTCCTTGAC
GTCGCCCGTC ACTTCCACGA CGTAGACACC GTCAAACACG TGATTGACGC CATGGCACAA
CACAAACTCA ACGTCCTGCA CCTACACCTC ACCGACGACC AAGGCTGGCG TATCGAAATC
AAACGCTACC CCAAACTCAC TGCAATCGGC GCCGAACGCA TCCCACCGGG CGCCGGACGC
CACGGCACCC CAGAACGCTA CGGCGGCTTC TACACCCAAG ATCAAATCCG CGAACTCGTT
GCCTACGCCA CCGAACGACA GATCACCATC CTCCCCGAAA TCGACATGCC CGGCCATGCA
CAAGCCGCCG TGGCAGCCTA CCCCGACATC ATCGGAGTCA CCAGCACCAC CCCACCCGTC
AGCGTCGACT GGGGCGTCAA CCCCTACCTC TTCGGCACCA GCACACCCAG CCTGGACTTC
ATCCGCAATG TACTCGACGA AGTACTCACC CTATTCCCCT CCCAGTACAT CCACATCGGC
GGCGACGAAG CCGTTAAAGA TCAATGGGAA GCCTCACACA CCATCCGCGC CCAAATGCGC
AAACTGGGCG TGAAAGACAC ACATGCCATG CAAGGCTGGC TCAACACACA ACTAGCCCAA
TACCTCACAA CACATGACCG ACGCCTGATC GGCTGGGATG AAATCATCCA AAGTGGCCTA
CCAGAGAGCG CCTCCGTGAT GTCATGGCGC GGCGTCGAAG GCGCCATTAC CGCCGCACAA
CAAGGACACG ACGTCGTCCT CGCCCCCGCT GGCTGGATGT ACCTAGACAA CCTGCAAACC
GAACGCAGCG ACGAACCAAA CGGCCGCCTC GCCACCCTGC CCCTCTCCCG CGTCTACACA
CTGGACCCCG TCCCCAAAGA ACTGACCCCC GACCAAGCCA TCCACATCCT GGGCCTACAA
AGCGCCCTGT GGAGCGAATA CATCCCCTCA CGCTGGCACA TCGACCACGC CCTATTCCCA
CGCCTCGCCG CCGTCGCCGA AGTCGCCTGG TCCCCCATGA CCGCACGCAA CTGGGACAAC
TTCCTCAAAC GCCTCCCCCC ACAACTACAC CGCTACCGCA CCCTGCACAT CGACTACAGC
GACGGCGCAT TCGCCCCCGA CATCATGCTG CAACACCGCT CAGCCTACGT CCTTGCTGGC
GAACCCCCTC ACATCACACT CAGCAACCAA ACCAACACCG GCCAAATTCA CTACACCACA
AACGGCAACG AACCGACCCT ACATTCCCCC CGCTACACCG CCCCATTTCC CATCACCCTC
CCCACCACAG TCAAAGCAGC CGTATTCACC GAAGACGGCC GCCCCCTGGC CGCCACCCGC
AGCCGCACCT TCGACCACAA CACACTGCTG AGTGTGGACA CCCAAGAATT ACGCAACTGC
TCCGACAAAG GACCACTGGG ATTACGCCTC CCCCTGCTAC CAGACATGCC CGACCCCAAC
ACCCCCGTGT ACAACGTCGA CCTATTCCAC GCCTGCTGGA TCGTCCCCCA AATACGCCTC
AACAACATAC AAGCCATCCA CATCGACGCC GCACGCCTAG CACGCAACTA CGGCCTGGCC
CACGACCAAT CCAAAGTCAT TCAATATCCC AAACACACCG CACACGGCGA ACTGGAAATC
CGCACCGACT GCAACAAAAA ACCACTGGCC GTGATCCCCC TCCCGCCCGG AGACACCATC
GGCGAACCAT TCACCCTCGA CGCCCCATTA CCACCGAACA TCGGCGTCCA CGACCTGTGC
CTACGCATCA CCGCCCCCAT CCACGGCCCA CTGTATGCCA TTGGTCGCGT CCAACTGATC
CACGACACCC CCGCATCACC TCCGCCCCCC ACACACTGA
 
Protein sequence
MPPTPARPTQ EPPLLRALLT LCIAALSTTS FDTTASHHKK SQHPLPPRTA SGPPLPPLPL 
IPAPVQIQRG HGQIHIGPHT LISIPPNDTD AQHSATYLAT LLQHTRNLTL HIHTETTPTP
DSIRLQRDPQ SPVTQTEGYT LQALPNQGMH ITARDGAGLF YGAITAWQLL TADSNQGPTE
IPTVTIHDWP RFKWRGQLLD VARHFHDVDT VKHVIDAMAQ HKLNVLHLHL TDDQGWRIEI
KRYPKLTAIG AERIPPGAGR HGTPERYGGF YTQDQIRELV AYATERQITI LPEIDMPGHA
QAAVAAYPDI IGVTSTTPPV SVDWGVNPYL FGTSTPSLDF IRNVLDEVLT LFPSQYIHIG
GDEAVKDQWE ASHTIRAQMR KLGVKDTHAM QGWLNTQLAQ YLTTHDRRLI GWDEIIQSGL
PESASVMSWR GVEGAITAAQ QGHDVVLAPA GWMYLDNLQT ERSDEPNGRL ATLPLSRVYT
LDPVPKELTP DQAIHILGLQ SALWSEYIPS RWHIDHALFP RLAAVAEVAW SPMTARNWDN
FLKRLPPQLH RYRTLHIDYS DGAFAPDIML QHRSAYVLAG EPPHITLSNQ TNTGQIHYTT
NGNEPTLHSP RYTAPFPITL PTTVKAAVFT EDGRPLAATR SRTFDHNTLL SVDTQELRNC
SDKGPLGLRL PLLPDMPDPN TPVYNVDLFH ACWIVPQIRL NNIQAIHIDA ARLARNYGLA
HDQSKVIQYP KHTAHGELEI RTDCNKKPLA VIPLPPGDTI GEPFTLDAPL PPNIGVHDLC
LRITAPIHGP LYAIGRVQLI HDTPASPPPP TH