Gene Xfasm12_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXfasm12_2037 
Symbol 
ID6121120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXylella fastidiosa M12 
KingdomBacteria 
Replicon accessionNC_010513 
Strand
Start bp2133564 
End bp2135318 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content53% 
IMG OID641649985 
Productextracellular endoglucanase precursor 
Protein accessionYP_001776533 
Protein GI170731100 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4305] Endoglucanase C-terminal domain/subunit and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGCT TGGCAATCTT ATTTTGGCTT GCAACATCCG GTTGCGTGGC CTCCGCCATG 
TACGGTGGAA CGGTGGATGC ACAAAATGCA GTGTCTGATA CTCACTTTGT TGAACCACTC
CGCGGTGTTA ACTGGCGTGG TTTGGAAACG GCGCAGCACC TGCCGCAAGG TTTAGATCAA
CGTCCCTGGC GTGAAGTACT CGACCAAATG CAGAGCCTAG GTATTAACGC AATACGCCTG
CCCCTGTGCT CGGACACCCT ACATGGCGCC ATGCCTACCA ATCTGGATTT AGTGCGTAAT
CCAGATCTCA AAGGACGTAC CGCATTGCAG ATTGCCGACG CAATCATGGA TGAAGCTGGG
AAACGTGGGA TGCGGGTCCT CCTGGCGTAT CACGGCGTGG AGTGTCCTAC CGACGGAAAC
CCGTTATTGC GCAGCGTAGA TGAATCAGAG CACCAATGGA TCAGCGATAT GCAATTCATC
ACCTCGCATT ATCGTGCTCA ACAAAAAGTG GTGATCGGCG TGGACTTGGC CGACATGGCA
TACCATCGCC CCTTTCAGAG CGGTGGTGAT AGCACGCCTG ATTGGAACCG TGTCGTCGAG
CGTGCCGCCG CTGCCATTCT GGCGATGAAT CCCGACTGGT TGATTGGTGT GCAGCCTGTT
GGCCTGAATC CCCCCTGCTT GGATGCCTCC GCTCCCATCT CCGATGACAA CATAAAATCG
CAGCACTGTG TCCAATTACG CATTCCTGCT CGGAACCTCT TGCTCATGCC GCGCTTCGCG
GGCACTGACA TGGATACTGA AGCTGCCCTT GGCGCATTTT CTGGAAAACA AACCGTGTTA
CCCAACTCTC TGGATGCCAC GGATGCCGAG CAGCTCGCCC ATCGGATTGA TGCACTGCTG
GCATTTGGCA TACGCCAAGG CTTTTATGGA TCTTGGATGA CCTCAGCACA GATGCCATTT
GGCCTGCTCG ATAACGACGG CCGTACCCCA CGTACCGCAT TGATTGCGCA ACTACATCGT
TGGTGGGGTG TGAGCCGTGT CGATGTTGCC AGTGAGAATG CTGCGACGAA GAATCAAACA
ACGACCGATA CCAATGGATG CGTTACTGGT GATAGCAGCG TGCCCCTCAA TGGTTGGGAC
ACCTCTTTCA GTGGTGTCGC CACCTATACC TATACGGGTT ACAAAGGCGG CGCATTGATG
CTGGATCCGA TTCAGTCCCA TGTGCAAATC ACTGCACTGA ATCCCACTCA ACTCAATTTG
GGGGGGATTC CCGCTGCGAT GGCCGGTGCT TATTTGCGTG TGCAGGGTCC GAAGGGAAGC
ACAACCGTCT ACGTGACGGA CCTCTATCCC ACCGGATCTT CCGGGGGATT AGATCTTTCG
CCGAATGCCT TTGCCAGTAT CGGCAATATG GCTCAAGGGC GTATCCCCGT GCAATGGAAG
GTCGTCTCAG CACCGGTGAG TGGCAACCTG ATCTATCGGG TAAAAAAGGG AAGCTCGGGA
TGGTGGGCAG CGATCCAAGT ACGTGAACAT CGGTATCCAG TCCTCAAACT TGAAATCTGT
CAGGATGGCA CCTGGTTGAA TTTGCCAAAG AGAAATTACA ACTACTTTGT TGGTACCCGA
CTGGGTAACC AACCCTTGTC AATGCGCATG ACAGATATTC GTGGTCAGAC CTTGATTGAT
ACGCTTCCCG CCTTGCCCAA GAAGGCTTCC TCAAAAGCAT ATTCTGTGAA TGGAAATGTC
CAGTTTTCCG AATAA
 
Protein sequence
MRRLAILFWL ATSGCVASAM YGGTVDAQNA VSDTHFVEPL RGVNWRGLET AQHLPQGLDQ 
RPWREVLDQM QSLGINAIRL PLCSDTLHGA MPTNLDLVRN PDLKGRTALQ IADAIMDEAG
KRGMRVLLAY HGVECPTDGN PLLRSVDESE HQWISDMQFI TSHYRAQQKV VIGVDLADMA
YHRPFQSGGD STPDWNRVVE RAAAAILAMN PDWLIGVQPV GLNPPCLDAS APISDDNIKS
QHCVQLRIPA RNLLLMPRFA GTDMDTEAAL GAFSGKQTVL PNSLDATDAE QLAHRIDALL
AFGIRQGFYG SWMTSAQMPF GLLDNDGRTP RTALIAQLHR WWGVSRVDVA SENAATKNQT
TTDTNGCVTG DSSVPLNGWD TSFSGVATYT YTGYKGGALM LDPIQSHVQI TALNPTQLNL
GGIPAAMAGA YLRVQGPKGS TTVYVTDLYP TGSSGGLDLS PNAFASIGNM AQGRIPVQWK
VVSAPVSGNL IYRVKKGSSG WWAAIQVREH RYPVLKLEIC QDGTWLNLPK RNYNYFVGTR
LGNQPLSMRM TDIRGQTLID TLPALPKKAS SKAYSVNGNV QFSE