Gene Caci_5160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5160 
Symbol 
ID8336514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5927660 
End bp5929219 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content66% 
IMG OID644958258 
ProductXylan 1,4-beta-xylosidase 
Protein accessionYP_003115860 
Protein GI256394296 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0027116 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCCTCG GATCACGCCT GTCGCGTCTG TCCACGTTCA GACGCAAGGT GCTGGCGGCC 
GGCGCCGTGC TGGCGCTGAC CTTCGGCATC GCCGTGGAGA CGACCGCGGT TTCGCACGCC
GCCGTCACCG GGCCCTGCGA TATCTACGCC TCGGGCGGTA CGCCGTGCGT CGCCGCGCAT
AGCACCACGC GCGCGCTGTA CGGGTCCTAC AACGGCAGCC TGTACCAAGT GCGGCGCGCC
TCGGACAGTG CCACGGCGAA CATCGGCGTC CTGAGCGCCG GAGGTTATGC CAACGCCGCA
GCGCAGGACT CCTTCTGCGC TAACACGTCC TGCGTCATCA CGGTCATCTA CGACCAGTCC
GGACGCGGCA ACAACCTCAC CGACGCCCCG GCCGGCGGCG CGGCCGGAGG ACCGGACAAC
CTCGCCAACG CCTACGCCGC CCCGACCGTG CTCAACGGGC ACAAGGCCTA CGGCGTCTAC
GTCGCGGCCG GCACGGGCTA CCGCAACGAC CACACCAACG GCATCGCCAC CGGCGACAAC
GCCGAGGGTG AGTACGCCGT CTTCGACGGT ACGCACTACA ACGGCGGCTG CTGCTTCGAC
TACGGCAACG CCGAGACCAA CAACAACGAC GATGGCAACG GCACCATGGA AGCCATCTAC
TTCGGCAACA TCAAGGTCTG GGGATACGGC TCCGGAAACG GTCCGTGGAT CATGGCGGAC
ATGGAGAACG GCCTGTACTC CGGGCAGAAC GCCGGCTACA ACGCCAACGA CCCCACGATC
AACTACCGCT ACACCACCGC CATCATCAAG GGCACCGCCA ACAAGTGGGC GATCCGAGGC
GGTAACGCAC AATCCGGTGG CCTGTCCACC TTCTACAGCG GACCGCGGCC CAACGTCTCG
GGCTACAACC CGATGCGCAA GCAGGGTGCG ATCATCCTCG GCATCGGCGG CGACAACAGC
AAGGGATCGG CGGGAACCTT CTACGAAGGC GTCATGACCT CCGGCTACCC CTCGGACGCC
ACCGAGAACT CGGTCCAGGC CAACATCGTC GCCGCCGGAT ACAGCTCCGG CTCGAGCTCC
GGCGGTCTGA CGCCCGGCTC GCGCATCTCG GTGCGCGCGA CGACCTCATG CTGCACCAGC
GACTACCTCC GCCATGACGA CGCCGACACC AAGGCCGTCA TCTCGGCCAT CTCCTCCAGC
AGCTCGGCGA CCGACAAGGC CGACGCGACG TGGATCGTCC GCGCCGGTCT GGCCGACAGC
TCCTGCGTGT CCTTCGAGTC CGCCAACGAT TCCGGGCAGT TCCTCCGGCA TTACAACTAC
GAGCTCTACA TCGCGGCGAA CGACGGATCC TCGGTGTTCG CCCAGGACGC GACGTTCTGC
CCGAAGGCCG GGAACAGCGG CCAGGGCACG TCGTTCCAAT CGGTGAACTA CAACACCAAG
TACATCCGGC ACTTCAACTA CACCGCCTAC ATCGCCAGCA ACGGCGGCTC GAACGCCTGG
GACAGCACCA CGTCCTGGGC CGACGACACC AGCTGGGTCG TGAGCACTCC GTGGGCCTAA
 
Protein sequence
MALGSRLSRL STFRRKVLAA GAVLALTFGI AVETTAVSHA AVTGPCDIYA SGGTPCVAAH 
STTRALYGSY NGSLYQVRRA SDSATANIGV LSAGGYANAA AQDSFCANTS CVITVIYDQS
GRGNNLTDAP AGGAAGGPDN LANAYAAPTV LNGHKAYGVY VAAGTGYRND HTNGIATGDN
AEGEYAVFDG THYNGGCCFD YGNAETNNND DGNGTMEAIY FGNIKVWGYG SGNGPWIMAD
MENGLYSGQN AGYNANDPTI NYRYTTAIIK GTANKWAIRG GNAQSGGLST FYSGPRPNVS
GYNPMRKQGA IILGIGGDNS KGSAGTFYEG VMTSGYPSDA TENSVQANIV AAGYSSGSSS
GGLTPGSRIS VRATTSCCTS DYLRHDDADT KAVISAISSS SSATDKADAT WIVRAGLADS
SCVSFESAND SGQFLRHYNY ELYIAANDGS SVFAQDATFC PKAGNSGQGT SFQSVNYNTK
YIRHFNYTAY IASNGGSNAW DSTTSWADDT SWVVSTPWA