Gene Caci_5258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5258 
Symbol 
ID8336612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6054139 
End bp6056100 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content70% 
IMG OID644958356 
Productglycoside hydrolase family 9 
Protein accessionYP_003115958 
Protein GI256394394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.35373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTCCA GGGCACGACC GCGTTTCCGC CACCGCAGGA CGGTCGCCGC GCTCAGCGCC 
GTCAGCCTGG CCGGCCTGAC GGTCGGCACC GCCACCGTGC TCTCGCAGCC CGCGCAGGCC
GCGCAGTCCG CGAACACCGG GCTGGTCCGC GTCGACCAGG CCGGGTACCT GGCCGGGGAC
GTGAAGCAGG CGTACCTGAT GACCGGCGGG GCGGTGTCTG GGGCGAAGTT CTCGGTGCTG
AACGCCAAGG GCAAGACGGT ACTCACCGGC AAGGTCGGCG GCACCAGCCT CGGCAAGTGG
AACGCCGCCT ATCCGGACGT CTACCCGATC GTCTTCAGCG GCTTGAAGAC GCCGGGGACC
TACCACATCG CGGTCGCCGG GAGCGCCTCG GGCAGCTCGC CGACGTTCAC CGTCACCAGC
TCCGGCTCGC TCTACGGCAA GCTGGTCACC GACGGCGTCA CCTTCTTCCA GACCCAGCGC
GACGGCTCGA ACGTCGTCCC CGGCGCCCTG AACCGCAAGC CCTCGCACCT GAACGACGCC
GCGGCGAGCC TCTACGCCTG GCCGACCTTC GCCCCCGACG ACTCGGACAC CATCACCGAC
GCCGACCTGA CCAAACTCGG CGGTACCGTC GACGTCTCCG GCGGCTGGTT CGACGCCGGC
GACTACTTGA AGTTCTCCAA CAACGAGGCC TTCGGCGACA TCACGCTGCT GGCCGCGCAG
CGCGCCCTGG GCTCCTCCGC CCCGGCCTCG CTGACCGCCG AGGCGCACTA CGGCGAGACC
TGGCTGAACA AGGCCTGGAA CCAGAAGACG AAGACGCTGG TCTTGCAGGT CGGCATCGGC
TCGGGCAATG CCGCCGGTAC TTTCACCGGC GATCACGACC TGTGGCGCCT GCCGCAGAAG
GACGACGGCG ACACCGCCAC CGCCGACCGC TACTCCGCCG CGCACCGCCC GGCGTTCCTC
GCCGCCAGTC CGGGGGCGAG GATTAGCCCG AACATCGCCG GGCGCGTGGC GGCGGCGTTC
GCCCTGGCCG CGCAGGTCGA CGCGAAGAGC AACCCCAAGC AGGCCGCCGC CGAGTACCAG
GCTGCCGCCT CGGTGTATGC GCAGGCTGAT ACCAGCGCTC CGCCGAGCCC GCTGACCACC
GCGCTGCCGA ACGGCTACTA CCCCGAGTCG ATCTGGCACG ACGCGATGGA GTTGGGCGGC
GCCGAACTGG CACTGGCCGC GCAGAAGCTG GGACACAGCC CTTCTTCGTA CCTGTCGCAG
GCCGCTACTT ACGCCAAGGA CTACATCGCC TCCGACACCG GCGACACGTT CAACCTCTAC
GACAACAGTG CCCTGGCACA CGCCGACCTG ATCAAGGCGA TCGCCGCCGC CGGCAACCCG
TCGGGGCTGG CGGTCACTCG TGCCGCACTG ACCGCGGACC TGAAGCGGCA GGTGCAGTCG
GCGGCGAGCA AGGCCTCCTC CGACGTCTTC CACGCCGGCG GCGACTACGC GGACTTCGAC
GTCAACGCGC ACACCTTCGG CTTCCTGACC GAGGAGGCGC TGTACCGGCA GGCCAGCGGC
GACACCTCGT TCCAGTCCTT CGCCACCGAA CAGCGCGACT GGCTGCTGGG CGCCAACGCC
TGGGGACAGG CGTTCATGGT GGGAGAGGGC AGCACCTTCC CGAAGTGCAT GCAGCACCAG
GTCGCGAACC TGTCCGGCAG CCTGAACGGC ACCGGCGCGA TCGCCACCGG CGCGGTGATG
AACGGCCCGA ACAACACCAG CAACTTCGAC GGCGGCCTCG GCTCCTACCA GGACGGCATG
AAGCCCTGCC CGCCCGGCGG CACTGACCCC GACACCAAGT TCACCGGCCA CAACAGCCGC
TTCTCCGACG ACGTCCGCTC CTGGCAGACC GACGAGCCGG CCCTGGACAT GACCGGCTCG
GCAGTCCTCG GCGCCGCGAT GCAGGAGACC CTCGGCGGCT GA
 
Protein sequence
MVSRARPRFR HRRTVAALSA VSLAGLTVGT ATVLSQPAQA AQSANTGLVR VDQAGYLAGD 
VKQAYLMTGG AVSGAKFSVL NAKGKTVLTG KVGGTSLGKW NAAYPDVYPI VFSGLKTPGT
YHIAVAGSAS GSSPTFTVTS SGSLYGKLVT DGVTFFQTQR DGSNVVPGAL NRKPSHLNDA
AASLYAWPTF APDDSDTITD ADLTKLGGTV DVSGGWFDAG DYLKFSNNEA FGDITLLAAQ
RALGSSAPAS LTAEAHYGET WLNKAWNQKT KTLVLQVGIG SGNAAGTFTG DHDLWRLPQK
DDGDTATADR YSAAHRPAFL AASPGARISP NIAGRVAAAF ALAAQVDAKS NPKQAAAEYQ
AAASVYAQAD TSAPPSPLTT ALPNGYYPES IWHDAMELGG AELALAAQKL GHSPSSYLSQ
AATYAKDYIA SDTGDTFNLY DNSALAHADL IKAIAAAGNP SGLAVTRAAL TADLKRQVQS
AASKASSDVF HAGGDYADFD VNAHTFGFLT EEALYRQASG DTSFQSFATE QRDWLLGANA
WGQAFMVGEG STFPKCMQHQ VANLSGSLNG TGAIATGAVM NGPNNTSNFD GGLGSYQDGM
KPCPPGGTDP DTKFTGHNSR FSDDVRSWQT DEPALDMTGS AVLGAAMQET LGG