Gene Caci_5225 details

Gene Information

Locus tag: Caci_5225
Symbol:
ID: 8336579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6014274
End bp: 6016118
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 69%
IMG OID: 644958323
Product: glycoside hydrolase family 76
Protein accession: YP_003115925
Protein GI: 256394361
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4833] Predicted glycosyl hydrolase
TIGRFAM ID:
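
The coordinate and length fields above are consistent with each other, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; function names are illustrative only.
START_BP = 6014274
END_BP = 6016118


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, "bp")                  # 1845 bp
    print(protein_length(n), "aa")  # 614 aa
```

Under those assumptions the computed values match the listed Gene Length and Protein Length.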


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0174562
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCAGAA TCCTGATCCT CCTCACAGCT CTGGTCACGG CGCTGGTACC GCTGCTCGCC 
ATGGCGCCGC CGCGTGCGAA CGCGGCGAGC GCCGTGTGCG CGCTGTACTG CGACACGCGT
GATCCCTCGC TGGCGCAGCA GGAGACGTTC CCGACCCCGA ACGTCTCCGA GAACGGCCGC
GTGATCGCAT TGCACGTGGA CGACGTCGAC GGCATGGCCT GGGCCAGCAT CGACAACGGC
CGGCTGAACG ACTCGGTCTG GATCGACCGG TCGTGGGACG CCGGCAGCAG CTGGGACGGC
TTGTTGGGCA AGGCGTGGAT CCCGAGCTCG TGGACCGGTA CACGGACCCT GATGTACAAC
ATGTACGACC CCTCCGACCA CCGCCGCGCG GTGGTGCGTG CCTGCGGCGA CGCCAGCGGG
GTGGTGTGTA CGAACTGGGT CCACCTGCCG GTGTGCGCGG CGCGGTGTGA CGGCGCCGAT
TCCAGGACCT CGGTCGGGAA CACCTCACCG GTCCCCGACG CCACGCTGTC CGGGCGCGAC
ATCGCGCTGC ACGTCGACTC CGGCGGCATG GCCTGGGCTT CGATCGCCGG CGGAGCGCCC
GGCGACGAGG TGTGGCTGGA CCGGTCGTGG GACGGCGGCG CGACGTGGCC GGACGGCTCG
AGCAAGGGCC GGGTGAGCGT GCCGTCGGGG GCGTCCGGTA CTCAGACTAT TGAGATCAAC
ATCGACGATC CGTTGGGCCG GCTGGCCGGG GGCGCCGTGC GCGCCTGCGG GCGTGCGGTG
ACCGGGCAGA ACGGCAGCTG CACGGCGTGG GCGCGCGCCG CCGCGGTCCC GGCGAAGGCT
GCCGCCGACG CGCTGATGTG GTCTTATGAC CCCTCCAACG CATGGTGGCC GTCGAGCTGG
TGGAATTCGG CGGTCGCACT GACGTCGGTG ATCGACTACA CGCGCGGCTC GGGCGATACG
GCATACGAGT GGATCGTCGA CCGCACGTTC CAGGTGAACA AGGTCGCCTT CCCGGCCGGC
GCGCGCAGCT CGGACCCCAT CCAGGGCGAC TTCATCAGCC AGGCGACCGA CGACACCGAG
TGGTGGGCGC TGGCGTGGAT CGACGCGTAC GACCTGACGG GGAATCGGAC GTACCTGAAC
GAGGCCGTCA CCATCACGAA CCATGTCAGT TCCCTGTGGA ACACCAGCAC CTGCGGCGGC
GGCGTGTGGT GGAACACGCA GAAGACGTAC AAGAACGCGG TGACCAATGC GCTGTATGTG
GATCTGACCG CCGCGCTGCA CAACCGCATC GCGGGCGACA CGGCGTGGCT GGCGCGGGCG
ACGACGTCCT GGAACTGGTT CCGCTCCAGC GGACTGATCA ACGGCTCGGG TCTGGTCAAC
GACGGCCTGA CGAACGCGTG CACGAACAAC GGCCAGACGG TCTGGACGTA CAACCAAGGG
CTGGCCATCG GCGCGGCGCA GGAGATGTAC CGCGCGACCG GCGACAGCGG CGACCTGAGC
GAGGCGCGCC ACCTCGCCGA CTCGGCGGTG CACTCCCCCA CACTGGTGAC GAACGGGCTG
CTCACGGAGT CGTGCGATGC GCTGACCGCC ACCTGCGACG ACAACCAGAA GCAGTTCAAG
GGGATCTTCA TGCGCTTCCT GGGCGAGCTG AACGCCGACG CGTCGGTCGG TGGCGCGTAC
AGCACGTTCA TCCAGGCGCA GACGTCGTCG CTGTGGAACG CGGACCGGAA CTCGCTCAAC
CAGCTCGGGG AGCGATGGTC GGGGCAGGGC TCGGGGACGA ATCCGAATGT GAGCGATTGG
CGGACGCAAG CGAGCGGGTT GGAGGCGCTG GACGCGGGGG TTTGA
 
Protein sequence
MRRILILLTA LVTALVPLLA MAPPRANAAS AVCALYCDTR DPSLAQQETF PTPNVSENGR 
VIALHVDDVD GMAWASIDNG RLNDSVWIDR SWDAGSSWDG LLGKAWIPSS WTGTRTLMYN
MYDPSDHRRA VVRACGDASG VVCTNWVHLP VCAARCDGAD SRTSVGNTSP VPDATLSGRD
IALHVDSGGM AWASIAGGAP GDEVWLDRSW DGGATWPDGS SKGRVSVPSG ASGTQTIEIN
IDDPLGRLAG GAVRACGRAV TGQNGSCTAW ARAAAVPAKA AADALMWSYD PSNAWWPSSW
WNSAVALTSV IDYTRGSGDT AYEWIVDRTF QVNKVAFPAG ARSSDPIQGD FISQATDDTE
WWALAWIDAY DLTGNRTYLN EAVTITNHVS SLWNTSTCGG GVWWNTQKTY KNAVTNALYV
DLTAALHNRI AGDTAWLARA TTSWNWFRSS GLINGSGLVN DGLTNACTNN GQTVWTYNQG
LAIGAAQEMY RATGDSGDLS EARHLADSAV HSPTLVTNGL LTESCDALTA TCDDNQKQFK
GIFMRFLGEL NADASVGGAY STFIQAQTSS LWNADRNSLN QLGERWSGQG SGTNPNVSDW
RTQASGLEAL DAGV
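
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the GC calculation assumes the sequence contains only unambiguous bases.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First display line of the gene sequence, copied from the record.
    first_line = (
        "GTGCGCAGAA TCCTGATCCT CCTCACAGCT "
        "CTGGTCACGG CGCTGGTACC GCTGCTCGCC"
    )
    seq = clean_sequence(first_line)
    print(len(seq))  # six blocks of ten bases
    print(f"{gc_fraction(seq):.2f}")
```

Pasting the full gene sequence into `clean_sequence` gives a string that can be passed to `gc_fraction` or to a translation tool configured for translation table 11.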