Gene Caci_5223 details


Gene Information

Locus tag: Caci_5223
Symbol:
ID: 8336577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6010327
End bp: 6012210
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 69%
IMG OID: 644958321
Product: Alpha-L-arabinofuranosidase B catalytic
Protein accession: YP_003115923
Protein GI: 256394359
COG category:
COG ID:
TIGRFAM ID:
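
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(6010327, 6012210)
print(length)                     # 1884
print(protein_length_aa(length))  # 627
```

Both results agree with the Gene Length and Protein Length fields listed for Caci_5223.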


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.960188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTCAT CTCCACCTGC CCGCCTCACC GATCAGCGCG TCGAGTCCGG CTCGCGCCTG 
TGGCGTCGCC TCGCGATGTC GATCGGCGCG GTCGTGGTCC TCATCCTCGG CGTCCTCACC
ACCGGTCCGG GCGCCGCCCA GGCCGCGGGC TCGCAAGGGC CCTGCGACAT CTACGCCTCC
GCGGGTACCC CGTGTGTGGC GGCGCACAGC ACGGTCCGAG CGCTGTACGC GTCCTACAAC
GGACCGCTCT ACCAGGTCCG GCGTGCATCC GACGGTGCCA CCACGAACAT CGGCGTGACC
TCCGCCGGCG GGTCTGCGAA CTCTGACGCG CAGGACACGT TCTGCGTGCA GACCGCGTGC
ACGATCACGG TCGTCTACGA CCAGTCGCCG CAGCACAACA ACCTGACGAT CGAGGGCGGG
GGCGGCGCCA ACCCGAACGC CGACGTCGCC GCGAACGCCA ACGCCGCGCA CATCGTCGTC
AACGGGAACA AGGCGTACGG CGTGTACGTC GGTCCCGGCG TCGGCTACCG CGACGACAAC
ACCTCGGCGA TCCCGACCGG CCACCAGCCG CAGGGTGCTT ACATGGTCGC CAGCGGCACG
CACGTCAACG GCGGCTGCTG CTTCGACTAC GGCAACGCCG AGACCAACAA CCGCGACAAC
GGCAACGGCC ACATGGAGGC GGTGAACCTC GGGACCAGCT GCTGGTCCTC CCCGTGCAAC
GGCACCGGGC CGTGGATCAC CGGCGACCTG GAGAACGGCC TGTATCAGGG TGCCGGCGCG
AACCCGTCGA ACACCGGCAA CAACAGCCAG TTCGTGACCG CGATGCTCAA GAGCAACAAC
CAGACCACCT TCGAGATCGA AGGCGGCAAC TCCCAATCCG GAGGCTTGAC CACCTGGTAC
AACGGCTCGC TCCCGCCGAA CGGCTACACG CCGATGTCGC TGGAGGGCGC CATCGTCCTC
GGCACCGGCG GGGACAACAG CAACGCCTCG ATCGGCACGT TCTTCGAGGG CGTGATGACC
GCCGGCGTCC CGAGCGATTC GGCGGACGCC GCCGTGCAGG CGAACATCGT CGCGCAGGGC
TACAGCGGCA ACAGCGGCGG AAGCCCGCAG GCGTCCGGCG GCACCGTCAC CCTGCCAGGC
GGGCAGTGCG TCGACGTCAT CGGCGACGAC AGCGGCGGCG ACCTCACCGG GGTCAACCTG
TGGGGCTGCC AGTCCGGCGC CGTCGACCAG CGCTGGACGC ACAACACCGA CGGCTCGCTG
GAGACCTTGG GCCGCTGCCT GGACATCGAC GGCAACGGCA CGGCGGTCGG CACGAAGGTC
GAGCTGTGGG ACTGCAACGG CGTCGGCGGC CAGAAGTGGA TCCAGCAGAG CAACGGCGCG
CTGCTGAACC CGCAATCAGG CCTCTGCCTC GACGACCCGA GCGGCAACAC CGCCAACGGC
ACGCAGCTGC AGATCTACAC CTGCAACGGC ACCACCGCGC AGCAGTTCTC GGTCAACGGC
GGCGGCACGG TCAACGCCCC CGGCAAACAG TGTGTGGACG TCGCCGGCGA CGACAACGGC
GGCAACCTCA CCGTCGTCCA GCTCTGGACC TGCCAACCCT TCGCAGCCGA CCAGCACTGG
CACCACAACG CCAACGGCTC GCTGCAAACC CTCGGCCGCT GCCTGGACAT CGCCGGCAAC
GGCACGGCGG TCGGCACCAA GGTCGAGCTG TGGGACTGCA ACGGCGTCGG CGGCCAGGTC
TGGCAGCAGC AGTCCAACGG CGCGCTCCTG AACCCGCAAT CAGGCCTCTG CCTCGACGAC
CCGAGCGGCA ACACCGCCAA CGGCACCCAG CTGCAGATCT ACACCTGCAA CGGCACCGCG
GCACAGAAGT TCGCGCTGGA GTAG
 
Protein sequence
MRSSPPARLT DQRVESGSRL WRRLAMSIGA VVVLILGVLT TGPGAAQAAG SQGPCDIYAS 
AGTPCVAAHS TVRALYASYN GPLYQVRRAS DGATTNIGVT SAGGSANSDA QDTFCVQTAC
TITVVYDQSP QHNNLTIEGG GGANPNADVA ANANAAHIVV NGNKAYGVYV GPGVGYRDDN
TSAIPTGHQP QGAYMVASGT HVNGGCCFDY GNAETNNRDN GNGHMEAVNL GTSCWSSPCN
GTGPWITGDL ENGLYQGAGA NPSNTGNNSQ FVTAMLKSNN QTTFEIEGGN SQSGGLTTWY
NGSLPPNGYT PMSLEGAIVL GTGGDNSNAS IGTFFEGVMT AGVPSDSADA AVQANIVAQG
YSGNSGGSPQ ASGGTVTLPG GQCVDVIGDD SGGDLTGVNL WGCQSGAVDQ RWTHNTDGSL
ETLGRCLDID GNGTAVGTKV ELWDCNGVGG QKWIQQSNGA LLNPQSGLCL DDPSGNTANG
TQLQIYTCNG TTAQQFSVNG GGTVNAPGKQ CVDVAGDDNG GNLTVVQLWT CQPFAADQHW
HHNANGSLQT LGRCLDIAGN GTAVGTKVEL WDCNGVGGQV WQQQSNGALL NPQSGLCLDD
PSGNTANGTQ LQIYTCNGTA AQKFALE
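
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. It assumes the sequence contains only A, C, G and T, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `gc_content` and `translate` are illustrative, not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence above, in the blocked layout.
print(translate("ATGCGTTCAT CTCCACCTGC C"))  # MRSSPPA
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean` gives two strings that can be compared directly; `gc_content` on the full gene sequence can be checked against the GC content field of the record.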