Gene Caci_3590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3590 
Symbol 
ID8334943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4004843 
End bp4006366 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content67% 
IMG OID644956733 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_003114336 
Protein GI256392772 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.497097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCG CGCACATCGT GCTCGACAAG CAGGCGGTCG TCGCCCCGGT CCGGCGGCGC 
ACCTTCGGCT CGTTCGTCGA GCACCTCGGA CGCTGTGTCT ACACCGGGAT CTACGAGCCG
GACCATCCGA GCGCGAACGA CGACGGGTTC CGGATGGACG TCGTGGACCT CGTCCGGGAG
CTCGGCAGCA CGACGATCCG CTATCCCGGC GGCAACTTCG TCTCCGGCTT CCGCTGGGAG
GACTCGGTCG GTCCGCGCGG GAAGCGCCCG GTACGGCGCG ACCTCGCGTG GCATTCGCTG
GAGTCCAACC AGGTCGGTCT CGACGAGTTC GCCGCGTGGC TCAAGCTCAC CGGCTCGGAG
CTGATGCTCG CGGTGAACGT CGCCACGCGG GGGATCCTGC CCGCCCTGGA CCTGCTGGAG
TACGCCAACC ATCCCAGCGG CACGGCGCTG TCGGATCTGC GGATCGCCAA CGGCGCGCCG
GATCCGCACA ACGTGCGCAT GTGGTGCCTC GGCAACGAGA TGGACGGACC CTGGCAGACC
GGCTTCATGT CCGCGGAGGA CTACGGCAAG ATCGCCGCCC GCACCGCCGC GGCGATGAAG
ATGGCTGACA AGGATCTCGA ACTCGTCGTC TGCGGCTCCT CCGGATCGGG GATGCCGACG
TTCGGCGACT GGGAGCGCAC GGTCCTGGAG CACAGCTATG ACCATGTCGA CTACGTGTCC
TGCCATGCCT ACTACCAGGA GCTCGACGGT GATCTCGGCT CCTTCCTGGC TTCGGCGCTG
GACATGGAGT ACTTCATCGA CACGGTGATC GCGACCGCCG ACCATGTCGG CTACAAGAAG
CGTTCCAGCA AGAAGATCGA CATCTCCTTC GACGAATGGA ACGTCTGGTA CCTCAAGGAG
CACCAGGAGT CCGAGAAGTC CAAAGAGGCC GACAACGAGT GGCGTCACGC GCCCCGGCAG
CTTGAGGACG TCTACACGGT GGCGGACGCC GTCGTGGTCG GCAACCTGCT GATGACGCTC
CTCAAGCGCG GCGACCGTGT CACGTCGGCG TCGCTCGCGC AGCTCGTCAA CGTGATCGCA
CCGATCATGA CCGAGCCCGG CGGACCGGCT TGGCGGCAGA CGACCTTCCA TCCCTTCTCC
ATCACCAGCC GGCTCGCCGC CGGGGAGGTG ATCCGGCCGG TGATCGACGC GCCGACGTAT
ACGACGGCAC GATATGGCGA GGCGTCCGTC GTCGACGCGG TGGCGACCGT GGATGGGGAT
CGGGCCGCGG TCTTCCTCGT CAACCGCGAT CTGACGCAGA GCGCGCAGGT CACGGTCGAC
GTGCGCAGCC TTGGTCTGTC CCGTGTCCTC GAGGCGCTCA CGCTTGCCGA CTCCGACGTC
TACGCGAAGA ACACGCTCGC CGAGCCTCTG CGTGTGGTTC CGCGGGCGAA CACCGGCGCG
ACGCTGTCCG ACGGTGTGCT CACCGTCGAA CTGCCGCCGG TTTCGTGGTC GGCGATCGCA
CTCGGTCAGA GTGCGGGCGA CTGA
 
Protein sequence
MPRAHIVLDK QAVVAPVRRR TFGSFVEHLG RCVYTGIYEP DHPSANDDGF RMDVVDLVRE 
LGSTTIRYPG GNFVSGFRWE DSVGPRGKRP VRRDLAWHSL ESNQVGLDEF AAWLKLTGSE
LMLAVNVATR GILPALDLLE YANHPSGTAL SDLRIANGAP DPHNVRMWCL GNEMDGPWQT
GFMSAEDYGK IAARTAAAMK MADKDLELVV CGSSGSGMPT FGDWERTVLE HSYDHVDYVS
CHAYYQELDG DLGSFLASAL DMEYFIDTVI ATADHVGYKK RSSKKIDISF DEWNVWYLKE
HQESEKSKEA DNEWRHAPRQ LEDVYTVADA VVVGNLLMTL LKRGDRVTSA SLAQLVNVIA
PIMTEPGGPA WRQTTFHPFS ITSRLAAGEV IRPVIDAPTY TTARYGEASV VDAVATVDGD
RAAVFLVNRD LTQSAQVTVD VRSLGLSRVL EALTLADSDV YAKNTLAEPL RVVPRANTGA
TLSDGVLTVE LPPVSWSAIA LGQSAGD