Gene Caci_4317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4317 
Symbol 
ID8335671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4900365 
End bp4901879 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content69% 
IMG OID644957420 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_003115022 
Protein GI256393458 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.341945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00470977 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTGACAG CACATGCCAC CCTCGACCCG GCCTTCCGCG TCGCGCCGGT CGACCGCCGC 
CTGTTCGGTT CCTTCGTGGA GCACATGGGC CGCTGCGTCT ACGGCGGGAT CCACGAGCCC
GGACACCCCG AGTCCGACGC CGACGGAAAC CGGCTGGACG TCCTGGAACT CACCCGCGAC
CTCGGGATCA GCGTCATCCG CTACCCCGGC GGCAACTTCG TGTCCGGATA CCGCTGGGAG
GACGGCGTCG GGCCGGTCTC CGAGCGGCCG CGCCGGCAGG ACCTGGCCTG GCGCTCCATC
GAGACCAACG AGTTCGGCCT GAACGAGTTC ATGGCCTGGG CGAAGCTGGC GAACGTGGAG
CCGATGATGG CGGTGAACCT GGGGACCCGG GGCATCCAAG AGGCCTGCGA CCTGCTGGAG
TACGCCAACC ACCCGGGCGG CACGTACCTG TCCGACCTGC GCCGCAAGCA CGGCGTCGAG
GAGCCGCACG CGATCAAGCT GTGGTGCCTG GGCAACGAGA TGGACGGCCC GTGGCAGACC
GGCCACAAGA CCGCCGAGGA GTACGGCAGA CTCGCCGCCG AGACCGCCAA GGCGATGCGC
CAGGTCGACT CCGGCATCGA GCTGGTCGCC TGCGGCAGCT CCAACGCCCT CATGCCGACC
TTCGGCTCCT GGGAGTCCAC GGTCCTGGAG CACACCTTCG ACTACGTGGA CTACATCTCC
CTGCACGCCT ACTACGAGCA ACACGGCCAG GACCGCGCCA GCTTCCTGGC CGCCGCAGCC
GGCATGGACC GCTTCATCGA CGGCGTGGTG GCCACCGCCG ACCATGTCTC CGCGAAGAAG
AAGTCCCGCA AGAAGCTCAA CCTCTCCTTC GACGAGTGGA ACCTGTGGCA GGAAAGCCGC
TTCGCCGGCC ACACGAACCT GGACTGGGAA CAGGCGCCCC GACTGATCGA GGACACCTAC
ACCGTGCAGG ACGCGGTGGT CTTCGGCAGC CTGCTGATGT CCCTGCTCCG CCACGCCGAC
CGGGTCGCCA TCGCCTGCCT CGCGCAGCTG GTGAACGTGA TCGGCCCGAT CCGCGCCGAA
CCGGACCGCC CAGCCTGGGC ACAGACCATC TTCCACCCCT TCGCACTCAC CGCTCGGCAC
GCAGTCGGCG ACGTGCTCCG CGTGGAGACA GCCACCGACC TCTACGACAC CGCCGAGCAC
GGCGACGTAC CGCTGGTCGA CGTGGTGGCT ACGCACGACC GCGAAAGCGG GCAGCTCACG
GTGTTCGCGA TGAACCGGCA CACCGAGGAC CGGGCCCGCC TGGAGATCGA CCTGCGGGCT
TTCGGCGACC TGGTGGTCAC CGAGCACCTG CACCTCGGCG GCGAGCAGGA CCTCGACGCC
GTGAACTCGA TCGAGACGCC GCAGGCGGTC GCGCCGATCA GCGTTGAGGG CGCGAGCATC
GAGGGACGTC GGCTGACGGC TTCGCTCCCG CCGGTGTCCT GGAACATGAT CCGGTTCGTT
CCCGCCAATC GGTAG
 
Protein sequence
MLTAHATLDP AFRVAPVDRR LFGSFVEHMG RCVYGGIHEP GHPESDADGN RLDVLELTRD 
LGISVIRYPG GNFVSGYRWE DGVGPVSERP RRQDLAWRSI ETNEFGLNEF MAWAKLANVE
PMMAVNLGTR GIQEACDLLE YANHPGGTYL SDLRRKHGVE EPHAIKLWCL GNEMDGPWQT
GHKTAEEYGR LAAETAKAMR QVDSGIELVA CGSSNALMPT FGSWESTVLE HTFDYVDYIS
LHAYYEQHGQ DRASFLAAAA GMDRFIDGVV ATADHVSAKK KSRKKLNLSF DEWNLWQESR
FAGHTNLDWE QAPRLIEDTY TVQDAVVFGS LLMSLLRHAD RVAIACLAQL VNVIGPIRAE
PDRPAWAQTI FHPFALTARH AVGDVLRVET ATDLYDTAEH GDVPLVDVVA THDRESGQLT
VFAMNRHTED RARLEIDLRA FGDLVVTEHL HLGGEQDLDA VNSIETPQAV APISVEGASI
EGRRLTASLP PVSWNMIRFV PANR