Gene Caci_3581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3581 
Symbol 
ID8334934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3990550 
End bp3992955 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content68% 
IMG OID644956724 
ProductGlycoside hydrolase family 59 
Protein accessionYP_003114327 
Protein GI256392763 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCG CCCGCGTCGT CAGACCACCC AGACTCCTGC TCTACCTCGC CCTCACCGCG 
CTGCTGGCGT CGGTGTTCTA CGTCGCCACC GCCGGCGCCC CGGCAAGCGC GGCGACCTCC
ACCACGATCT CCGCGAACGG TTCCAGCGGC GGCCGGACCT TCGACGGCAT CGGCGCGATC
AGCGGCGGCG GCGGGAACTC CCGGCTGCTG AAGGACTATC CGGCCGCTCA GCAGTCGCAG
ATCCTGGACT ACCTGTTCAA GCCCGGCTAC GGCGCGGACT TGCAGATGCT GAAGCTGGAG
ATCGGCGGCG ACGCCAACTC CACCGACGGC TCCGAGCCCT CGATCGAGCA CACCCGCGGT
GTGGTCAACT GCAACGCCGG CTATGAGTTC TGGCTCGCCC AACAAGCCAA GGCCCGCAAC
CCCAACATCG CCTTCTACGG CCTGGCCTGG GCCGCACCCG GCTGGATCTC CGGCGGCTTC
TGGTCCACCG ACACGATCAA CTACCTGATC AGCTGGCTGG GCTGCGCGAA AGCCGACGGC
GTCCCGGTGT CCTACCTCGG CGGCTGGAAC GAGCGCGGCT TCAACGCCGA CTGGTACATC
AACCTGCGCA CCGCCCTGAA CAACGCCGGC TACGGCGACG TCAAGATCGT CGCCGACGAC
TCCGGCTGGG ACGCCGTCGA CGCCGCCGCC TCCAACTCCG CGTTCAACAA CGCCGTGTCG
ATCTGGGGCG CGCACTACTC CTGCAACGGT GGTGACGGCG GCAACGCCGA CACCTGCTCC
AGCGACGCCG CGGCGAAAGC CAACGGCAAG CCGCTGTGGG ACAGCGAGCA GGGGTCGCAG
GACATGAACA CCGGCGCCCC GGCACTGATC CGCGCCATCA CCCGCGGCTA CATCGACGCC
AAGATGACCA GCTACTTCAA CTGGCCGCTG ATCGCCGCGA TCTATCCCAA CCTGCCGTAC
TCCACCGTCG GGCTGATGAC GGCCGGCTCG CCGTGGTCCG GCGCGTACTC CGTCGGCGCC
AACACCTGGG CCACCGCGCA GGTCACACAG TTCACGCAGC CGGGGTGGAA GTTCCTGGAC
AGCGGCTCGG GCTACCTCGG CGGCTCGGAG TCCAACGGAA CCTACGTCAC GCTGAAGTCC
ACCAACAACA GCGACTACAC CACGATCGTG GAGACCACCA CCGCCAGCGC CGCGCAGAAC
GTCACGATCA ACGTCAGCGG CGGGCTGTCC ACCGGCACCG CGCACGTCTG GGCGACCAAC
CTGAACAACC CCAGCACCGG GGCGTCGCTG ATCCACACCC AGGACGTCAC CCCGTCCAAC
GGCTCCTACA CGCTGTCCGT CCAGCCCGGC TACGTCTACT CGATCACGAC GACGACGGGC
CAGGGTAAGG GCACTGCGAC GTCCCCGGCG TCCGGCGCGC TGGCGCTGCC CTACAGCGAC
ACCTTCGACA GCGACGCCAC CAACACCGAG GCCAAGTACC TGTCGGACAT GCAGGGCTCC
TTCGAGGTCC GGCCGTGCGC CAACGGCCGC TCGGGGCAGT GCGTGCAGCA GGTCACGCCG
GTCATCCCGA TCGAGTGGCA GAACGACTCC GACGCCTTCT CCCTGCTCGG CGATCCCACC
TGGTCCAACT ACACCGTCAA GGTCGACGTG AACCTGCAGC AGGCCGGCAC CGCCGAACTG
CTGGGCCGCG CCGGCACGCA GTCCCGACCG CAGGGGAATC AGAACCTGTA CAAGTTCCGG
GTCTCCAACA CCGGCGCCTG GTCGATCGTG AAGAACTACA GCAGCGGCTC GTCCACCACT
CTGGCCAGCG GCACGACCAC CGCCCTGGGC ACCGGCACCT GGCACACCCT CAGCCTGGGC
TTCCAGGGCA CGACGATCAC CGCCATGGTC GACGGCAACA CCGTCAAGAC CGTCACCGAC
TCCACCTTCC TGTCCGGACA GGTCGGCATC GGCGTGGTCG GCTACCAGAC CGACCAGTTC
GACAACCTCA CCATCACTCC GGGCACCGGC ACGGCACAGC CTCCGACCGG CCCGATCACC
TCCGGCGTCG CCGGCAAGTG CCTGGACGAC AACGGCGGCT CGACCGTCAA CGGCACCGCC
GCCCAGATCT GGGACTGCAA CAACACCGCC GCCCAGCAGT GGACGTACAA CGGCGGGGCG
TTGCAGGTGA ACGGCAAGTG CCTGGACATC ACCGGAGCCG CCACCGCCAA CGGCACGCTG
GCGGAGATCT GGGACTGTAA CGGAGGCGGC AACCAGCAGT GGGTCCAGAA CGGCAACACC
CTGGTCAACC CGGCTTCCGG ACGCTGCCTG GACGACCCCG GGTTCAGCAC GACCAACGGC
ACGCAACTGG AGATCTGGGA CTGCAACGGC GGCACGAACC AGCAATGGAC GCTGCCGTCG
GCGTGA
 
Protein sequence
MPTARVVRPP RLLLYLALTA LLASVFYVAT AGAPASAATS TTISANGSSG GRTFDGIGAI 
SGGGGNSRLL KDYPAAQQSQ ILDYLFKPGY GADLQMLKLE IGGDANSTDG SEPSIEHTRG
VVNCNAGYEF WLAQQAKARN PNIAFYGLAW AAPGWISGGF WSTDTINYLI SWLGCAKADG
VPVSYLGGWN ERGFNADWYI NLRTALNNAG YGDVKIVADD SGWDAVDAAA SNSAFNNAVS
IWGAHYSCNG GDGGNADTCS SDAAAKANGK PLWDSEQGSQ DMNTGAPALI RAITRGYIDA
KMTSYFNWPL IAAIYPNLPY STVGLMTAGS PWSGAYSVGA NTWATAQVTQ FTQPGWKFLD
SGSGYLGGSE SNGTYVTLKS TNNSDYTTIV ETTTASAAQN VTINVSGGLS TGTAHVWATN
LNNPSTGASL IHTQDVTPSN GSYTLSVQPG YVYSITTTTG QGKGTATSPA SGALALPYSD
TFDSDATNTE AKYLSDMQGS FEVRPCANGR SGQCVQQVTP VIPIEWQNDS DAFSLLGDPT
WSNYTVKVDV NLQQAGTAEL LGRAGTQSRP QGNQNLYKFR VSNTGAWSIV KNYSSGSSTT
LASGTTTALG TGTWHTLSLG FQGTTITAMV DGNTVKTVTD STFLSGQVGI GVVGYQTDQF
DNLTITPGTG TAQPPTGPIT SGVAGKCLDD NGGSTVNGTA AQIWDCNNTA AQQWTYNGGA
LQVNGKCLDI TGAATANGTL AEIWDCNGGG NQQWVQNGNT LVNPASGRCL DDPGFSTTNG
TQLEIWDCNG GTNQQWTLPS A