Gene Caci_3707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3707 
Symbol 
ID8335060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4160733 
End bp4163735 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content67% 
IMG OID644956847 
ProductCarbohydrate binding family 6 
Protein accessionYP_003114450 
Protein GI256392886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.176968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.212982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCCA CACCCACCAC CCCACCCGCA GCATCCCCCT CATCGGCGAT ACCTCGAGGA 
CGCAGACGCG CCCTGGCGGC GTCGACCGCC GGAGCTGTCA CGTTCGCCAC CATCGCCGCC
TGGTCCATGG CGCACTCGGC ATCGGCCGCC GGTACGACCT ATGAGGCGGA GAAGGCGGCG
TTGTCCGGCG GTACGGTCGT CGCCAGCGAT CACGCGAACT ACACCGGGAC CGGTTTCGTC
GGCGGATACA CCGATGCGAA TAAGGGCGCT GCGAACACCA CGTTCACCGT GAACGCCTCC
GCTGCGGGGA ACGAGTCCGT GGCGCTGCGT TATGCGAACG GCACCGGTGC GCAGATGTCG
CTGTCGTTGT ACGTCAACGG CACCAAGCTG AAGCAGATCC TGCTTCCGGC GTCGGCCAAC
TGGGACACCT GGACGACCGA GACCGAGTCG GTCGCGCTGA AGGCCGGGAC CGACACGATC
TCGTACAAGT TCGACTCCGC CGATCTCGGC AACGTCAACG TGGACAACAT CACCGTCACT
CCGGCGGCAC CGCCTCCGGC CGGTCAGTTC GAGGCGGAGA GCGCGGCGCT TTCCGGCGGT
ACGGTCGTCG CCAGCGACCA TGCGAACTAC ACCGGCACCG GTTTCGTCGG CGGATACACC
GATGCGAATA AGGGCGCTGC GAACACCACG TTCACCGTCT CGGAATCCGG TGCCGGATCC
ACGACGACGA CGCTGCGCTA CGCGAACGGC ACCGGCGCGC AGATGTCGTT GTCGTTGTAC
GTCAACGGCA CCAAGATCAA GCAGATCCTG CTTCCGGCCA CGGCGAACTG GGACACGTGG
GGGACCGAGA CCGAAAGCGT CAGCCTGAAC GCGGGCAGCA ACGCGGTCTC CTACAAGTTC
GACTCCTCCG ACCTGGGCAA CGTCAACATC GACAACATCG TCGTCGGTGC GATCACCCCG
CCGACCACGA CGCCGTCGAC CTCTCCCAGC TCCAGCTCGC CGCCGCCGTC CGGCACGCCC
TATGAGGCCG AGACCGCCTT CACCGCCGGC GGCCCGAGCG TCGCGACGTC TATCAGCGGG
TACAGCGGCA CCGGCTACCT GACCGGCTTC ACCACCCAGG GCGCCGAGAC CGTCATCGAC
ACCGACGTCC CGGCTGCGGG TTCGGATGCC GTGACCCTTC GCTATGCGAA CTCCACCGGC
TCGGCGCAGA CGATCTCGCT GTATGTCAAT GGCCTGAAGA ACGCACAGCT CTCGCTGCCG
GCCGGCAGCG GCTGGCTGAC GTCGTCGCGG ACCATCGCGC TGCGCTCCGG GGAGAACCTC
ATCGGCGTCC AGCACGACAG CGGCGACACC GGCAACGTCG CCATCGACGA CGTCACCGTC
GCCAACGGCA CCGCTCTGGC CGCGGTCGGC GCGACGCTCC CGTACACCGA GTACACCGCC
ACGAGCTCGC AGACGCAGAC CAACGGCACG GTCCTGGCCG CCAGCACCGC CTACCCGAGC
ATCCAGGCCG AATCGACCGG CCGCCGGGCC GTCCAGCTGA CCGCCACCGG CCAGTACATG
CAGGTCACGT TGGCGCACCC GACCAACTCG ATCGTGGTCC GCTATTCGAT CCCTGACAAT
GGCGACGGTT CCGCGGCGAG CGCCCCGATC GCGTTGTACG CCAACGGGAA CAAGATCCAG
GATCTGACCC TCACCACCAA GTACTCCTGG CTCTACGGCG GCGGCTACTA CGACACAAAC
ACGCCGAGCA GCGGTCCCGC GCACCACTTC TATGACGAGA CCAGGGCCCT GATAGGCAAC
TGGCCGGCGG GAACGGTGCT GAAGCTCCAG AAGGACTCCG GCGACACGGC CGCCTCGTAC
ACCTTCGACG TCATCGACAC CGAACAGGTG GACCCTGCCT TCGCGATACC GGCGAACTTC
GTCCCGATCA CCAACTACGG CGTCACTCCG AACAACGGGG CGGACGACAC CAACGCGATC
AACAGCGCGC TGAGTGCTTT GGCCGGAACG GGCAAGGGCT TGTTCTTCCC GTCCGGAACC
TACGACATCT CGGGCCGCAT CAACATCAAC GGCGTGCCGG TGCGCGGCGC CGGCGAGTGG
TACACGACGA TCCAGTCCAC GGCCGTGAAC GGCAGCGGCG GTCTGTACAC CACCGCCGGC
GTGAACCAGA TCGCCGACCT GACGATCTCC GGCGATCAGA CCTCGCGGAA CAACGACTCC
GGCGCGGCCG CGATCGAGGG GACCTTCGCG CAGGGCTCGC TGCTGTTCGA CGTGTGGATG
GAGCACACGA AGGTCGGGCT GTGGGCGGTT CCGGGCGTCG GGCTCTACGC CTCCGGGCTG
CGCGTCCGCG ACGTCTTCGC CGACGGTCTC CACGTCCACG GCGGCAGCAA CGGCACCCGG
ATCGACCAGT CGCAGGTGCG CAACAGCGGC GACGACAACA TCGCGCTGGA CACCGAGGGC
GGCGACGTCG TCCGCTGCTC GCTGGTGCAC AACACCGTTC AGAGTCCGAT CCAGGCCAAC
GGCATCGGTG TCTACGGCGG CAACGGCAAC GCCGTCGTCG CCAATCAGGT CTCTGACACC
GTCGCGTTCG GCGCGGGCAT CACCGTCAGC ACCCGGTTCG GAGGCGGGTT CACCGGCCCG
ACCACGGTGT CCGGCAACGC GCTGACACGC GCCGGATCGT ATGAGTACAA CTGGGGTTCG
AGCCTCGGCG CACTGTGGAT CTACGCGAGC CAGTCCGACA TCACCCAGCC GGTGACCGTC
TCCACCAACA CGATCACCAG CGCCACCTAC GACGCCCTGC TTCTGGGTGA CAGCAAGCAG
ATCGCCAACC TGACGCTCGA TCACCTCGCG ATCAGCGGCG CGGGCGGATA CGGCATCAAC
ATCAAGAACC TGACCGGCGG GATGACGGCG AACTATGTGA CCGTCACCGG CGCCGCGTCC
GGCGGACTGA ACAACCCCTC GAACTACCCG ATCACGCGCG GTCCGGGGGA CAGCGGCTGG
TAG
 
Protein sequence
MPPTPTTPPA ASPSSAIPRG RRRALAASTA GAVTFATIAA WSMAHSASAA GTTYEAEKAA 
LSGGTVVASD HANYTGTGFV GGYTDANKGA ANTTFTVNAS AAGNESVALR YANGTGAQMS
LSLYVNGTKL KQILLPASAN WDTWTTETES VALKAGTDTI SYKFDSADLG NVNVDNITVT
PAAPPPAGQF EAESAALSGG TVVASDHANY TGTGFVGGYT DANKGAANTT FTVSESGAGS
TTTTLRYANG TGAQMSLSLY VNGTKIKQIL LPATANWDTW GTETESVSLN AGSNAVSYKF
DSSDLGNVNI DNIVVGAITP PTTTPSTSPS SSSPPPSGTP YEAETAFTAG GPSVATSISG
YSGTGYLTGF TTQGAETVID TDVPAAGSDA VTLRYANSTG SAQTISLYVN GLKNAQLSLP
AGSGWLTSSR TIALRSGENL IGVQHDSGDT GNVAIDDVTV ANGTALAAVG ATLPYTEYTA
TSSQTQTNGT VLAASTAYPS IQAESTGRRA VQLTATGQYM QVTLAHPTNS IVVRYSIPDN
GDGSAASAPI ALYANGNKIQ DLTLTTKYSW LYGGGYYDTN TPSSGPAHHF YDETRALIGN
WPAGTVLKLQ KDSGDTAASY TFDVIDTEQV DPAFAIPANF VPITNYGVTP NNGADDTNAI
NSALSALAGT GKGLFFPSGT YDISGRININ GVPVRGAGEW YTTIQSTAVN GSGGLYTTAG
VNQIADLTIS GDQTSRNNDS GAAAIEGTFA QGSLLFDVWM EHTKVGLWAV PGVGLYASGL
RVRDVFADGL HVHGGSNGTR IDQSQVRNSG DDNIALDTEG GDVVRCSLVH NTVQSPIQAN
GIGVYGGNGN AVVANQVSDT VAFGAGITVS TRFGGGFTGP TTVSGNALTR AGSYEYNWGS
SLGALWIYAS QSDITQPVTV STNTITSATY DALLLGDSKQ IANLTLDHLA ISGAGGYGIN
IKNLTGGMTA NYVTVTGAAS GGLNNPSNYP ITRGPGDSGW