Gene Caci_5224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5224 
Symbol 
ID8336578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6012541 
End bp6014241 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content69% 
IMG OID644958322 
ProductCarbohydrate binding family 6 
Protein accessionYP_003115924 
Protein GI256394360 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.105995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAATC GCAGAAAGCT GCTGTCCTCC GCCGTCGCCC TCGGCGGCTC CGCACTGGGC 
CTGGGCACCG GCGCCCTGGC CTGGCGGGCT TCGGCGACCA CCCCCACCTT GCAGGTCGCC
CTGCAGAACA CCACGACGTC GAACCAGGTC TACGCCTACG TCACGGGCCA GGCGATCGAC
AACAACAATG CCCTGATGCT CTTGGAGGCC GACGGGCACA CGGTCTACTA CCCGACCTCG
CCGAGTTCGA CCGGCTCCCC GCTGGCCGCG GACTGCGCGA TCCGGCTCGG CGCGCCGGGC
AGCACCACCA CGATCACCAT CCCGCACATC GCCGGCGGCC GGATCTGGTT CGCCATCGGA
GCGCCGCTGA CCTTCCTGCT CAACCCGGGA CCGGGTCTGG TCGAGCCCTC CGTGAGCAAC
CAGTCGGACC CCAACATCAA CATCCGCTGG GACTTCTGCG AGTTCACGTA CAACGCCGCG
CAGATGTTCG CCAACATCAG CTACGTCGAC TTCGTCTCGA TCCCGATCTC GCTGGCGCTG
ACCAACGGCT CCGGCGCCAC GCAGACCGTC AGCGGCCTGC CGACCAACGG TCTGGACACG
GTCTGCTCGA ACCTGAACGC CCAGCACGCC GCGGACGGCG CCGGCTGGAA CCAGCTGGTC
GTCACCTCCG GCGGCGCCAA TTTGCGCGCG CTGAGCCCGA ACAACGGCAT CGTGATCAAC
AACTCGCTGT TCTCGGGGTA CTACCAGCCC TACGTGGACC AGGTGTGGTC CAAATACTCG
AGCCAGGCGC TGGCCGTGGA CACCCAGGCC TCCTGGGGCA CCGTCAACGG CCAGGTCTCC
GGCGGGACGT TGACCTTCCC GGGGCTGGGA AGCTTCGCCA AACCCTCGGC CGCCGACATC
TTCAGCTGCA GCACCGGGCC GTTCGCCAAC ACCGCCGGCG CGATGGGCCC GCTGGTCGCC
CGCATCAGCG CCGCCTTCAA TCGCAGCACC CTGCTGATCG ACGCCACCCA GCCCGACGGC
GAGAACCCCG CGAACTACTA CAAGAACGCG ATCACCAACC ACTACTCGCG GATCGCGCAC
GCGGCGAACC TGGACAGCCG CGGCTACGGC TTCCCCTACG ACGACGTGGC GCCCAACGGC
GGCGCCGACC AGTCCGGCGC GGTCTCCGAC GGCAACCCGA CGCTGCTGAC GGTGGCCGTC
GGCGGCGGCA CGGCCACCGG CCCGGGCGGC GGCGGACCGA GTCAGCCGTC GAGCCCGAGC
AGCACCGGCG GCGGAGGCGG CGGCACGGTC AGCGCCTTCA CCACGATCCA GGCGGCGAGC
TACAGCTCGC ACAACGGCAC GCAGAACGAG ACCACCAGCG ACACCGGCGG CGGCCAGGAC
GTCGGCTGGA TCGGAGGAGG CGACTGGCTC GCCTACGCCA ACGTCGACTT CGGCAGCGCG
GGCGCGACGC AGTTCAAGGC CCGGGTCGCC TCCGGCGCCG CGGCAGGTGT CAGCGGTCTG
ATCAAGGTCG CGCTGGACAG CCCGACGGCT GCGCCGGTCG GCAGCTTCGC CGTCGGCAAC
ACCGGCGGCT GGCAGACCTG GCAGACCGTG CCGGCCAACA TCAGCAAGGT CACCGGAAAG
CACACGGTCT ACCTGGTGTT CTCCAGCGGC CAGCCCGCCG ACTTCGTGAA CGTGCACTGG
TTCACCTTCA GCCAGACCTG A
 
Protein sequence
MMNRRKLLSS AVALGGSALG LGTGALAWRA SATTPTLQVA LQNTTTSNQV YAYVTGQAID 
NNNALMLLEA DGHTVYYPTS PSSTGSPLAA DCAIRLGAPG STTTITIPHI AGGRIWFAIG
APLTFLLNPG PGLVEPSVSN QSDPNINIRW DFCEFTYNAA QMFANISYVD FVSIPISLAL
TNGSGATQTV SGLPTNGLDT VCSNLNAQHA ADGAGWNQLV VTSGGANLRA LSPNNGIVIN
NSLFSGYYQP YVDQVWSKYS SQALAVDTQA SWGTVNGQVS GGTLTFPGLG SFAKPSAADI
FSCSTGPFAN TAGAMGPLVA RISAAFNRST LLIDATQPDG ENPANYYKNA ITNHYSRIAH
AANLDSRGYG FPYDDVAPNG GADQSGAVSD GNPTLLTVAV GGGTATGPGG GGPSQPSSPS
STGGGGGGTV SAFTTIQAAS YSSHNGTQNE TTSDTGGGQD VGWIGGGDWL AYANVDFGSA
GATQFKARVA SGAAAGVSGL IKVALDSPTA APVGSFAVGN TGGWQTWQTV PANISKVTGK
HTVYLVFSSG QPADFVNVHW FTFSQT