Gene Caci_0891 details


Gene Information

Locus tag: Caci_0891
Symbol:
ID: 8332222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1037624
End bp: 1038940
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 67%
IMG OID: 644954042
Product: extracellular solute-binding protein family 1
Protein accession: YP_003111665
Protein GI: 256390101
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
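
The length fields in this record agree with the coordinates if start and end are read as 1-based, inclusive positions. The record does not state that convention, so treat it as an assumption. A minimal Python sketch of the arithmetic follows; the function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS that ends with a single stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1037624, 1038940)
print(length)                     # 1317
print(protein_length_aa(length))  # 438
```

Both values match the Gene Length and Protein Length fields listed in the record.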


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0493428
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGCA CCACCGGCCG GGGCCCCCGG CTTCTCGCGA TAGCCACCGC AGCCGTTCTC 
ACGGTCGGCG TCAGCGCGTG TTCCAGCAGC AAGAGCAGCT CCTCGGCCGT CGGCGGCGCC
GGCAAGTCCG GCGGCAGCTT CACGTACTGG TCGATGTGGC GTCAGGACGA GCCGCAGGCC
AAGGTCATCC AGGCGGCGAT CACCCAGTTC ACCGCCGACA CCGGCATCAA GGTCGACGTC
GAGTGGACCG GCCGGGACGT CGCCAAGAAG ATCGGCCCGG CGATGGCCGC GGGCAAGGCG
CCGGACATGT GGGACGAGGG CGCGGACGTC ATCTACGGCG CCACCGCGCA GAACGGCAAC
GCCAAGGACC TCTCGGCGGT CCTGGACATG ACCATCCCGA CCGACAACGA GAAAGTCTCC
GACGCGATCC CGTCGAAGTA CTGGGACTCG CTGCCGAAGG ACCCCAACGG CGGCCAGCAC
TGGGTGATCC CCTACGAGGC CAGCACCGCG GGCATCTTCT ACAACACCGC CGACCCGACC
GTCTCCGCGG CGATGGCCTC GCAGCCGTCC ACCTGGGACG CGTTCATGCA GGTCTGCGCG
ACTCTGAAGA CGAAGAGCGA GCCCTGTCTT GCCTCCGAGG GCGAAGACCC CTGGACCAAC
GGTCTGTGGT TCGACTACCT GATCAACGCA GGCGGCGTGA ACTTCAACGA CCTGGCGAAC
GACAAGACCG GCGCCAGCTG GGACAACCCG GCCGTGCTGA AGGCGGCGAC ACAGGTCGAG
CAGCTCGTCA AGGGCGGCTA CATCATCCCG ACCTACACCG CCACGAAGTA CCCGGCGCAG
CAGACCAACT GGGCCGGCGG CAAGGCCGGC TTCCTGATGA ACGGCAACTG GGTCACCGCC
GAGGTCGCCA AGCAGATCCC GGCGACCTGG AAGTACGGCT TCATGCTCCC GCCGGGCGCG
ACCCAGCCGG ACTCGATGGT CTTCGGCTTC GCCCTGACCA AGAACGCCAA GAACGTCAGC
CAGGCCGAGC AGTTCATGGC CTACTTCCTG CAGAAGAAGA CGCTCTCGGG CATCTCCACC
GAGGCCGGCA ACATCACGCC GCGCACCGAC ATCCCGGCGC CGGCGGAGCT GGCCGACGTG
CAGAAGACCC TGAACGCGCC CAAGCTGCGC CTCACCTTCG ACGGCGTGGC CGGGGACTGG
ACCACCAAGG TCTGGAACCA GAACTACCTG GACTTCTGGC ACGGCAAGAT CGACGCCGCG
ACCTTCGTGG CGAAGATGAA GTCCGCGCAG GTCTCGTTCT GGAAGAGCCA GAGCTGA
 
Protein sequence
MGSTTGRGPR LLAIATAAVL TVGVSACSSS KSSSSAVGGA GKSGGSFTYW SMWRQDEPQA 
KVIQAAITQF TADTGIKVDV EWTGRDVAKK IGPAMAAGKA PDMWDEGADV IYGATAQNGN
AKDLSAVLDM TIPTDNEKVS DAIPSKYWDS LPKDPNGGQH WVIPYEASTA GIFYNTADPT
VSAAMASQPS TWDAFMQVCA TLKTKSEPCL ASEGEDPWTN GLWFDYLINA GGVNFNDLAN
DKTGASWDNP AVLKAATQVE QLVKGGYIIP TYTATKYPAQ QTNWAGGKAG FLMNGNWVTA
EVAKQIPATW KYGFMLPPGA TQPDSMVFGF ALTKNAKNVS QAEQFMAYFL QKKTLSGIST
EAGNITPRTD IPAPAELADV QKTLNAPKLR LTFDGVAGDW TTKVWNQNYL DFWHGKIDAA
TFVAKMKSAQ VSFWKSQS
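
The gene and protein sequences above can be cross-checked by translating the DNA. The sketch below is a minimal, dependency-free Python example, not the method the database itself uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the final line of the gene sequence from this record.
first_bases = "ATGGGCAGCA CCACCGGCCG GGGCCCCCGG"
last_line = "ACCTTCGTGG CGAAGATGAA GTCCGCGCAG GTCTCGTTCT GGAAGAGCCA GAGCTGA"

print(translate(first_bases))  # MGSTTGRGPR
print(translate(last_line))    # TFVAKMKSAQVSFWKSQS
```

The two outputs match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.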