Gene Caci_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2108 
Symbol 
ID8333453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2392296 
End bp2393588 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content67% 
IMG OID644955258 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003112868 
Protein GI256391304 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.553068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGGA AGCGCGGCGC CGCGATTGCG GCGATAGCGG TAGTGGCTCT CGCTGCAAGT 
GCTTGCAGCA GCAGCAAGTC GGGTGGCTCC TCGGCCGCCG CCAAGGATCC GGGCTCGGTG
AAGGGGAGCG TCACCTGGTG GGACACCTCT GATGCCACCA ATGAGGCGCC GAACTACCAG
CCGATCATCA AGGCGTTCGA GGCCAAGTAC CCGAACATCA AGGTGAACTA CGTCAACGTG
CCGTTCTCGG ACGCCAAGGA CAAGTTCAAG ACCGCCGCGC AGTCCGGCAG CGGGGCTCCG
GACGTGCTGC GCGCGGACGT CGGCTGGACG CCCGCGTTCG CGCAGCTGGG CTACCTGCAG
CCGCTGGACG GGACGCCCGC GCTGCAGGAC GCCGCCGACT ACATGCCCGG GCCGTACGCC
TCGGACCACT ACAACGGCAA GATTTACGGC GTGCCGCAGG TCACCGACAC GCTCACGCTG
CTGTACAACA AGGACCTGCT CACCAAGGCC GGCATCACCA CGCCGCCGAA GACCTGGGCC
GAGCTGAAGA GCGACAGCCT GCAGATCAAG GCCAAGACCG GGGTGGACGG CACGTTCCTG
GACGCCGCGT CCTACTACCT GCTGCCGTTC ATCTACGGCG AGGGCGGCGA CATCATCGAC
GCCTCGGCCA AGAAGATCAC GGTCAACGAC GCGACCACGC TCAAGGCCGT GGGCATCGCG
CAGGACCTGG TGAAGTCCGG CGCGGCGGTC ACCGACGTGA CCAAGGACGG CTACACCAAC
ATGCAGACCG CCTTCAAGGA CGGCAAGGTC GCCATGGTCA TCAACGGCCC GTGGTCCACC
TCCGACGACC TGAAGGGCTC GGCCTTCGGC AGCGCCGACA ACCTCGGCAT CGCCACCGTC
CCGGCCGGCT CGGTCAAGGC CGGCGCCCCG GTCGGCGGCC ACAACCTGGT CGTCTACGCG
GGCTCGAAGA ACCTCGACGC CACCTACCTG TTCGTGGCGT TCCTGAACGA CGCGCAGAAC
CAGGCGACCA TCGCGGCCAA GAACAACGTG CTGCCGACCC GCACCTCGGC CTACTCCGAC
CCGCAGGTCG CGAACAACAA GATCCTGTCG GCCTTCGAGG GCCCGCTGAA GAACGCGGTC
GGCCGTCCGC CGGTCGCCGG CGCCTCGGAC CTGTTCACCC CGCTGGACAC CGACTACCAG
GCGATCCTCG GCGGTCAGAA GAGCGCCCAG GACGGGCTGA ACGACGCCGC GACGCAGTTC
GCGCAGATCC TGCCGGACTT CAGCAAGAGC TGA
 
Protein sequence
MRRKRGAAIA AIAVVALAAS ACSSSKSGGS SAAAKDPGSV KGSVTWWDTS DATNEAPNYQ 
PIIKAFEAKY PNIKVNYVNV PFSDAKDKFK TAAQSGSGAP DVLRADVGWT PAFAQLGYLQ
PLDGTPALQD AADYMPGPYA SDHYNGKIYG VPQVTDTLTL LYNKDLLTKA GITTPPKTWA
ELKSDSLQIK AKTGVDGTFL DAASYYLLPF IYGEGGDIID ASAKKITVND ATTLKAVGIA
QDLVKSGAAV TDVTKDGYTN MQTAFKDGKV AMVINGPWST SDDLKGSAFG SADNLGIATV
PAGSVKAGAP VGGHNLVVYA GSKNLDATYL FVAFLNDAQN QATIAAKNNV LPTRTSAYSD
PQVANNKILS AFEGPLKNAV GRPPVAGASD LFTPLDTDYQ AILGGQKSAQ DGLNDAATQF
AQILPDFSKS