Gene Caci_1941 details

Gene Information

Locus tag: Caci_1941
Symbol:
ID: 8333284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 2196203
End bp: 2197666
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 67%
IMG OID: 644955090
Product: extracellular solute-binding protein family 1
Protein accession: YP_003112702
Protein GI: 256391138
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
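
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2196203
end_bp = 2197666
protein_length_aa = 487

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons.
# Assumption: the final codon is a stop codon and encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1464 bp) and Protein Length (487 aa).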


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.948884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.342758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAGATC ACGGTTTGCC AGCCGACGTC TCCCGCCGGC AGATGTTGGT CCGTTCGGCC 
GCGATCGCGG CCATGGTCGG CCCGGGCTCG GCCCTGGCAG CCGGGTGCGC GGCCGGCAGC
GGGAGCAGCC CCAAGAACAA CAACGCGGGC TCCACCACCG CCGCGAACTC CTCGGACCCG
AAGAACCCGT TCGGAGTGAA GGCCTCCAGC CCGCTGGATG TCTACGTCTT CAAGGGCGGC
TACGGCGACG ACTACGCGAA GGCCTTCGAG GCGATGTACT CCTCGAAGTT CTCCGGCTCG
CAGGTCTCGC ACCACAGCGG CCAAAACCTC ACCGGCGACC TGCAGCCCCG GTTCAACGCC
GGCAGCCCGC CGGACGTCAT CGACGACTCC GGCGCCCAGC AGCTGAAGCT GGACGTGTTG
AACAGCAGCG GCCAGCTGAC CAACCTGGAC CGGCTCTTGG ACGCGCCCTC GATCGACGAC
CCGAGCAAGA AGGTCCGCGA CACCCTGCTG GCCGGCACGA TCGAGACGGG CCAGCTCGGC
CAGAGCATGT TCTCGCTGAA CTACGCGTTC ACCGTCTTCG GGCTGTGGTA CTCCACCGCG
CTGTTCCAGA AGAACAACTG GCAGGTCCCC ACCTCGTGGG AAGACTTCAT GACGCTGTGC
GCGACCATCA AGGCCAGCGG CATCGCCCCG TTCGCGCACC AGGGCAAGTA CCCGTACTAC
ATGCTGGTGC CGCTGATGGA CATGGTCGCC AAGAACGGCG GCCCGGACGT GCAGACCGCC
ATCGACAACC TGGAGCCCAA CGCCTGGAAG TCCGACGCGG TCAAGAACAG CGTCGATGCC
CTTTACGAGC TGGTGGACAA GGGCTACATG CTCCCGGGCA CCGAGGGTCT GACCCACATC
CAGTCCCAGA CCCTGTGGAA CCAGGGCAAG GCGGCGGTCA TCCCCTGCGG TTCGTGGCTG
GAGAACGAGC AGCTGTCCGC CACCCCGGCG GGCTTCAACA TGGCCGTGTT CGCCATGCCC
TCGCTCAGCG GCGACAAGAT GCCGCAGACC GCGATCCGGG CCGGCGCCGG CGAGCCGTTC
ATCGTCCCGA GCAAGGCCAA GAACCCGGCC GGCGGCCTGG AGTTCCTGCG CATCATGTGC
TCCAAGGCCG GCGGCGCCTC CTTCGCGCAG AAGGCGAACT CCCTGTCGGT GGTGAAGGAC
GCGATCACCC CGGACATCGA GGCCAAGCTG CTGCCGGGCA CCAAGTCCAG CAACGACCTC
TACCAGGCCG CCAACGGCAA GGTCATCTCC TGGTACTACC TGAACTGGTA CTCCCAGATG
GAGAAGGACC TCGAGGACGC CATGGGCCAG CTGATGGCGA ACAAGATCAA GCCGGCCGAG
TTCATCACCC GCGCGCAGGC GGCGGCCGAC AAGTGCGCCG GCGACTCCTC GGTGCAGAAG
TTCAAGCGCC CGACCACCGC CTGA
 
Protein sequence
MSDHGLPADV SRRQMLVRSA AIAAMVGPGS ALAAGCAAGS GSSPKNNNAG STTAANSSDP 
KNPFGVKASS PLDVYVFKGG YGDDYAKAFE AMYSSKFSGS QVSHHSGQNL TGDLQPRFNA
GSPPDVIDDS GAQQLKLDVL NSSGQLTNLD RLLDAPSIDD PSKKVRDTLL AGTIETGQLG
QSMFSLNYAF TVFGLWYSTA LFQKNNWQVP TSWEDFMTLC ATIKASGIAP FAHQGKYPYY
MLVPLMDMVA KNGGPDVQTA IDNLEPNAWK SDAVKNSVDA LYELVDKGYM LPGTEGLTHI
QSQTLWNQGK AAVIPCGSWL ENEQLSATPA GFNMAVFAMP SLSGDKMPQT AIRAGAGEPF
IVPSKAKNPA GGLEFLRIMC SKAGGASFAQ KANSLSVVKD AITPDIEAKL LPGTKSSNDL
YQAANGKVIS WYYLNWYSQM EKDLEDAMGQ LMANKIKPAE FITRAQAAAD KCAGDSSVQK
FKRPTTA
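
The relationship between the two sequences above can be illustrated by translating the first and last lines of the gene sequence. This is a minimal sketch, not the pipeline that produced the record: it builds the amino acid assignments of translation table 11 (identical to the standard code for elongation) and reads an initial GTG as methionine, since GTG is an accepted start codon in bacteria. The helper `translate` and the `START_CODONS` set are hypothetical names, and the start-codon set is a simplification covering only the most common bacterial starts.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the most common bacterial start codons are listed.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=False):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is read as Met regardless of its usual meaning.
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "GTGTCAGATC ACGGTTTGCC AGCCGACGTC TCCCGCCGGC AGATGTTGGT CCGTTCGGCC"
last_line = "TTCAAGCGCC CGACCACCGC CTGA"

n_terminus = translate(first_line, is_cds_start=True)
c_terminus = translate(last_line)
print(n_terminus)
print(c_terminus)
```

The two printed fragments correspond to the first 20 and last 7 residues of the protein sequence listed above. The last line can be translated on its own because it begins at position 1441, which is in frame.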