Gene Caci_0130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0130 
Symbol 
ID8331455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp133375 
End bp135060 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content67% 
IMG OID644953297 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003110926 
Protein GI256389362 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTCA CCAGCACCGC GCAGCAGATC TCCCGCAGGA CCGTCCTGCG TACGACCGGA 
GCCGCCGCCG TCGCGGCCGT CGCCGTTCCG GCCCTGGCGG CTTGCGGGGG GTCGAAGACC
TCCTCCGGCG CCGCGCAGTC CAACGTCGAC AAGAAGCTCA TGGCCTGGCC GACCTACACT
CCGGCGGCGG GTCTGCACCC TGATATGCCT GGCACCGCGG CAGGCGTGCA GGACACGTTC
CTGCGCTATC CGTCCAACCT GATCCAGTCG GTGCCCGCCA AGCCCGGCGA CGGGTCGAAG
GTGCGCGCGC TGATCGTCAC CTACGGTACG CAGCCCAAGG GCCCGGACCA GAACCAGCTG
TGGAAGGCGG TCAACGACGC CGTCGGCGTG GACCTCGAGC TGACCATGGT CACGGACGCC
GACTGGCAGA CCAAGCTCGG CGCCATGATG GCCGCCTCGG ATCTGCCGGA CATCATCATG
CTCGGGCTCT ACCAGCTGCC GAACGAGGCG CAGTTCCTGC AGGCCAAGTG CGAGCCGCTG
GGGCAGTACC TGGCCGGGGA CGCGGGCGCG AAGGCGTATC CGAACCTGGC CGCGATACCG
CCGTACAGCT GGGATTCGGT GGGACGCGTC GGCGGCGACT TCTATGGGAT CCCGATCCAC
CGGCCGCGTC TGGGGAACTC GTTCTTCGCC GACTCCGACC TGTTCCAGCA GGCCGGGATC
TGGAACCCGA AGCCGGGCGG GCTGTCGAAG GCGGAGCTGA CCGCCGGGCT GATGAAGCTG
AACACGCAAG GGCACTTCGC GCTCGGTACC AACAAGGTCG CCTCATTCGG TTACCTGACG
CACTCCGGGG TGCACGGCAC GCCTAACCTG TGGTCGCTCG CCAACGGTCA GTTCACCACC
GCGTACGGCA CCGACAGCAT GAAGCAGTCG CTGGCGACCA TGGCCGATTG GTACGGCAAG
GGACTCTACG ACCCGGCGGC GCTGACCGTG TCGAGCACGC AGTGCAAGAC CGACTTCCAG
AACGGTACTT ACGTCACCAC CACCGACGGC TTCGGCGGGT TCGGCGGCTA CGCGACCGCC
GTCAACGAGA AGTGGAAGGT CGACTTCGTC CGGCCCTTCG ACGCCGGTAC CGGCGCCAAG
CCGACGCCGT GGCTCACCCC CGGCTACTTC GGCTACACGG TACTGAAGAA GACGACACCG
GAGCGCGCCA AGATGCTGCT CGGCGTGCTG AACTTCCTCG CCGCGCCGTT CGGCTCCAAG
GAGTGGGAGC TGATCAACTA CGGGCTCGAA GGCGTGCACT TCAACCGGGG TGCGGACGGC
GGTCCGTCGG CGCCGACCGC GTTGGGCAAG ATCGAGAACT CGGTGAACGT GCCGGTCAAG
TACGTCATGG CCGCTCCGCT GGTGAACTAC CTCGCCGGCG AGCCGGAGGC AGCCAAGCGC
TGCTATCAGG CGCAGGTGGA CATCGTGCCC ATCGGCGTGA CAGACCCGAG CCTGGGCGTC
CAGTCGGCGA CCCGGAACAA GCAGTGGCCG ACGCTCTTGC AGCAGATCCA GGACGGGATG
AACCAGATCA TCACCGGGAA GGCGCAGCTC TCGTCCTGGG ACGACGTCAT CAAGAAGTGG
AAGAGCAGCG GCGGGGACCA GATCGCCGCC GAACTCGGCG CCGAGTACGC CAAGACTCAC
GGTTGA
 
Protein sequence
MPFTSTAQQI SRRTVLRTTG AAAVAAVAVP ALAACGGSKT SSGAAQSNVD KKLMAWPTYT 
PAAGLHPDMP GTAAGVQDTF LRYPSNLIQS VPAKPGDGSK VRALIVTYGT QPKGPDQNQL
WKAVNDAVGV DLELTMVTDA DWQTKLGAMM AASDLPDIIM LGLYQLPNEA QFLQAKCEPL
GQYLAGDAGA KAYPNLAAIP PYSWDSVGRV GGDFYGIPIH RPRLGNSFFA DSDLFQQAGI
WNPKPGGLSK AELTAGLMKL NTQGHFALGT NKVASFGYLT HSGVHGTPNL WSLANGQFTT
AYGTDSMKQS LATMADWYGK GLYDPAALTV SSTQCKTDFQ NGTYVTTTDG FGGFGGYATA
VNEKWKVDFV RPFDAGTGAK PTPWLTPGYF GYTVLKKTTP ERAKMLLGVL NFLAAPFGSK
EWELINYGLE GVHFNRGADG GPSAPTALGK IENSVNVPVK YVMAAPLVNY LAGEPEAAKR
CYQAQVDIVP IGVTDPSLGV QSATRNKQWP TLLQQIQDGM NQIITGKAQL SSWDDVIKKW
KSSGGDQIAA ELGAEYAKTH G