Gene Caci_6724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6724 
Symbol 
ID8338088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7754124 
End bp7755815 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content69% 
IMG OID644959818 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003117411 
Protein GI256395847 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACAC CGCAGAGCCT CGGGCGCCGC GGATTCCTGC GCGGCGTCGG CGCCGCGGCG 
GCCCTCACCG CGGGCGGCTC GACGCTGGCC GCGTGCGGCA GCGGCAAGGC CGCGGCCGTG
AACGAGGCGG GCAGCGCCGC GCAGGTCCAG CTGCCGACGT ACACGCCGCT GGCCAACGGT
CCGACGCCGG ACCTGCCGGG CACCGACGCC GGCGTCCCGG CCGGGTTCTA CGACTACCCG
GCGTCCCCGA CCGCGGCGTT CGCCAGCCCG CCGCTGTCCG GCGGCAAGTT CTCGGCGATG
ACGCCGCTGT TCACCGCTCC CCCGCCGGCC CGCGGCTCGA ACCCGGCGTG GCAGGCGATG
GAGAAGAAGC TCGGCGCGAG CGTCGACATC ACGATGGTGG TCGGCGACGA CTTCGACACC
AAGCTCTCCA CCCTGATCGC CGGCGGCGGA CTGCCGGACC TGATCCAGTA CGACGGCCTC
GGCGGCGTCC CGACCATCAG CAACCTGCCG CAGTTCCTGG ACTCGCAGAT CGCCGACCTG
ACCGCGCTGA TCGGCGGCGA CAAGGTCAAG GAATACCCGC ATCTGGCCGC CATCCCGAAG
GTGTTCTGGG AGCAGTGCAC GGTCGCCGGG AAGCTGTACT TCATCCCCAT CCCGCGCGGC
ATCAGCGCCG GCGCCGGGCT GTACCGGCAG GACCTGTTCG CCGCCGCCGG AGTCACCAGC
AACAAGGACA TCAAGAACTC CGACGACTTC TTCACGCTTC TCAAGGAGCT GACGAACCCG
GGCAAGGACC GCTACGCGCT GGCCGGCAAC TCCGGCAACG GCGGCTATTC CGGGGCGATC
TTCGAGCAGA TCTTCGGGGT CCCGAACAAG TGGCGGGTGG ACGGCGGCGG CAAGCTGACC
GCGGACATCG AGACCGACGA GTTCCGGGCG GCGCTGGAGT TCATGGTGAA GGTAGCCAAG
GCCGGCTGCT TCTATCCGGG CGCGCAGGGC TGGACCAAGG CCAAGATGGA GGACGCCTTC
CAGTCCGGTA AGGCGGCCAT GATCTACGAC GGTCTCCCGG CGCTGTCCAC CAGCGTCTGG
GCCACCGCGC AGAAGATCGA CCCGAACGCC AAGCTGATGC CGTTCGTCCC GTTCGGCGCC
ACCGGCGGAC CCGGCGTCGC CTGGCAGGAC AACGTGGTCT TCGCCGGCAC GATGCTGAAG
AAGGCGGACC CGGCGAAGCT GGCGGAGGTC CTGAAGCTCG CCGACTTCCT CGCCGCGCCG
TTCGGCACCG AGGAGTACCT GCTCAAGACC TACGGCGTCG AAGGCGCGGA CTACACGCTG
GACGCCAACC ACAACCCGGT GCAGACCGCC CAAGGCAAGA ACGACGCGAA CGTCACCTGG
AAGTACGTCG CCGCGCCGCA GCTGGTCACC TACAACCCCG GTGTCAATGC TCTGACGGAC
GCGGTCCACC AGGCCTACAC CGAGCTGGTG CCGATCGCCG TGCCGAACCC GACCGCGACG
CTGTACTCGC CGACCTTCGG CAAGCAAGGC GTGGCGCTGT ACAAGGCGGT CACGGACACC
GTGACGCAGG TGATCGGCGG GCAGTCGAGC ATGAGCGCCT TCGACAACGC GGTGAAGACC
TGGCGCAGCG GCGGCGGCGA CCAGATGCGG TCGGAGTTCG AGCAGGCGTA CGCGAGCGCG
CCGAAGAGCT GA
 
Protein sequence
MHTPQSLGRR GFLRGVGAAA ALTAGGSTLA ACGSGKAAAV NEAGSAAQVQ LPTYTPLANG 
PTPDLPGTDA GVPAGFYDYP ASPTAAFASP PLSGGKFSAM TPLFTAPPPA RGSNPAWQAM
EKKLGASVDI TMVVGDDFDT KLSTLIAGGG LPDLIQYDGL GGVPTISNLP QFLDSQIADL
TALIGGDKVK EYPHLAAIPK VFWEQCTVAG KLYFIPIPRG ISAGAGLYRQ DLFAAAGVTS
NKDIKNSDDF FTLLKELTNP GKDRYALAGN SGNGGYSGAI FEQIFGVPNK WRVDGGGKLT
ADIETDEFRA ALEFMVKVAK AGCFYPGAQG WTKAKMEDAF QSGKAAMIYD GLPALSTSVW
ATAQKIDPNA KLMPFVPFGA TGGPGVAWQD NVVFAGTMLK KADPAKLAEV LKLADFLAAP
FGTEEYLLKT YGVEGADYTL DANHNPVQTA QGKNDANVTW KYVAAPQLVT YNPGVNALTD
AVHQAYTELV PIAVPNPTAT LYSPTFGKQG VALYKAVTDT VTQVIGGQSS MSAFDNAVKT
WRSGGGDQMR SEFEQAYASA PKS