Gene Caci_4320 details


Gene Information

Locus tag: Caci_4320
Symbol:
ID: 8335674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4903746
End bp: 4905023
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 70%
IMG OID: 644957423
Product: extracellular solute-binding protein family 1
Protein accession: YP_003115025
Protein GI: 256393461
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
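
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4903746, 4905023)
print(length, protein_length_aa(length))  # 1278 425
```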


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.811375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00841489
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGACGC ATGTGAAAGT CGCCGCCGCG GCCGTCGCCG CCGCCGGCTT GCTCACCGCC 
GCGGGCTGCG GCAGCAGTTC CCCGGCCGGC TCCGGCTCCG TGTCGGGGTC GGGCGGCGGC
GTGGTGAAGC TGTCGGTGCT GACCGGGTTC ACCGGTCCGG ACGGACCCTC CTACCAGGCG
CTGGTCTCGC AGTTCAACGC CTCGCACCCG ACCATCAAGG TCACCATGGA CATCCAGCCC
TGGGACGCCA TCGGGCAGAA GCTGCCGGCG GAGTGGGCCA CCGGCCAGGG CCCGGACCTG
GCGACGCCGA ACTTCGACCC GGGCGTGATC TTCAACTACA TCAAGACCAA TTCGGTGCTG
CCGCTGGACT CCTCGGTGGG CGCCGCCGAC AGCCAGATCA ACGCCAGCGC CTTCCCGCCC
GCGGTGACCA AAGCGTTCAC GGTCAACGGC CACCTGTACG CGGTCCCGGC GAACCTGGCG
ACCGTCGCGC TCTACTACAA CAAGACGATG TTCACCGCCG CCGGCATCAC CGACCCGCCG
AAGACCTCGG AGGAGTTCGT CGCCGACGTC AAGAAGCTGA CACTCGGCGG GGCGAGCCCG
ACGCAGTACG GCATCTCGCT GGCCGACCAC CAGACCATCG AGATGTGGCC GATCCTGCAG
TGGATGAACG GCGGGGACAT CGTCGGCCCG GACGGCTGCG CGACGATCGA CTCCGCGGCC
AGCGTGCAGG CGCTGTCGAC CTGGGCCGGG ATGGTCCAGA ACCAGCACGT CAGCCCGGTG
GGCCAGACCG GCGCCGACGC CGACACGCTG TTCTCGGCGA AGAAGGCGGC GATGGAGCTC
AACGGTCCCT GGGCCGCCGA CGGCTTCCGC AAGGCCGGGA TCGACCTGGG CATCGCCCCG
GTCCCGGCCG GCTCCTCCGG CCCGGTCACC CTGGCCTCGA CCGTGCCGAT GATGGTCGCC
AAGAACACCA AGCACAAGGA GCAGGCGCTG GAGTTCCTGA GCTGGTGGAC CGGAAAGACC
GCGCAGGCCT CCTTCTCCAA GGGCTCCGGC TACCCGCCGG CGCGCAGCGA CGTCACCGTC
TCAGACCCCA ACGTCGCGGT GTTCGCCCAA GGGCTGCCGA CCGCGCGCCT CTACCTGGCC
GGACTCCCCA CCTCCTCGCA GATCGACACC GACATCTACA CCCCGCTGCT GGGCCAGCTC
ACCCGGGGCG CGGACGCCCA GAAGTCGGCC GACGCGGCCG CGAAGTCCAT CAACCAGCTC
ACCGGCTGCA AGAGCTGA
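
A GC percentage such as the one in the Gene Information section can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch with a hypothetical helper name; it strips whitespace and counts G and C bases, and it is demonstrated only on the first two blocks of the gene sequence, not the full record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence (8 of 20 bases are G or C).
print(gc_content("ATGAAGACGC ATGTGAAAGT"))  # 40.0
```

Passing the whole gene sequence, line breaks included, should work the same way, since all whitespace is removed before counting.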
 
Protein sequence
MKTHVKVAAA AVAAAGLLTA AGCGSSSPAG SGSVSGSGGG VVKLSVLTGF TGPDGPSYQA 
LVSQFNASHP TIKVTMDIQP WDAIGQKLPA EWATGQGPDL ATPNFDPGVI FNYIKTNSVL
PLDSSVGAAD SQINASAFPP AVTKAFTVNG HLYAVPANLA TVALYYNKTM FTAAGITDPP
KTSEEFVADV KKLTLGGASP TQYGISLADH QTIEMWPILQ WMNGGDIVGP DGCATIDSAA
SVQALSTWAG MVQNQHVSPV GQTGADADTL FSAKKAAMEL NGPWAADGFR KAGIDLGIAP
VPAGSSGPVT LASTVPMMVA KNTKHKEQAL EFLSWWTGKT AQASFSKGSG YPPARSDVTV
SDPNVAVFAQ GLPTARLYLA GLPTSSQIDT DIYTPLLGQL TRGADAQKSA DAAAKSINQL
TGCKS
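
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce it without external libraries; the names are hypothetical. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAAGACGC ATGTGAAAGT C"))  # MKTHVKV (start of the protein)
print(translate("ACCGGCTGCA AGAGCTGA"))      # TGCKS (end of the protein)
```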