Gene Caci_6971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6971 
Symbol 
ID8338337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8061540 
End bp8062835 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content64% 
IMG OID644960051 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003117642 
Protein GI256396078 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.487837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCTCCA CTAAGTACGC GGTCTGCGTG GCATTGGCCG CGGCCGTGTC GCTGGTTTTG 
GCCGGCTGCG GGAGTTCCAG CTCCAAAGCC GCCTCGGGCG GCGGCGGCAA GACGCTCGTG
GTCTGGGACT ACGAAGCCAA CGACAGTGCC TCCGGCATCG CACGCGCCGA GGCGATCAAG
GAGTTCCAGG CCGCCCATCC GGGGGTGACG GTCAAGTTCG AGGCGAAGAG CTTCGACCAG
ATCCAGCAGA ACGCCGGGAT GATCCTCAAC TCCAACGACG TGCCCGACGT GATGGAGTAC
AACAAGGGCA ACTCCACCGC CGGCCTGTTG TCCAAGCAGG GTCTGCTGAC CGATCTGAGC
AGCCAGGCCG CCTCGCGCGG CTGGGACAAG ACGTTGAGTC CGTCGCTGCA GACCACCGCC
AAGTACACCG GCGGCATCAT GGGCGGCAGC ACTTGGTACG GCGTGCCGAT GAACGGCGAA
TTCCTCACCG TCTACTACAA CAAGGACCTG TTCGCGAAGT ACAACGTCCC GGTCCCCACC
ACGCCCGATC AGTTCACGGC GGCGATGGCG ACGTTCAAGG GCGCCGGGGT CACCCCGCTG
GCCATGAGCG GCGCGGACTA CCTCGGGGTG CACCTGTTCT ACGAACTGGC GCTGTCCAAG
GCCGACCGCA CCTGGGTCAA CGACTACCAG CTGTTCAAGG GCAAGGTCGA CTTCCAGGGC
CCGCAGATGT CCTACGCCGC GAACACCTTC GCCGACTGGG TGAAGAAGGG CTACATCAGC
AAGGACTCCG CCGCGGTCAA GGCCCAGGAC GAGGCCAACG CCTTCGAGCA GGGCAAGATC
CCGATGATGT TCTCCGGGAA CTGGTGGTAC GGCCAATTCC TGAGCGAGGT CAAGGGCATG
CAGTGGGGCA CGTTCCTGTT CCCCGGCAAC ACGCTGCAGG TCGGCTCCAG CGGCAACCTG
TGGGTGGTGC CCACCAAGGC CAAGAACAAG GACCTGGCCT ACGACTTCAT CGACACCACG
CTCAGCAAGA ACGTCCAGAA CCTGATGGGC AACAGCGGCG GGGTGCCGGT GGCCGCGGAT
CCGGCGGCGA TCACGAACCC CAGCAGCAAG GAACTGATCA CCGAGTTCGA CTCGATCACC
GCCAAGGACG GCCTGGGGTT CTACCCGGAC TGGCCGGTCA CCGGCTACTA CGACACGCTC
CAGCACGCGA TCCAGGAACT GATCAACGGA TCCAAGAGCC CGAGTTCGAT GCTCGACACC
ATCGGCTCCG CCTACAAGCA GAACGCACCG CAGTAG
 
Protein sequence
MRSTKYAVCV ALAAAVSLVL AGCGSSSSKA ASGGGGKTLV VWDYEANDSA SGIARAEAIK 
EFQAAHPGVT VKFEAKSFDQ IQQNAGMILN SNDVPDVMEY NKGNSTAGLL SKQGLLTDLS
SQAASRGWDK TLSPSLQTTA KYTGGIMGGS TWYGVPMNGE FLTVYYNKDL FAKYNVPVPT
TPDQFTAAMA TFKGAGVTPL AMSGADYLGV HLFYELALSK ADRTWVNDYQ LFKGKVDFQG
PQMSYAANTF ADWVKKGYIS KDSAAVKAQD EANAFEQGKI PMMFSGNWWY GQFLSEVKGM
QWGTFLFPGN TLQVGSSGNL WVVPTKAKNK DLAYDFIDTT LSKNVQNLMG NSGGVPVAAD
PAAITNPSSK ELITEFDSIT AKDGLGFYPD WPVTGYYDTL QHAIQELING SKSPSSMLDT
IGSAYKQNAP Q