Gene Caci_3587 details


Gene Information

Locus tag: Caci_3587
Symbol:
ID: 8334940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4001584
End bp: 4002909
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 68%
IMG OID: 644956730
Product: extracellular solute-binding protein family 1
Protein accession: YP_003114333
Protein GI: 256392769
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
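
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with exactly one stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4001584
END_BP = 4002909


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(n_bp, stop_codons=1):
    """Residue count for a CDS of n_bp bases.

    Assumes the CDS is a whole number of codons and ends with
    `stop_codons` stop codons that are not counted as residues.
    """
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1326 441
```

The results match the Gene Length (1326 bp) and Protein Length (441 aa) fields.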


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.935518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.988969
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGAGGA ATACGCGCAA CGTGATAGCC GTCGTGCTCG GTTTGAGCGC GGCACTGGCC 
GCCGCCGGCT GTGCCGGCGG CAGCAGCAAC GGCGGAGGCG GCGGCAGCGG CAGCAGCGGC
GGCGGCGGTT CGCAAGGCGG CGGGCAGGTC AAGCTGACCG TCTGGCAGAA CTCGACCACC
GGTCCCGGTC AGCAGTTCTT CCTGACCGCC GCGAAGGACT ACCACGCGCA GCACCCGAAC
GTCACGATCA ACGTCCAGAC GATCCAGAAC GAGGATCTCG ACGGCAAGCT GCAGACCGCG
CTCAACGCGA ACGCCGCGCC GGACATCTTC CTGCAGCGCG GCGGCGGCAA GATGCAGGCG
ATGGTCACCG CCGGCCAGAT CCAGGAGCTG GATCTCTCGG CCACCGACAA GGCGAACGTC
GGCACGGCGG CGCTGGCCGC CGAATCGCTC GACGGCAAGG TCTACGCGAT GCCGATGGAC
ACCCAGCCGG AGGGCTTCTA CTACAGCAAA GACCTGTTCC AGCAGGCCGG CATCACCGCG
ACGCCGACGA CGATCGACGA GCTCGAGGCC GACGTGGCCA AGCTCAAGGC GATCAACGTC
TCGCCGATCG CGGTCGGCGC CAAGGACGCC TGGCCGGCGG CGCACTGGTA CTACAACTTC
GCCCTGCGCG AGTGCAGCCA GGCCACGATG ACCAGCACCG CCAAGTCGCT GAAGTTCAGC
GATCCCTGTT GGACCAAGGC CGGCGACGAC GTGGCCGCGT TCCTGAAGTC CGATCCCTTC
CAGAAGGGCT TCCTGACCAC CTCGGCGCAG CAGGGCGCGG GGTCCTCGGC GGGCCTGCTC
GCGAACCACA AGGCGGGCAT GGAACTCATG GGCAACTGGG ACCCCGGGGT GATCGCGAGC
CTGACCCCGG ACCAGAAGCC GCTGCCGGAC CTGGGCTGGT TCCCCTTCCC CGCCGTCGCC
GGCGGCCAGG GCGACCCGAC CGCCATCATG GGCGGCGCCG ACGGCTACTC GGTGTCGAAG
AAGGCGCCCA AGGAAGCCTT CCAGTTCCTG GAATTCCTGG CGACCAAGGA AGAGCAGGAG
GCCTACGCCA AGGCCTTCGA CGCGATCCCG GTCAACCCGG CGGCTCAGGA CGCCGTGACC
GACCCGTACA ACGTCTCGAC GCTGCAGGCG TTCAGCAAGG CCGCCTATGC GATGCAATAC
CTTGACACCC AGTTCGGCCA GAACGTCGGC AACGCGATGA ACACGGCCGT GGTGAGCCTG
ATGGCCGGCA AGGGCAGCGC GGCGAACATC GTCTCGGCGA CCAACAGCGC CGCCGCGAGA
GGCTGA
 
Protein sequence
MTRNTRNVIA VVLGLSAALA AAGCAGGSSN GGGGGSGSSG GGGSQGGGQV KLTVWQNSTT 
GPGQQFFLTA AKDYHAQHPN VTINVQTIQN EDLDGKLQTA LNANAAPDIF LQRGGGKMQA
MVTAGQIQEL DLSATDKANV GTAALAAESL DGKVYAMPMD TQPEGFYYSK DLFQQAGITA
TPTTIDELEA DVAKLKAINV SPIAVGAKDA WPAAHWYYNF ALRECSQATM TSTAKSLKFS
DPCWTKAGDD VAAFLKSDPF QKGFLTTSAQ QGAGSSAGLL ANHKAGMELM GNWDPGVIAS
LTPDQKPLPD LGWFPFPAVA GGQGDPTAIM GGADGYSVSK KAPKEAFQFL EFLATKEEQE
AYAKAFDAIP VNPAAQDAVT DPYNVSTLQA FSKAAYAMQY LDTQFGQNVG NAMNTAVVSL
MAGKGSAANI VSATNSAAAR G
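
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the example uses only the first row of the gene sequence. Pasting the full gene sequence in its place gives a figure that can be compared against the GC content value in the table.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence only, used here for illustration.
first_row = "GTGACGAGGA ATACGCGCAA CGTGATAGCC GTCGTGCTCG GTTTGAGCGC GGCACTGGCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```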