Gene Caci_1597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1597 
Symbol 
ID8332940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1810685 
End bp1812055 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content68% 
IMG OID644954747 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003112359 
Protein GI256390795 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGAT CAACATCTTT GCAATCAGTG CGGTGTCGCC GCGCTCTCGC CGCGCTCGCC 
GCCGTACCGC TCGCCGTGGG CCTGGCCGCC TGCGGCAGCA GCAGCGGGAG CGGTTCGGGG
TCGTCGCAGA CGCTGACGAT CGCGATGTGG ACGAACCCGG CCGCCGTCGC CCAGACCCAG
AAGGTCGACG CCGAGTTCGA GAAGGCGCAC CCCGGCGTCA AGGTCAAGCT CCAGACCGCG
CCGACCGCCG CGAACGCGTG GCCGACGCTG TGGCAGAGCC TGGTCTCGGC CAAGAGCGTG
GACGTGCTCG CGCAGTTCCC GCCGACCCCG CACGCCTACC CGCCGGCCTC GACCGGCATC
GTCCCGCAGG GCACGCCCGC GCTGATCCAG TCCGGGCAGT TCGTCGACCT GACGAACCAG
CCGTTCATGA AGCGCTTCGA CCCGGCGGCG CAGAAGTACG CCATGGGCTA CAACAACGGC
ACGTACGGCG TGATGACGGC CGAGTACGTC AACAACTCCG GCATGTTCTA CAAGAAGGAC
CTGCTGACCA AGTACGGCCT GTCGGTCCCG ACGACCTACA GCGAGTTCAT CAAGGACCTG
GACGTCCTGA AGTCCAAGGG CGTCACGCCG CTGTACGTGG CCGGCAAGGA CGGCTACCAG
AGCATCGCGT GGTTCGGCAT CATGAACCAG CTGCTGATGC AGGGCAAGCA GAGCACTGAC
GCCCCGGCGG TGTGGGAGAA GCGCGCGCAG GACTTCTGGG ACGGTACCCA GAGCTGGACC
GACCCGGTCT ACGCCGACAC CGCGAACCGG TACGAGAAGG TCCTGTCCTA CATGGAGCCG
AACGCCGCGG GTGTGCCCGC GCAGTCGGCG CCGGGGGTCT GGGCCGCCAA GACCAACGAC
TTCCCGTTCT TCTTCGACGG CTCCTATGAC GGCAACACCA TCGCGCAGTC CAACCCGAGC
CTGAACTTCG GCTTCATGGC GCTGCCCGGC GCCGACGACG CCGCGGCGAA CCGCGCCGTC
CTGGCGCCGG ACCTGTCCTG GGTGGTCCCG ACCTGGTCCA AGCACCAGCA GCTGGCGATG
GAGTGGCTGG ACATGTTCAC CTCGCCGGAC AACTACGCGG CGTGGCTGAA GGCCACTGGA
TCGATCTCCA CCGAGCCGGC GGTGCCCACG CCCTCGCTGT CCTGGACCGA CTGGCTGTCC
ACGCACGCCT CGCAGGGCTT CGTCAACGCC GAGCAGCCCT GGACCTCGAC CAAGTTCCCC
ACGGCGGCCG GCGACCAGGA CCGGACCAAG ATGCAGCCGT TCGGCTCGCA GACCCCGGCC
CAGGCACTCA AGGAATCGGC GGACGCGTAC AAGTCGGCAG CGGGGCACTG A
 
Protein sequence
MARSTSLQSV RCRRALAALA AVPLAVGLAA CGSSSGSGSG SSQTLTIAMW TNPAAVAQTQ 
KVDAEFEKAH PGVKVKLQTA PTAANAWPTL WQSLVSAKSV DVLAQFPPTP HAYPPASTGI
VPQGTPALIQ SGQFVDLTNQ PFMKRFDPAA QKYAMGYNNG TYGVMTAEYV NNSGMFYKKD
LLTKYGLSVP TTYSEFIKDL DVLKSKGVTP LYVAGKDGYQ SIAWFGIMNQ LLMQGKQSTD
APAVWEKRAQ DFWDGTQSWT DPVYADTANR YEKVLSYMEP NAAGVPAQSA PGVWAAKTND
FPFFFDGSYD GNTIAQSNPS LNFGFMALPG ADDAAANRAV LAPDLSWVVP TWSKHQQLAM
EWLDMFTSPD NYAAWLKATG SISTEPAVPT PSLSWTDWLS THASQGFVNA EQPWTSTKFP
TAAGDQDRTK MQPFGSQTPA QALKESADAY KSAAGH