Gene Caci_4697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4697 
Symbol 
ID8336051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5353428 
End bp5354756 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content67% 
IMG OID644957797 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003115399 
Protein GI256393835 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00367387 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATCCAA GAACGGCACG AACCCTTCGC TCGGCCGCCG CGGCCGCCAC CACCGCCGTG 
CTGGCGCTGA CCGCCGCGTG CTCCTCCTCG GCGTCCTCGT CCTCGAAGGC GCCGGCGGAT
CCGACGAAGC CGGTCACCAT CACCGTGTGG ACCGGACAGG ACGTGAACCC CGAGAAGCTG
CTGGAGGGTC TGGCCAAGCA GTTCCACGCG GCGCACCCGA ACGTCACCGT CGACATCTCG
CCCGGCGCGC CGACCACCGA CCAGCTGCTG CCGAAGCTGA TCGCCGCCTT CACCAGCGGC
ACCTACCCGG ACATCTCCTA CAACTTCGGC AGCTGGGCCA CCCAGATGCA GCTGTCCGGC
CGCACCCTGG ACATCACCGA CAAGGTCAAG GACCCGGCGG TGAAGTGGGA CGAGTTCCCG
CCGGCGGCGC GGCAGACCGC GACGCCCAAC GGGCACGTGA TCGGCTTCCC GGCGGTCGTG
GACAACCTGG CGCTGATGTA CAACAAGAAG CTGTTCCAGG CCGCGGGCCT GTCCGAGCCC
ACGAACACCT GGACCTGGGA CGACTTCCGG GCGGCGGCGA AGAAGCTCAC CGACCCGGCG
AAGAACGTCT ACGGCACCGC CTACTCCGTC TCCGGGACCG AGGACACCAC CTGGCACTTC
TGGCCGCTGC TGTGGCAGAA GGGCGGCACC GTCCTGAACT CCGACAACAG CAAGGCCGCC
TTCGACTCCG ACGCCGGTGT GGCAGCCCTG ACGTTCCTGC AGCAGATGGC TGTGACCGAC
AAGTCCGTGT ACCTATCGCA GGACGACCAG AAGTACGCCG ACCTGTTCAA GTCCGGCCTG
ATCGGCATGA TCATGAGCGG GCCGTGGCAG CTGTCGGACA ACGTCGGAGC GAACCTGGAC
TACGGCGTCA CCTACCTGCC CTCCTTCGAC GGCACGAGCC ACCAGACGAT ATCCGGCCCG
GACCTGTGGA CCCTGTACGA CCACCACGAT GCCAACCGGA CCTATTGGTC CTACCAGTTC
GCGCAGTGGC TGACCTCGGC GCAGACCGAC CCGCAGTTCA ACCTCGCCAC CGGGAACCTG
CCGCTGCGCA GCAGCGAGGC CGGCAGCCAG GAGTTCCAGG ACTACGCCAA GCAGTACCCC
GGCGCGCAGA CACTCTTCGA CAACCTGAAG AACGCCACGA CCGCCCGGCC GACCGTCCCC
GGCTACGTCG GCTTGTCCCA GGCTGTCGGA CAGGCGATCG CCAAGGTGCT GCAAGGTCAG
GGGGATCCCA AGTCGGCCTT GCAAGACGCG GCGAAGGCGG CGGACGTCGC GCTGGCGCAA
GCGGACTGA
 
Protein sequence
MHPRTARTLR SAAAAATTAV LALTAACSSS ASSSSKAPAD PTKPVTITVW TGQDVNPEKL 
LEGLAKQFHA AHPNVTVDIS PGAPTTDQLL PKLIAAFTSG TYPDISYNFG SWATQMQLSG
RTLDITDKVK DPAVKWDEFP PAARQTATPN GHVIGFPAVV DNLALMYNKK LFQAAGLSEP
TNTWTWDDFR AAAKKLTDPA KNVYGTAYSV SGTEDTTWHF WPLLWQKGGT VLNSDNSKAA
FDSDAGVAAL TFLQQMAVTD KSVYLSQDDQ KYADLFKSGL IGMIMSGPWQ LSDNVGANLD
YGVTYLPSFD GTSHQTISGP DLWTLYDHHD ANRTYWSYQF AQWLTSAQTD PQFNLATGNL
PLRSSEAGSQ EFQDYAKQYP GAQTLFDNLK NATTARPTVP GYVGLSQAVG QAIAKVLQGQ
GDPKSALQDA AKAADVALAQ AD