Gene Caci_3719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3719 
Symbol 
ID8335072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4185855 
End bp4187180 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content68% 
IMG OID644956859 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003114462 
Protein GI256392898 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.707833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0540576 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGC GAACAACGGT CCTGGCCGCC GTGATCACGG CCTGCGCCCT GGCTCTGGCC 
GCCTGCTCCG GCGGCGCCCA CGGCACCGGC GGCAGCGCCG CCGCCACCGC CACCGACCCG
GCCGGCGTGA GCGGCGACAT CACGGTCCTG ACTCACAAGA CCGACCTCGC CGCCGACGGC
ACCCTGGCCC GCTACGCCGC GGAGTTCAAC AAGATCTATC CGAACGTGCA CGTCAAGTTC
GAACCCGTCG TCGACTACGA AGGCGACGTG AAGATCCGCC TCAACAGCAG CGACTACGGC
GATGTGCTGA TGATCCCGGC CTCGGTCCCG GTCGCGGACT ACCCCAAGTT CTTCGCCCCG
CTCGGCACCC CCGCCGACCT GGACCAGAAG TACCGCTTCA TCGACCACGG CACGTACAGC
GGCCAGGTGT ACGGCATAGC CATCAACGGC AACGCCACCG GCATGGTCTA CAACAAGACG
GTGTGGCAGC AGGCCGGCGT CACCAGCTGG CCCACCACGC CCGACCAGTT CCTCGCCGAC
CTGCAGGCGA TCAAGACCAA GACCCAGGCC ACCCCGCTCT ACACGGTCTA CCACGAGGGC
TGGCCGATGA CCGCCTGGCA GTCCTACCTC GGCGAGCTCA GCTGCGATCC CAAGGCCTCC
GACGACCTCG CCACCGACGC CGCGCCCTGG GGTCCGGGCA AGGAGCTCAA CCAGATCGAC
ACGATGCTCT ACAACGTCGT CCACAATCAG CTCACTGAGA AGGACCCGAC GACGACGGCC
TGGGACGCCG CCAAGAGCGG CATGGGCTCG GGCAAGATCG GCACGCTGGC GCTGGCCTCC
TGGGCGGTCT CCCAGATGCA GCTGGCCGCC AAGACCGCCG GCGCCGACCC CGCCAGCATC
GGCTTCATGC CCTACCCGAC GCAGGTCGGC GGACACTTCT GCTCCGTGGT CTCGCCGGAC
TACATGGAGG CCGTGAGCAT CCACTCCCAG CACAAGGCCG CGGCGCGCGC CTGGGTCGAC
TGGTTTGTCG ACAAGTGCAC CTACGCTCAG GACCAGGGTC TGCTCCCGAC GTTGAAGACC
GGGGCGATGC CGCCGGAGCT GGCCGCGTAC CAGAGTGCCG GCGTGCAGTT CATCGAGCTC
GCGCAGAACG CCAACACGCA GATCTCCACC ATCGACAACG ATTCCGAGAT CGGTCTGCAG
AAGCCGGACT ACCGGCAGCA CATCGTGGAC TTGGCGCGCG GCGCCGCCGG CGGGAGTCTG
GACGGTTATT TCGCCGACCT GGACAAGAAA TGGGCGGCAG CCGTGAAGAC CGCCGCCGGG
TCCTGA
 
Protein sequence
MRKRTTVLAA VITACALALA ACSGGAHGTG GSAAATATDP AGVSGDITVL THKTDLAADG 
TLARYAAEFN KIYPNVHVKF EPVVDYEGDV KIRLNSSDYG DVLMIPASVP VADYPKFFAP
LGTPADLDQK YRFIDHGTYS GQVYGIAING NATGMVYNKT VWQQAGVTSW PTTPDQFLAD
LQAIKTKTQA TPLYTVYHEG WPMTAWQSYL GELSCDPKAS DDLATDAAPW GPGKELNQID
TMLYNVVHNQ LTEKDPTTTA WDAAKSGMGS GKIGTLALAS WAVSQMQLAA KTAGADPASI
GFMPYPTQVG GHFCSVVSPD YMEAVSIHSQ HKAAARAWVD WFVDKCTYAQ DQGLLPTLKT
GAMPPELAAY QSAGVQFIEL AQNANTQIST IDNDSEIGLQ KPDYRQHIVD LARGAAGGSL
DGYFADLDKK WAAAVKTAAG S