Gene Caci_7201 details


Gene Information

Locus tag: Caci_7201
Symbol:
ID: 8338569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 8370253
End bp: 8371596
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 69%
IMG OID: 644960282
Product: extracellular solute-binding protein family 1
Protein accession: YP_003117871
Protein GI: 256396307
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
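
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the gene is an untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 8370253
end_bp = 8371596

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed gene length of 1344 bp and protein length of 447 aa.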


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATAC GCACTTCGGC CGCGCTGGCC GTGGGAAGTG TCCTCGTCCT CGGCGCCACG 
GCCTGCTCCA GCGCGGCGTC CAAGTCGTCC GGGCCCGCCG GCTCCGGCTC CGGCACGCAG
CCGGCCGCCG TGGCCGGGGC GGGCTCCGGC GCCGGCAAGA CACTGACCGT CTGGTACATG
GACGGCGACC TGTCCGACGC CGCCACCAAG GCCATCAACG ACAAGTTCAC CGCCGCCACC
GGCGCCCAGG TGAAGGTCGC GATCCAGCAG TGGGACGGGA TCAACACCAA GATCGCCACC
GCGCTGGCGC AGGACAACCC GCCGGACGTC ATCGAGATCG GCAACACCGA CGTCCCGCTG
TTCGCCGCCA GCTCCGGCCT GACCGACATC ACCTCGGCAC TCCCGCAGCT GCAAGCCGAC
CAGCACTGGC TCCCCGGCCT CGCGGGCCCT GCCACCGTCG ACGGCCACAA CTACGGCGCG
CCCCTGTTCG CCGGCAACCG CGCGGTCATC TACAACAAGA AGATCTGGGC CGCCGCCGGC
ATCACCGCCG CGCCGACCAC CTTCGCGCAG CTGACCGCGG ACCTGGACGC CATCAAGGCG
AAGAACACCG CGCCGGACTT CTCAGCCTTC TACTTCCCCG GCCGCTACTG GTACGGCGCG
ATGCAGTTCG TGTGGGACGC CGGAGGCCAG CTCGCCAGCC AGGCCGGCGG CAAGTGGACC
GGCCAGCTGG AGTCGCCGCA GGCCCAGCAG GGACTCCAGG CCTGGAAGAC CTTCATCGGC
AAGTACTCCG CCGGCGCCTC CCAGGACGTC GACACCACCG CGCCGGACTT CAACACCCTG
TTCGCGCAGG GCAAGACCGC GACCATCCTG AACTCGAACG TCAACAAGAT CCTGAAGGTC
GACCCTTCCC TGACCGACCA GATCGGCACC TTCCCCTTCC CCAGCGCCAC CGACGGCAAG
ACCCAGCCGG TCTTCCTCGG CGGCTCCGAC CTCGCGGTCG CGGCCAAGAG CAAGAACCAG
GCGCTGGCCC TGGCCTACCT GAAGGCCGCC ACGGACCCCG CGGTCCAGGC CTCCGCGATC
GTCGGCATCG ACCACTGGAT CCCCGCCTCC ACCGAGGTGA TCGACCAGAC CATCAGCAGC
CTGCCGGACG TCTCGAAGGC GTTCTTCACA GCGGCGAAGA CCTCCGTCGC GACCCCAGCC
GTCGCCGGCT GGGCGACCAT CGAGTCGGAC AAGTCCATCA ACGACTTCTT CGCCGACATC
GCCACCGGCC GCAAATCCCC GGCCGACGCC GCCAAAACCC TGGACGCACA CCTGAACCAG
GCACTCAACG CCCCGGCGCA ATGA
 
Protein sequence
MRIRTSAALA VGSVLVLGAT ACSSAASKSS GPAGSGSGTQ PAAVAGAGSG AGKTLTVWYM 
DGDLSDAATK AINDKFTAAT GAQVKVAIQQ WDGINTKIAT ALAQDNPPDV IEIGNTDVPL
FAASSGLTDI TSALPQLQAD QHWLPGLAGP ATVDGHNYGA PLFAGNRAVI YNKKIWAAAG
ITAAPTTFAQ LTADLDAIKA KNTAPDFSAF YFPGRYWYGA MQFVWDAGGQ LASQAGGKWT
GQLESPQAQQ GLQAWKTFIG KYSAGASQDV DTTAPDFNTL FAQGKTATIL NSNVNKILKV
DPSLTDQIGT FPFPSATDGK TQPVFLGGSD LAVAAKSKNQ ALALAYLKAA TDPAVQASAI
VGIDHWIPAS TEVIDQTISS LPDVSKAFFT AAKTSVATPA VAGWATIESD KSINDFFADI
ATGRKSPADA AKTLDAHLNQ ALNAPAQ
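
To check the protein sequence against the gene sequence, the nucleotides can be translated codon by codon. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the standard compact layout (bases in T, C, A, G order).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (60 nucleotides, 20 codons).
first_line = "ATGCGCATAC GCACTTCGGC CGCGCTGGCC GTGGGAAGTG TCCTCGTCCT CGGCGCCACG"
print(translate(first_line))  # MRIRTSAALAVGSVLVLGAT
```

The printed fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein and the listed GC content.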