Gene Caci_4688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4688 
Symbol 
ID8336042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5342253 
End bp5343509 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content67% 
IMG OID644957788 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003115390 
Protein GI256393826 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.733472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATAC GCACAGTGGT GGCGGCGTCG GCGGTTCTCG CGCTGGCCAC TTCGACGGCC 
GCTTGTTCGA GCAGCGCGAG TTCCTCGGCG AGCGCCGGCA AGGTTTCCCT CAGCTACGGG
GTCTGGGACG CGACGCAGGT CCCGGCCATG CAGAAGATCA TCGCAGCCTT CGAGGCACAG
AACCCGACCA TCACGGTCAC CATCCAGCAG ACGCCGTGGG CGGACTACTG GACCAAGCTC
CAGGCGGCCG CCTCCGGCGG TTCGGCGCCC GACGTCTTCT GGATGAACGG CCCGAACTTC
CAGCTCTACG CCGCCAACCA CGTCCTGCGG CCGCTGACCG ACCTGCACCC GGACACCTCG
GTCTACCCCC CGGCGCTGGC GCAGCTCTAC CAGTACAAGG GCGTGCAGTA CGGGCTGCCG
AAGGACTTCG ACACCGTGGG GCTCTGGTAC AACAAGGCCA TCTTCGACGC CGCGGGCGTC
GCCTACCCCA CCACCGCCTG GACCTGGGCT GATTTCCAAG CGGCGGCGAA AAAACTCACC
GACCCCGCCA AGGGCGTCTA CGGTGTCGGC GCCAACCTGG AAGGCCAGGA GAACTACTAC
GACACGATCT ACCAGGCCGG CGGCTACGTC ATCTCCCCCG ACGGCAAGAA GTCCGGATAC
GCCGATCCGG CCGGTATCGC CGGGCTGAAG TTCTGGACCG ATCTGGTCGC GGCCAAGGAG
TCGCCGAGCC TGAAGCAGAT GACGGACACC GCGCCGCTGA ACCTGTTCGA GTCCGGCAAG
CTCGCCATGT ACTGGGGCGG GTCGTGGGAC GCGAAGGCGT TCGCCGCGAA CGACTCCACC
AAGACCGCCG TCGACGTGAC CGCGCTGCCA GCCGGGGTGA AGAAGGCGAC GGTCATCCAC
GGCCTGGCCA ACGTCGTCTT CACGCACACC TCGCACCCGG CGCAGGCGGA GAAGTTCGCC
GCGTTCCTCG GCTCGCAGGC GGCGGCGCAG ATCGAGGCGG ACACCGGGAC CGTGATCCCG
GCGTACAACG GCACACAGCA GAGCTGGGTC AAGGCATACC CGCAGTACCA CCTCCAGTCC
TTCTTGGATC AGCTTCCTGA CGCGGTCCCG TACCCGATCT CCAAGGACAC CGCGGCCTGG
AACACCCTGG AGACGAACGT CCTGACCAAG GCCTGGGACG GCAGCGAACC GATCGACAAG
GCCGCCGGCG ACCTCGCCAC GCAGATGAAC GCGGCGCTGG CCAAGGAGGG TCCGTGA
 
Protein sequence
MGIRTVVAAS AVLALATSTA ACSSSASSSA SAGKVSLSYG VWDATQVPAM QKIIAAFEAQ 
NPTITVTIQQ TPWADYWTKL QAAASGGSAP DVFWMNGPNF QLYAANHVLR PLTDLHPDTS
VYPPALAQLY QYKGVQYGLP KDFDTVGLWY NKAIFDAAGV AYPTTAWTWA DFQAAAKKLT
DPAKGVYGVG ANLEGQENYY DTIYQAGGYV ISPDGKKSGY ADPAGIAGLK FWTDLVAAKE
SPSLKQMTDT APLNLFESGK LAMYWGGSWD AKAFAANDST KTAVDVTALP AGVKKATVIH
GLANVVFTHT SHPAQAEKFA AFLGSQAAAQ IEADTGTVIP AYNGTQQSWV KAYPQYHLQS
FLDQLPDAVP YPISKDTAAW NTLETNVLTK AWDGSEPIDK AAGDLATQMN AALAKEGP