Gene Caci_4682 details


Gene Information

Locus tag: Caci_4682
Symbol:
ID: 8336036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5332083
End bp: 5333780
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 68%
IMG OID: 644957782
Product: extracellular solute-binding protein family 1
Protein accession: YP_003115384
Protein GI: 256393820
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
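
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 5332083
END_BP = 5333780
GENE_LENGTH_BP = 1698
PROTEIN_LENGTH_AA = 565


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1698
    print(protein_length(GENE_LENGTH_BP))  # 565
```

Both results match the Gene Length and Protein Length fields of the record.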


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00316141
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGTT CGAACGGCTT CTCCCGCCGC CAGGTGTTCA AGACCTCCGC GGCGATCGGC 
GGAGCGATAG CCGCCGCTCC GCTGCTGTCC GCCTGCGGTT CCGGCAAGGC GGCGACCAAG
GCCAGCGGGG TCGCGCCCAA GTCCGCGGTG CAGGCGGTGC TGCCGACGTA CAAGCCGAGC
AGCGCGGTCA CCGCCGACAT CCCCTCGGTC ACCGGCGCGA ACGGCGCGGC CAGCGACCCG
GCGTTCCTGT CCTACCCGGC CAGCCCGCCG AAGTCGGTGA CCGGCGCGGT CGGCAACGGC
GGCTCGTACT CGGCGGTCTC GCCGATCTGG GGCTCGGTCC CGGCGCCGGG GAACAGCTAC
TACACGGCGG TGAACCAGGC TCTGGGCGCG ACGCTCACCG ACAGCCCGTC CGACGGCACC
ACCTTCGCCA CGATGCTGGC GACCCGCTTC GCCTCCGGCA ACATCCCGGA CTGGCTGGAC
GTGCCCGGCT GGAACGTCTC CTCGATTCAG AACTTCGCCG AGGGCGTCGA CAAGTTCTTC
AAGGACCTGA CGCCCTACCT CGGCGGCGAC AAGGTGCTGG ACTACCCGAA CCTCGCCGCG
ATCCCGACCG GCGGCTGGCA GGCCGCGGTG TGGAACGGCA AGCTGTACGG CATCCCGCTG
TGGACCTCGG CGGCCAGCAT CCCCGGCGCG ATGTTCTACC GCGCGGACAT CTTCAAGGCC
GCCGGCATCG ACGCCGCCTC GGTCACCACC GCCGACACCC TCAAGGCGGT CGGCAAGCAG
GTCACCGTCC CGGCGAAGGG CCAGTACGCC TTCGAGGACC TCAGCTCCTT CCTCTACCAG
CTGTTCAACG TCCCGGCGAA CAACGGGCGG ACCGGCTGGA AGCGCGACAG CACCGGCAAG
CTGGTCAACG GCTACGAGGT GCCGGAGTTC CTGGAGATGC TGAACTTCGC CAACGGCCTG
GCCAAGGGCG GCCTGATCCA CCCCGACGCG CTGGCCGGCG ACTCCTCGAA GGCCAAGAAC
CGCTTCTGGG CCGGCAAGAC CGTGATCACC GCCGACGGCA CCGGGGCGTG GAACAAGGGC
GACGCGCAAA GCGGCGTCGC GGCCAACCCC TCCTATGAGC GCCAGGCCTT CAAGATCTTC
GCCTACGACG GCGGCAAGGC GACGATGCCC CTGTATCCGG GCGCCGGGAT GTTCTCCTAC
CTGAACAAGA AGCTCTCCGA CGCGCAGGTC AAGGAGCTGC TGCGGATCGC CAACTACCTC
GCCGCGCCGT TCGGCAGCGC CGAGTACCTG GTGTCGCGGT ACGGCAAGGA AGGCGTGGAC
TACACGATGA CCAGCGGCGC GCCGATCCTC ACCGACCAGG GCAACAAGGA CGTCACCGAC
ACCCTGGACC AGCTGGCCAA CTGCCAGTCG GTGACGTTCA ACGCCGGCTA CAACCAGATC
ACCAAGGACT ACGCCGCCTG GCAGGGCGAC ATGGTGCAGC ACGCGTACAA GCCGCTGTTC
TACGCGATGA ACATCAGCGA GCCGGCGCAG ACCGCGAAGG CGAGCACGGC GCTGGAGGCG
GTCATCACCG ACGTGCGCAT GGGCCGCAAG AGCGTGGCGG ACTTCCAGTC GGCGCTGAGC
ACTTGGCAGA ACGCCGGCGG CAACCAGCTG CGGGACTTCT ACGACGGCAT CGCCAAGCAG
TACGGCACGG GGAACTGA
 
Protein sequence
MTSSNGFSRR QVFKTSAAIG GAIAAAPLLS ACGSGKAATK ASGVAPKSAV QAVLPTYKPS 
SAVTADIPSV TGANGAASDP AFLSYPASPP KSVTGAVGNG GSYSAVSPIW GSVPAPGNSY
YTAVNQALGA TLTDSPSDGT TFATMLATRF ASGNIPDWLD VPGWNVSSIQ NFAEGVDKFF
KDLTPYLGGD KVLDYPNLAA IPTGGWQAAV WNGKLYGIPL WTSAASIPGA MFYRADIFKA
AGIDAASVTT ADTLKAVGKQ VTVPAKGQYA FEDLSSFLYQ LFNVPANNGR TGWKRDSTGK
LVNGYEVPEF LEMLNFANGL AKGGLIHPDA LAGDSSKAKN RFWAGKTVIT ADGTGAWNKG
DAQSGVAANP SYERQAFKIF AYDGGKATMP LYPGAGMFSY LNKKLSDAQV KELLRIANYL
AAPFGSAEYL VSRYGKEGVD YTMTSGAPIL TDQGNKDVTD TLDQLANCQS VTFNAGYNQI
TKDYAAWQGD MVQHAYKPLF YAMNISEPAQ TAKASTALEA VITDVRMGRK SVADFQSALS
TWQNAGGNQL RDFYDGIAKQ YGTGN
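
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be joined back into a single string and its GC percentage computed is given below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch does not claim to reproduce how the database derived its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = (
        "ATGACGAGTT CGAACGGCTT CTCCCGCCGC "
        "CAGGTGTTCA AGACCTCCGC GGCGATCGGC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), seq[:3])  # 60 ATG
    print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_percent` gives a value to compare with the GC content field.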